Fundamental Tools of Data Wrangling
About this Course
Data wrangling is a crucial step in the data analysis process, as it involves the transformation and preparation of raw data into a suitable format for analysis. The \"Fundamental Tools for Data Wrangling\" course is designed to provide participants with essential skills and knowledge to effectively manipulate, clean, and analyze data. Participants will be introduced to the fundamental tools commonly used in data wrangling, including Python, data structures, NumPy, and pandas. Through hands-on exercises and practical examples, participants will gain the necessary proficiency to work with various data formats and effectively prepare data for analysis. In this course, participants will dive into the world of data manipulation using Python as the primary programming language. They will learn about data structures, such as lists, dictionaries, and arrays, and how to use them to store and organize different types of data. Furthermore, participants will explore the power of Python packages like random and math for generating and performing mathematical operations on data. They will also be introduced to NumPy, a powerful library for numerical computing, and learn how to efficiently work with multi-dimensional arrays and matrices. A significant focus of the course will be on pandas, a versatile library for data manipulation and analysis. Participants will discover various techniques to clean, reshape, and aggregate data using pandas, enabling them to derive valuable insights from messy datasets.Created by: University of Colorado Boulder

Related Online Courses
This course will teach you how machines can be trained to understand and process human language using various NLP algorithms. You\'ll explore lexical processing, basic syntactic processing, and... more
In this course you will explore concepts and approaches involved in creating successful character designs that can be applied to video games. Following a first week delving into some foundational... more
This course provides the fundamental knowledge necessary for program managers and implementors in a hypertension control program, especially in resource-limited settings. The course is interactive... more
This is a self-paced lab that takes place in the Google Cloud console. In this lab, you\'ll learn how to use BigQuery to create machine learning models for datasets to create a model that predicts... more
Have you ever wondered what it would take for humans to travel beyond the comforts of our home planet, Earth? You are invited to join us in Space Medicine - an online experience facilitated by two... more