Fundamental Tools of Data Wrangling
About this Course
Data wrangling is a crucial step in the data analysis process: it transforms raw data and prepares it in a format suitable for analysis. The "Fundamental Tools of Data Wrangling" course is designed to give participants the essential skills and knowledge to manipulate, clean, and analyze data effectively. Participants will be introduced to the fundamental tools commonly used in data wrangling, including Python, data structures, NumPy, and pandas. Through hands-on exercises and practical examples, they will gain the proficiency needed to work with various data formats and prepare data for analysis.
In this course, participants will work on data manipulation using Python as the primary programming language. They will learn about data structures such as lists, dictionaries, and arrays, and how to use them to store and organize different types of data. Participants will also explore Python packages such as random and math for generating data and performing mathematical operations on it. They will be introduced to NumPy, a library for numerical computing, and learn how to work efficiently with multi-dimensional arrays and matrices. A significant focus of the course is pandas, a versatile library for data manipulation and analysis. Participants will learn techniques to clean, reshape, and aggregate data using pandas, enabling them to derive insights from messy datasets.
Created by: University of Colorado Boulder
Related Online Courses
Do you work in finance or interact with finance professionals? Is it necessary to speak, write or understand English in your career? Follow the authentic characters in this course as they work...
In this anatomy course, part of the Anatomy Specialization, you will explore the interactive relationships of the cardiovascular, respiratory and urinary systems, and the roles they play in your...
The aim of this course is to introduce learners to open-source R packages that can be used to perform clinical data reporting tasks. The main emphasis of the course will be the clinical data flow...
Your department is steadily inundated with projects and has tight deadlines to meet. The last thing you want is to sift through non-technical user stories, translate them into technical...
By the end of this project, you will be able to load the data from a CSV file into a pandas DataFrame, plot a graph, and set the marker type and color. You will also be able to apply labels, change font...