Data Science: Wrangling
About this Course
In this course, part of our Professional Certificate Program in Data Science,we cover several standard steps of the data wrangling process like importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data scientist will likely face them all at some point. Very rarely is data easily accessible in a data science project. It's more likely for the data to be in a file, a database, or extracted from documents such as web pages, tweets, or PDFs. In these cases, the first step is to import the data into R and tidy the data, using the tidyverse package. The steps that convert data from its raw form to the tidy form is called data wrangling. This process is a critical step for any data scientist. Knowing how to wrangle and clean data will enable you to make critical insights that would otherwise be hidden.Created by: Harvard University
Level: Introductory

Related Online Courses
This course provides an introduction to basic statistical concepts. We begin by walking through a library of probability distributions, where we motivate their uses and go over their fundamental... more
Can you think of an area of your life that is influenced by statistics? Many times when we think about statistics in our daily lives, we think about numerical expressions of statistics, such as the... more
Statistics 1 Part 2 is a self-paced course from LSE which aims to introduce you to and develop your understanding of essential statistical concepts, methods and techniques, emphasising the... more
A typical data analysis project may involve several parts, each including several data files and different scripts with code. Keeping all this organized can be challenging. Part of our... more
In data science, data is called "big" if it cannot fit into the memory of a single standard laptop or workstation. The analysis of big datasets requires using a cluster of tens, hundreds or... more