Data Science: Wrangling
About this Course
In this course, part of our Professional Certificate Program in Data Science,we cover several standard steps of the data wrangling process like importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data scientist will likely face them all at some point. Very rarely is data easily accessible in a data science project. It's more likely for the data to be in a file, a database, or extracted from documents such as web pages, tweets, or PDFs. In these cases, the first step is to import the data into R and tidy the data, using the tidyverse package. The steps that convert data from its raw form to the tidy form is called data wrangling. This process is a critical step for any data scientist. Knowing how to wrangle and clean data will enable you to make critical insights that would otherwise be hidden.Created by: Harvard University
Level: Introductory

Related Online Courses
This advanced Excel course builds on the teachings of Course 1: Core Foundations and Course 2: Data Management. Designed for experienced Excel users, master the techniques needed to draw insights... more
Perhaps the most popular data science methodologies come from machine learning. What distinguishes machine learning from other computer guided decision processes is that it builds prediction... more
El futuro pertenece a la ciencia de datos y a quienes la entiendan. Al igual que el petróleo y el gas impulsaron las economías de los siglos XX y XXI, los datos impulsan cada vez mas la i... more
Este curso te permitirá desarrollar habilidades como un tomador de decisiones con base a las siguientes competencias: análisis de elementos estadístico de la información conceptos y fun... more
In this course, you will learn how to organize your data within the Microsoft Office Excel software tool. Once organized, we will discuss data cleaning. You will learn how to identify outliers and... more