Data Science: Wrangling
About this Course
In this course, part of our Professional Certificate Program in Data Science, we cover several standard steps of the data wrangling process, such as importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data scientist will likely face them all at some point. Very rarely is data easily accessible in a data science project. It's more likely for the data to be in a file, a database, or extracted from documents such as web pages, tweets, or PDFs. In these cases, the first step is to import the data into R and tidy the data, using the tidyverse package. The steps that convert data from its raw form to the tidy form are called data wrangling. This process is a critical step for any data scientist. Knowing how to wrangle and clean data will enable you to uncover critical insights that would otherwise be hidden.
Created by: Harvard University
Level: Introductory

Related Online Courses
This course is aimed at Tableau users who have developed a solid knowledge of the software in the basic and intermediate courses. In the preceding modules, we learned how to an...
This data analysis (business intelligence: BI) and statistics course is for anyone who wants to improve their data analysis skills. Are you looking for a smart way to vis...
This proctored examination assesses all concepts, methods and techniques introduced across the following four courses within the LSE MicroBachelors program in Statistics Fundamentals: Statistics 1:...
Basics of Bayesian Data Analysis Using R is part one of the Bayesian Data Analysis in R professional certificate. The Bayesian approach is becoming increasingly popular in all fields of data analysis,...
The job of a data scientist is to glean knowledge from complex and noisy datasets. Reasoning about uncertainty is inherent in the analysis of noisy data. Probability and Statistics provide the...