Data Science: Wrangling

About this Course

In this course, part of our Professional Certificate Program in Data Science,we cover several standard steps of the data wrangling process like importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data scientist will likely face them all at some point. Very rarely is data easily accessible in a data science project. It's more likely for the data to be in a file, a database, or extracted from documents such as web pages, tweets, or PDFs. In these cases, the first step is to import the data into R and tidy the data, using the tidyverse package. The steps that convert data from its raw form to the tidy form is called data wrangling. This process is a critical step for any data scientist. Knowing how to wrangle and clean data will enable you to make critical insights that would otherwise be hidden.

Created by: Harvard University

Level: Introductory


Related Online Courses

This course discusses properties and applications of random variables. When you’re done, you’ll have enough firepower to undertake a wide variety of modeling and analysis problems; and you’ll be we... more
What makes a good business decision? How can we combine effective data analytics and feed robust foresight and scenario planning processes? We need to rethink the organization, and see it as... more
Through a series of fun and engaging hands-on activities in Microsoft Excel, this module aims to equip the learner with the ability to thoughtfully apply computational tools when solving complex... more
This course, presented by the IMF's Statistics Department, teaches you how to compile timely, high quality national accounts statistics based on the system of national accounts (SNA) framework. The... more
This course allows you to develop skills of a decision maker leader based on the following competencies: analysis of statistical elements of information concepts and statistical foundations for... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL