Importing Data in the Tidyverse
About this Course
Getting data into your statistical analysis system can be one of the most challenging parts of any data science project. Data must be imported and harmonized into a coherent format before any insights can be obtained. You will learn how to get data into R from commonly used formats and harmonizing different kinds of datasets from different sources. If you work in an organization where different departments collect data using different systems and different storage formats, then this course will provide essential tools for bringing those datasets together and making sense of the wealth of information in your organization. This course introduces the Tidyverse tools for importing data into R so that it can be prepared for analysis, visualization, and modeling. Common data formats are introduced, including delimited files, spreadsheets and relational databases, and techniques for obtaining data from the web are demonstrated, such as web scraping and web APIs. In this specialization we assume familiarity with the R programming language. If you are not yet familiar with R, we suggest you first complete R Programming before returning to complete this course.Created by: Johns Hopkins University

Related Online Courses
Artificial intelligence (AI) and machine learning (ML) have become an essential part of the toolset for many organizations. When used effectively, these tools provide actionable insights that drive... more
This specialization provides a robust introduction to the key principles and practices of Information Systems Management.\\n\\nThe first course describes how Information Systems address business... more
Operations Research (OR) is a field in which people use mathematical and engineering methods to study optimization problems in Business and Management, Economics, Computer Science, Civil... more
Amazon DynamoDB Fundamentals is a skill enhancement course designed for candidates aiming to enhance their knowledge as Database professionals. This course will help learners understand working... more
This is a Google Cloud Self-Paced Lab. In this lab, you create SQL derived and native derived tables in LookML to define new tables that do not already exist in the underlying database.Created by:... more