Data Analysis with R

About this Course

The R programming language is purpose-built for data analysis. R is the key that opens the door between the problems that you want to solve with data and the answers you need to meet your objectives. This course starts with a question and then walks you through the process of answering it through data. You will first learn important techniques for preparing (or wrangling) your data for analysis. You will then learn how to gain a better understanding of your data through exploratory data analysis, helping you to summarize your data and identify relevant relationships between variables that can lead to insights. Once your data is ready to analyze, you will learn how to develop your model and evaluate and tune its performance. By following this process, you can be sure that your data analysis performs to the standards that you have set, and you can have confidence in the results. You will build hands-on experience by playing the role of a data analyst who is analyzing airline departure and arrival data to predict flight delays. Using an Airline Reporting Carrier On-Time Performance Dataset, you will practice reading data files, preprocessing data, creating models, improving models, and evaluating them to ultimately choose the best model. Watch the videos, work through the labs, and add to your portfolio. Good luck! Note: The pre-requisite for this course is basic R programming skills. For example, ensure that you have completed a course like Introduction to R Programming for Data Science from IBM.

Created by: IBM


Related Online Courses

This course will serve as a \"deep dive\" into the concepts and trends related to diversity and inclusion. One of the barriers to sustained organizational effectiveness in this area has been... more
This Specialization will introduce non-native speakers of English to methods for developing English language and communication skills for the workplace, doing business, cross-cultural... more
By the end of this project, you will learn how to use Canva to create a simple 3D effect for a customised cover image to enhance your Linkedin profile. Canva is a graphic design platform, used to... more
This is a self-paced lab that takes place in the Google Cloud console. This lab shows you how to create a Google Cloud Dataproc cluster, run a simple Apache Spark job in the cluster, then modify... more
This is a self-paced lab that takes place in the Google Cloud console. In addition to batch pipelines, Data Fusion also allows you to create real-time pipelines, that can process events as they are... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL