Statistical Learning
About this Course
This is an introductory-level course in supervised learning, with a focus on regression and classification methods. The syllabus includes: linear and polynomial regression, logistic regression and linear discriminant analysis; cross-validation and the bootstrap, model selection and regularization methods (ridge and lasso); nonlinear models, splines and generalized additive models; tree-based methods, random forests and boosting; support-vector machines; neural networks and deep learning; survival models; multiple testing. Some unsupervised learning methods are discussed: principal components and clustering (k-means and hierarchical). This is not a math-heavy class, so we try and describe the methods without heavy reliance on formulas and complex mathematics. We focus on what we consider to be the important elements of modern data science. Computing is done in R. There are lectures devoted to R, giving tutorials from the ground up, and progressing with more detailed sessions that implement the techniques in each chapter. The lectures cover all the material in An Introduction to Statistical Learning, with Applications in R (second addition) by James, Witten, Hastie and Tibshirani (Springer, 2021). The pdf for this book is available for free on the book website.Created by: Stanford University
Level: Introductory
Related Online Courses
Please Note: Learners who successfully complete this IBM course can earn a skill badge — a detailed, verifiable and digital credential that profiles the knowledge and skills you’ve acquired in thi... more
The R language plays a critical role in data analysis and a common programming language when working in the field of data science & analytics. This course will introduce you to R language... more
In autonomous vehicles such as self-driving cars, we find a number of interesting and challenging decision-making problems. Starting from the autonomous driving of a single vehicle, to the... more
This course provides an introduction to basic probability concepts. Our emphasis is on applications in science and engineering, with the goal of enhancing modeling and analysis skills for a variety... more
Basics of Bayesian Data Analysis Using R is part one of the Bayesian Data Analysis in R professional certificate. Bayesian approach is becoming increasingly popular in all fields of data analysis,... more