Statistical Learning
About this Course
This is an introductory-level course in supervised learning, with a focus on regression and classification methods. The syllabus includes: linear and polynomial regression, logistic regression and linear discriminant analysis; cross-validation and the bootstrap, model selection and regularization methods (ridge and lasso); nonlinear models, splines and generalized additive models; tree-based methods, random forests and boosting; support-vector machines; neural networks and deep learning; survival models; multiple testing. Some unsupervised learning methods are discussed: principal components and clustering (k-means and hierarchical). This is not a math-heavy class, so we try and describe the methods without heavy reliance on formulas and complex mathematics. We focus on what we consider to be the important elements of modern data science. Computing is done in R. There are lectures devoted to R, giving tutorials from the ground up, and progressing with more detailed sessions that implement the techniques in each chapter. The lectures cover all the material in An Introduction to Statistical Learning, with Applications in R (second addition) by James, Witten, Hastie and Tibshirani (Springer, 2021). The pdf for this book is available for free on the book website.Created by: Stanford University
Level: Introductory
Related Online Courses
What makes a good business decision? How can we combine effective data analytics and feed robust foresight and scenario planning processes? We need to rethink the organization, and see it as... more
Hoy parece que se esta hablando de "Big Data" por todas partes. Pero ¿qué tan importante o relevante es esto y qué oportunidades ofrece para las organizaciones y nuestros países? ¡Inscríbete al MO... more
Big data is transforming the health care industry relative to improving quality of care and reducing costs--key objectives for most organizations. Employers are desperately searching for... more
Power BI is a robust business analytics and visualization tool from Microsoft that helps data professionals bring their data to life and tell more meaningful stores. This four-week course is a... more
The first in our Professional Certificate Program in Data Science, this course will introduce you to the basics of R programming. You can better retain R when you learn it to solve a specific... more