Probability and Statistics in Data Science using Python
About this Course
The job of a data scientist is to glean knowledge from complex and noisy datasets. Reasoning about uncertainty is inherent in the analysis of noisy data. Probability and Statistics provide the mathematical foundation for such reasoning. In this course, part of the Data Science MicroMasters program, you will learn the foundations of probability and statistics. You will learn both the mathematical theory, and get a hands-on experience of applying this theory to actual data using Jupyter notebooks. Concepts covered included: random variables, dependence, correlation, regression, PCA, entropy and MDL.Created by: The University of California, San Diego
Level: Advanced

Related Online Courses
Learn data literacy online using R programming What is data literacy and why is it important? In this data literacy course, you will learn how to become data literate. This will be accomplished by... more
In Data Literacy Foundations, you will learn how critical thinking is an essential data literacy skill in today’s data-driven world. You’ll begin by considering how you use data every day, dis... more
This course, presented by the IMF's Statistics Department, teaches you how to compile timely, high quality national accounts statistics based on the system of national accounts (SNA) framework. The... more
Discover practical ways to critically appraise scientific literature, including the conduction and interpretation of systematic reviews and meta-analyses. Additionally, you will learn how to... more
In data science, data is called "big" if it cannot fit into the memory of a single standard laptop or workstation. The analysis of big datasets requires using a cluster of tens, hundreds or... more