Mining Massive Datasets

About this Course

The course is based on the text Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, and Jeff Ullman, who by coincidence are also the instructors for the course. The book is published by Cambridge Univ. Press, but by arrangement with the publisher, you can download a free copy Here. The material in this on-line course closely matches the content of the Stanford course CS246. The major topics covered include: MapReduce systems and algorithms, Locality-sensitive hashing, Algorithms for data streams, PageRank and Web-link analysis, Frequent itemset analysis, Clustering, Computational advertising, Recommendation systems, Social-network graphs, Dimensionality reduction, and Machine-learning algorithms.

Created by: Stanford University

Level: Advanced


Related Online Courses

Knowing how to work with Version Control Systems Git and GitHub are essential tools for today’s Software Developers, Data Scientists, and Data Engineers. They provide both a version control s... more
A tech talent shortage continues as organizations adopt higher security standards to address the ongoing risk of threats and breaches. This course builds key skills to address common security... more
In this course you will learn about the different experiences patients go through in a medical context. The patient journey explores the interaction between the patient and the healthcare providers... more
In this course you will learn how to design relational databases and model those designs for others to understand. All forms of Normal Form will be covered so your database designs conform to best... more
This course is designed for accountancy, finance and business professionals working in all organisations from small business, large corporates or financial services or who are just interested in... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL