Mining Massive Datasets

About this Course

The course is based on the text Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, and Jeff Ullman, who by coincidence are also the instructors for the course. The book is published by Cambridge University Press, but by arrangement with the publisher, you can download a free copy. The material in this online course closely matches the content of the Stanford course CS246. The major topics covered include MapReduce systems and algorithms, locality-sensitive hashing, algorithms for data streams, PageRank and Web-link analysis, frequent itemset analysis, clustering, computational advertising, recommendation systems, social-network graphs, dimensionality reduction, and machine-learning algorithms.

Created by: Stanford University

Level: Advanced


