Mining Massive Datasets

About this Course

The course is based on the text Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, and Jeff Ullman, who by coincidence are also the instructors for the course. The book is published by Cambridge Univ. Press, but by arrangement with the publisher, you can download a free copy Here. The material in this on-line course closely matches the content of the Stanford course CS246. The major topics covered include: MapReduce systems and algorithms, Locality-sensitive hashing, Algorithms for data streams, PageRank and Web-link analysis, Frequent itemset analysis, Clustering, Computational advertising, Recommendation systems, Social-network graphs, Dimensionality reduction, and Machine-learning algorithms.

Created by: Stanford University

Level: Advanced


Related Online Courses

This Data Structures & Algorithms course extends beyond linear data structures in CS1332xI to the nonlinear and hierarchical data structures here in CS1332xII. A short Java review is presented... more
DevOps is the combination of cultural philosophies, practices, and tools that increase an organization’s ability to deliver applications and services at high velocity: evolving and improving p... more
This course is designed for accountancy, finance and business professionals working in all organisations from small business, large corporates or financial services or who are just interested in... more
Most data science projects fail. There are various reasons why, but one of the primary reasons is the challenge of deployment. One piece to the deployment puzzle is understanding how to automate... more
El curso presenta los elementos básicos y las tendencias actuales para realizar una evaluación completa del entorno construido y desarrolla las herramientas para el análisis de impacto de la po... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL