Introduction to Designing Data Lakes on AWS

About this Course

Designing a data lake is challenging because of the scale and growth of data. Developers need to understand best practices to avoid common mistakes that could be hard to rectify. In this course we will cover the foundations of what a Data Lake is, how to ingest and organize data into the Data Lake, and dive into the data processing that can be done to optimize performance and costs when consuming the data at scale. This course is for professionals (Architects, System Administrators and DevOps) who need to design and build an architecture for secure and scalable Data Lake components. Students will learn about the use cases for a Data Lake and, contrast that with a traditional infrastructure of servers and storage.

Created by: Amazon Web Services

Level: Intermediate


Related Online Courses

En este curso en línea el estudiante aprenderá los conceptos estadísticos básicos para realizar un análisis aplicado de datos, haciendo los cálculos en Excel y buscando la interpretación de cada u... more
While randomized controlled trials are considered to be the "gold standard" in health research, they cannot always be performed, for ethical or practical reasons. Observational studies gather... more
A majority of the world's data resides in databases. SQL (or Structured Query Language) is a powerful language for communicating with and extracting data from databases. A working knowledge of... more
En este curso de análisis e interpretación de datos te presentaremos técnicas avanzadas de importación de datos y estrategias diversas para consolidarlos y prepararlos una vez importados de for... more
The job of a data scientist is to glean knowledge from complex and noisy datasets. Reasoning about uncertainty is inherent in the analysis of noisy data. Probability and Statistics provide the... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL