Building ETL and Data Pipelines with Bash, Airflow and Kafka

About this Course

Well-designed, automated data pipelines and ETL processes are the foundation of a successful Business Intelligence platform. Defining your data workflows, pipelines, and processes early in the platform design ensures that the right raw data is collected, transformed, and loaded into the desired storage layers, and that it is available for processing and analysis when required. This course is designed to provide the critical knowledge and skills that Data Engineers and Data Warehousing specialists need to create and manage ETL, ELT, and data pipeline processes. Upon completing this course, you’ll have a solid understanding of Extract, Transform, Load (ETL) and Extract, Load, Transform (ELT) processes. You will practice extracting data, transforming it, and loading the transformed data into a staging area; create an ETL data pipeline using Bash shell scripting; build a batch ETL workflow using Apache Airflow; and build a streaming data pipeline using Apache Kafka. You’ll gain hands-on experience through practice labs throughout the course and work on a real-world-inspired project in which you build data pipelines using several technologies. You can add the project to your portfolio to demonstrate your ability to perform as a Data Engineer. This course requires prior experience working with datasets, SQL, relational databases, and Bash shell scripts.

Created by: IBM

Level: Introductory


Related Online Courses

Computer networks, now often rebranded as Networks of Things, are essential for supporting the Internet services we use every day, since they support the transformation... more
This course takes you through the last nine lessons of CS6750: Human-Computer Interaction as taught in the Georgia Tech Online Master of Science in Computer Science program. In this final course... more
This online course reviews the various menus of Word, examining their options in depth and focusing on the tool's features that are not intuitive, ... more
Prototyping is part art, part science. In this MOOC you will learn both UI design (user interface design) and the ergonomic criteria (grounded in cognitive psychology) that underlie it. With... more
The world and internet are full of textual information. We search for information using textual queries and read websites, books and e-mails. These are all strings from a computer science point of... more
