Building ETL and Data Pipelines with Bash, Airflow and Kafka

About this Course

Well-designed and automated data pipelines and ETL processes are the foundation of a successful Business Intelligence platform. Defining your data workflows, pipelines and processes early in the platform design ensures the right raw data is collected, transformed and loaded into desired storage layers and available for processing and analysis as and when required. This course is designed to provide you the critical knowledge and skills needed by Data Engineers and Data Warehousing specialists to create and manage ETL, ELT, and data pipeline processes. Upon completing this course you’ll gain a solid understanding of Extract, Transform, Load (ETL), and Extract, Load, and Transform (ELT) processes; practice extracting data, transforming data, and loading transformed data into a staging area; create an ETL data pipeline using Bash shell-scripting, build a batch ETL workflow using Apache Airflow and build a streaming data pipeline using Apache Kafka. You’ll gain hands-on experience with practice labs throughout the course and work on a real-world inspired project to build data pipelines using several technologies that can be added to your portfolio and demonstrate your ability to perform as a Data Engineer. This course pre-requisites that you have prior skills to work with datasets, SQL, relational databases, and Bash shell scripts.

Created by: IBM

Level: Introductory


Related Online Courses

Automation requires you to have both, a deep understanding of business logic and the ability to translate it into an automated program using the syntax and semantics of a programming language.... more
Most data science projects fail. There are various reasons why, but one of the primary reasons is the challenge of deployment. One piece to the deployment puzzle is understanding how to automate... more
Cloud Computing is revolutionizing today’s world. It affects the way we communicate, do business, interact with everyday things and one another. This course discusses the fundamental theoretical a... more
In this course, we start with some services where the training model and raw inference is handled for you by Amazon. We'll cover services which do the heavy lifting of computer vision, data... more
The project provides you with a real-life challenge that requires you to flex and firm your cloud native application development muscle. You will start with a problem situation. To address this... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL