Data Engineering Capstone Project

About this Course

In this Capstone you’ll demonstrate your ability to perform as a data engineer. Your mission is to design, implement, and manage a complete data and analytics platform consisting of relational and non-relational databases, data warehouses, data pipelines, big data processing engines, and Business Intelligence (BI) tools. The project requires you to apply and sharpen the skills and knowledge you developed in the courses of the IBM Data Engineering Professional Certificate. You will use multiple tools and technologies to design databases, collect data from multiple sources, extract, transform, and load data into a data warehouse, and create analytic reports and visualizations with a cloud-based BI tool. You will also implement predictive analytics and machine learning models using big data tools and techniques. This Capstone requires a significant amount of hands-on lab work throughout. You’ll exhibit your knowledge of and proficiency with Python, Bash scripts, SQL, NoSQL, RDBMSes, ETL, MySQL, PostgreSQL, Db2, MongoDB, Apache Airflow, Apache Spark, and Cognos Analytics. Upon successfully completing this Capstone, you should have the confidence and portfolio to take on real-world data engineering projects and to showcase your ability to work as an entry-level data engineer.

Created by: IBM

Level: Introductory


Related Online Courses

This course is one of the 5 courses of an introductory business information systems series, designed to introduce you to the amazing world of Information Technology. The series of courses is...
Do you want to solve realistic engineering mathematics problems? That is not possible without computer software solutions such as those MATLAB offers. In this course we show how you...
In this course you will work on your very own project where you will go through the process of designing and prototyping your very own app. Created by: Curtin University. Level: Introductory
Real-time operating systems (RTOS) play an important role in any embedded system, enabling users to control the time-critical functions required to be handled within specific timeframes for the...
As Cloud Computing shapes businesses of all sizes, it is vital to understand the technologies behind cloud infrastructure, both public and private. This course, part of the Cloud Computing...
