Data Management with Databricks: Big Data with Delta Lakes

About this Course

In this 2-hour guided project, "Data Management with Databricks: Big Data with Delta Lakes," you will work alongside the instructor to achieve the following objectives:

1. Create Delta Tables in Databricks and write data to them. Gain hands-on experience in setting up and managing Delta Tables, a data storage format optimized for performance and reliability.
2. Transform a Delta Table using Python and query it with SQL to build a dashboard. Learn how to apply Python-based transformations to Delta Tables and use SQL queries to extract the insights needed for a Supply Chain dashboard.
3. Use Delta Lake's merge operation and version control capabilities to update Delta Tables efficiently. Explore how the merge operation performs upserts and other data updates, and learn how to use Delta Lake's built-in version control to track and access previous versions of Delta Tables as needed.

Working through a real-world business scenario, you will use Databricks to build an end-to-end data pipeline that integrates various JSON data files and applies transformations, ultimately producing analysis-ready data and insights.

This intermediate-level guided project is designed for data engineers who build data pipelines for their companies using Databricks. To be successful in this guided project, you need prior experience writing Python scripts, including importing libraries, setting up variables, manipulating data frames, and using functions. You also need to be familiar with writing SQL queries that aggregate, filter, and join tables.

Created by: Coursera Project Network


Related Online Courses

Did you know that personalized product recommendations can increase sales by up to 20%? As consumers, we all appreciate suggestions tailored to our tastes, and as AI engineers, we can harness data...
This is the final course of the Exam Prep AZ-400: Microsoft DevOps Engineer Expert Specialization. This course focuses on understanding how to analyze metrics from instrumentation to gain insights...
The concepts of large language models (LLMs) took the world by storm in November 2022, positioning Artificial Intelligence as one of the most invested-in and promising technology sectors. This...
Gain a foundational understanding of key terms and concepts in public administration and public policy while learning foundational programming techniques using the R programming language. You will...
Unlock the power of ChatGPT's free AI tools to excel with powerful and intuitive techniques that can be applied across any professional domain, from digital marketing and data analytics to project...
