University of Maryland Classifieds>University of Maryland Online Courses>Data Cleaning in Snowflake: Techniques to Clean Messy Data

Data Cleaning in Snowflake: Techniques to Clean Messy Data

About this Course

in 2006, the British mathematician Clive Humby coined the phrase \"Data is the new Oil\". This analogy has been proven correct as data powers entire industries nowadays but if left unrefined, is effectively worthless. This 2.5 hours-long guided project is designed for business analysts & data engineers eager to learn how to Clean Messy Data in Snowflake Data Platform. By the end of the project, you will -Be able to identify common data quality issues then use SQL String functions to remove unwanted characters and split rows into multiple columns. -Extract dates from Text fields then use SQL date functions for comparisons and calculations. -Identify and correct missing and duplicated data then answer business questions using SQL statements. To achieve these objectives, we will work on a real example from the field, you will play the role of a Data Analyst in the marketing department, who has been tasked with answering a business question, but the customer data they have received presents several data quality challenges. Note: To be successful in this project you need to have Snowflake beginner knowledge such as Creating a trial account, Databases, Tables, and Virtual Warehouses. If you are not familiar with Snowflake and want to learn the basics, start with my previous Guided Project: Snowflake for Beginners: Make your First Snowsight Dashboard which will give you basic knowledge about Snowflake and will teach you how to create your trial account.

Created by: Coursera Project Network


Related Online Courses

Kursus ini memperkenalkan Vertex AI Studio, sebuah alat untuk membuat prototipe dan menyesuaikan model AI generatif. Melalui materi yang imersif, demo yang menarik, dan lab interaktif, Anda akan... more
In this 1-hour long project, you will learn how to clean and preprocess data for language classification. You will learn some theory behind Naive Bayes Modeling, and the impact that class imbalance... more
In this course, we\'ll look at the object oriented patterns available in PHP. You\'ll learn how to connect to a MySQL using the Portable Data Objects (PDO) library and issue SQL commands in the the... more
This course will educate you in the characteristics and properties of natural gas, preparing you with the ability to summarize gas system components and new pipeline technologies. You will be... more
This course will help prepare students for developing code that can process large amounts of data in parallel on Graphics Processing Units (GPUs). It will learn on how to implement software that... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL