Data Science and Machine Learning Capstone Project
About this Course
Please Note: Learners who successfully complete this IBM course can earn a skill badge, a detailed, verifiable digital credential that profiles the knowledge and skills you’ve acquired in this course. Enroll to learn more, complete the course, and claim your badge!
Now that you’ve taken several courses on data science and machine learning, it’s time to put your learning to work on a data problem involving a real-life scenario. Employers care about how well you can apply your knowledge and skills to solve real-world problems, and the work you do in this capstone project will help you stand out in the job market.
In this capstone project, you’ll explore data sets from New York’s 311 system, which New Yorkers use to report non-emergency problems. Once a complaint is reported, it is assigned to one of New York’s agencies for resolution. The data related to these complaints is available in the New York City Open Dataset. An investigation of this data shows that in the last few years the number of 311 complaints coming to the Department of Housing Preservation and Development in New York City has increased significantly. Your task is to answer a set of questions that would help the Department of Housing Preservation and Development tackle these complaints effectively.
You will need to use the techniques you learned in your previous Python, data science, and machine learning courses, including data ingestion, data exploration, data visualization, feature engineering, probabilistic modeling, model validation, and more. By the end of this course, you will have used real-world data science tools to create a showcase project and demonstrate to employers that you are job ready and a worthy candidate in the field of data science.
Created by: IBM
Level: Intermediate

Related Online Courses
Develop the skills necessary to create structured database environments using a relational database management system (RDBMS), such as MySQL, that incorporates basic processing functionality and...
Learn data literacy online using R programming. What is data literacy and why is it important? In this data literacy course, you will learn how to become data literate. This will be accomplished by...
Designing a data lake is challenging because of the scale and growth of data. Developers need to understand best practices to avoid common mistakes that could be hard to rectify. In this course we...
This online course will equip participants with an understanding of computer modelling of breeding programmes to enhance genetic improvements in agriculture. The modelling is done through the...
This course helps prepare you for positions that require the analysis of large data sets, providing the statistics foundation you’ll need for data analysis. You’ll learn how to model real-world app...