Visual Perception

About this Course

The ultimate goal of a computer vision system is to generate a detailed symbolic description of each image shown. This course focuses on the all-important problem of perception. We first describe the problem of tracking objects in complex scenes. We look at two key challenges in this context. The first is the separation of an image into object and background using a technique called change detection. The second is the tracking of one or more objects in a video. Next, we examine the problem of segmenting an image into meaningful regions. In particular, we take a bottom-up approach where pixels with similar attributes are grouped together to obtain a region. Finally, we tackle the problem of object recognition. We describe two approaches to the problem. The first directly recognize an object and its pose using the appearance of the object. This method is based on the concept of dimension reduction, which is achieved using principal component analysis. The second approach is to use a neural network to solve the recognition problem as one of learning a mapping from the input (image) to the output (object class, object identity, activity, etc.). We describe how a neural network is constructed and how it is trained using the backpropagation algorithm.

Created by: Columbia University


Related Online Courses

By the end of this course, a learner will have a solid understanding of Large Language Models running locally. You\'ll be able to setup a local environment using powerful tooling to run different... more
The purpose of this course is to help those who are qualified develop confidence to attempt the exam, and to help those not yet qualified to develop their own plan for preparation.Created by:... more
This is a self-paced lab that takes place in the Google Cloud console. In this lab, you use API products to package your APIs, and create a developer portal so application developers can try your... more
This is a self-paced lab that takes place in the Google Cloud console. In addition to batch pipelines, Data Fusion also allows you to create real-time pipelines, that can process events as they are... more
\"Responsible Generative AI\" is a Specialization exploring the possibilities and risks of generative artificial intelligence (AI). You will establish a comprehensive understanding of the impact of... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL