Visual Perception

About this Course

The ultimate goal of a computer vision system is to generate a detailed symbolic description of each image it is shown. This course focuses on the all-important problem of perception. We first describe the problem of tracking objects in complex scenes and look at two key challenges in this context. The first is the separation of an image into object and background using a technique called change detection. The second is the tracking of one or more objects in a video. Next, we examine the problem of segmenting an image into meaningful regions. In particular, we take a bottom-up approach in which pixels with similar attributes are grouped together to form a region. Finally, we tackle the problem of object recognition and describe two approaches to it. The first directly recognizes an object and its pose using the appearance of the object. This method is based on the concept of dimension reduction, which is achieved using principal component analysis. The second approach uses a neural network to treat recognition as the problem of learning a mapping from the input (image) to the output (object class, object identity, activity, etc.). We describe how a neural network is constructed and how it is trained using the backpropagation algorithm.

Created by: Columbia University


