Visual Perception

About this Course

The ultimate goal of a computer vision system is to generate a detailed symbolic description of each image shown. This course focuses on the all-important problem of perception. We first describe the problem of tracking objects in complex scenes. We look at two key challenges in this context. The first is the separation of an image into object and background using a technique called change detection. The second is the tracking of one or more objects in a video. Next, we examine the problem of segmenting an image into meaningful regions. In particular, we take a bottom-up approach where pixels with similar attributes are grouped together to obtain a region. Finally, we tackle the problem of object recognition. We describe two approaches to the problem. The first directly recognize an object and its pose using the appearance of the object. This method is based on the concept of dimension reduction, which is achieved using principal component analysis. The second approach is to use a neural network to solve the recognition problem as one of learning a mapping from the input (image) to the output (object class, object identity, activity, etc.). We describe how a neural network is constructed and how it is trained using the backpropagation algorithm.

Created by: Columbia University


Related Online Courses

This specialization is intended for working professionals, in both leadership and non-leadership roles, seeking to learn ways they can establish a better work-life balance for themselves and their... more
In 2020 the world will generate 50 times the amount of data as in 2011. And 75 times the number of information sources (IDC, 2011). Being able to use this data provides huge opportunities and to... more
This is a self-paced lab that takes place in the Google Cloud console. In this lab, you learn to use BigQuery to find data, query the data-to-insights public dataset, and write and execute... more
Begin your journey into Natural Language Processing (NLP) with an introduction to text data and encoding techniques, delving into the intricacies of regular expressions through extensive practice... more
Join Us in a Top 50 MOOC of All Time!\\n\\nHow do we sequence and compare genomes? How do we identify the genetic basis for disease? How do we construct an evolutionary Tree of Life for all species... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL