NYU Classifieds>NYU Online Courses>Decision Making and Reinforcement Learning

Decision Making and Reinforcement Learning

About this Course

This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. We first model simple decision problems as multi-armed bandit problems in and discuss several approaches to evaluate feedback. We will then model decision problems as finite Markov decision processes (MDPs), and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, modeled by POMDPs and then solved by online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. An emphasis on algorithms and examples will be a key part of this course.

Created by: Columbia University


Related Online Courses

What\'s the best and quickest way to convert Word documents to PDF, get rid of the old version, automatically upload the new PDF version to the folder of our choice and get a notification about it,... more
More than two decades into the new millennium, it is difficult to envision a future in which digital technologies do not play a significant role. Digital technologies frequently make a lot of... more
The Discover Best Practice Farming for a Sustainable 2050 Course is based on a clear vision: imagine best practice farming for 2050, start to implement these strategies now, all the while making... more
This is a self-paced lab that takes place in the Google Cloud console. In this lab you will build a time series model to forcast demand of multiple products using BigQuery ML. This lab is based on... more
This course will provide information on the various stages of semiconductor package manufacturing, including sort, assembly, and final test. In addition, we will also describe how to select, build,... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL