Decision Making and Reinforcement Learning

About this Course

This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. We first model simple decision problems as multi-armed bandit problems in and discuss several approaches to evaluate feedback. We will then model decision problems as finite Markov decision processes (MDPs), and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, modeled by POMDPs and then solved by online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. An emphasis on algorithms and examples will be a key part of this course.

Created by: Columbia University


Related Online Courses

This course explores Hawthorne\'s aptitude in the genre of romance. It also discusses how he turned into an author from a civil servant. Additionally, you will explore a few important plot points... more
In today\'s rapidly evolving digital landscape, the importance of cybersecurity cannot be overstated. Threats to organizations, both large and small, are on the rise, and the consequences of... more
This is a self-paced lab that takes place in the Google Cloud console. In this lab, you use API products to package your APIs, and create a developer portal so application developers can try your... more
Learn about the materials that have advanced the performance of artificial intelligence, and the machine learning models that could help accelerate the design and development of novel materials.... more
This class presents the fundamental probability and statistical concepts used in elementary data analysis. It will be taught at an introductory level for students with junior or senior... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL