Decision Making and Reinforcement Learning

About this Course

This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. We then model simple decision problems as multi-armed bandit problems and discuss several approaches to evaluating feedback. Next, we model decision problems as finite Markov decision processes (MDPs) and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, which is modeled by partially observable MDPs (POMDPs) and addressed with online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. The course emphasizes algorithms and examples throughout.

Created by: Columbia University
