Decision Making and Reinforcement Learning
About this Course
This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. We first model simple decision problems as multi-armed bandit problems in and discuss several approaches to evaluate feedback. We will then model decision problems as finite Markov decision processes (MDPs), and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, modeled by POMDPs and then solved by online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. An emphasis on algorithms and examples will be a key part of this course.Created by: Columbia University
Related Online Courses
Are you a builder who is interested in using Amazon Elastic File System (Amazon EFS)? Do you want to understand how to get started with Amazon EFS? Then, this course is for you! Amazon EFS provides... more
The Raspberry Pi is a small, affordable single-board computer that you will use to design and develop fun and practical IoT devices while learning programming and computer hardware. In addition,... more
Develop a greater appreciation for how the air, water, land, and life formed and have interacted over the last 4.5 billion years.Created by: University of Manchester more
Welcome to the Amazon Elastic Container Service course, where you\'ll embark on a journey to acquire practical expertise in Amazon Elastic Container Service and harness the power of Amazon Web... more
The specialization \"Leading Technical Organizations\" is intended for post-graduate students seeking to develop advanced leadership skills for technical environments. Through three courses, you... more