KU Classifieds>KU Online Courses>Decision Making and Reinforcement Learning

Decision Making and Reinforcement Learning

About this Course

This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. We first model simple decision problems as multi-armed bandit problems in and discuss several approaches to evaluate feedback. We will then model decision problems as finite Markov decision processes (MDPs), and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, modeled by POMDPs and then solved by online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. An emphasis on algorithms and examples will be a key part of this course.

Created by: Columbia University


Related Online Courses

Unlock the potential of blockchain and smart contracts in this comprehensive course designed to guide you from the fundamentals to creating decentralized applications (DApps). Learn how blockchain... more
The urgent transition towards a low-carbon economy will profoundly change our economy. Households, companies and financial intermediaries have to be ready in order to avoid the downside risks and... more
Dive into the comprehensive PMP Certification Exam Preparation course, designed to ensure success on your first attempt at the Project Management Professional (PMP) exam. Aligning with the latest... more
The practice of investment management has been transformed in recent years by computational methods. Instead of merely explaining the science, we help you build on that foundation in a practical... more
In this course, you will see how web apps in Azure allow you to publish and manage your website easily without having to work with the underlying servers, storage, or network assets. Instead, you... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL