Decision Making and Reinforcement Learning
About this Course
This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. Next, we model simple decision problems as multi-armed bandit problems and discuss several approaches to evaluating feedback. We then model decision problems as finite Markov decision processes (MDPs) and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, modeled by partially observable MDPs (POMDPs) and solved by online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. Algorithms and examples are emphasized throughout the course.
Created by: Columbia University

Related Online Courses
The React Fundamentals course is designed to provide a comprehensive introduction to React, the popular JavaScript library for building user interfaces. This course is ideal for web developers who... more
In 2006, the British mathematician Clive Humby coined the phrase "Data is the new Oil". This analogy has been proven correct as data powers entire industries nowadays but if left unrefined, is... more
This comprehensive course covers the foundational principles of Continuous Integration (CI) and Continuous Deployment (CD), emphasizing the integral role of automation in the software development... more
In this Specialization, you will master design thinking competencies in an engaging hands-on, project-based format. We will guide you through a detailed 14-Step process where you will tackle a... more
In the rapidly evolving world of design and innovation, prototyping is essential for bringing ideas to life swiftly and effectively. Building on your existing knowledge of Generative AI tools like... more