Decision Making and Reinforcement Learning
About this Course
This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. We first model simple decision problems as multi-armed bandit problems in and discuss several approaches to evaluate feedback. We will then model decision problems as finite Markov decision processes (MDPs), and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, modeled by POMDPs and then solved by online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. An emphasis on algorithms and examples will be a key part of this course.Created by: Columbia University

Related Online Courses
This specialization is intended for software engineers, development and product managers, testers, QA analysts, product analysts, tech writers, and security engineers. Even if you have experience... more
This course will introduce learners to different approaches to thinking about housing justice, bringing together material, ecological, social and spatial approaches to thinking about housing.... more
Get ready to enter the thriving field of Information Technology (IT), with job ready skills! This beginner friendly course provides the core hardware and operating system knowledge needed by anyone... more
This Specialization will develop and enhance your understanding of the global field of journalism. You\'ll learn best practices and ethical standards for newsgathering processes and compiling a... more
By the end of this course you will be able to start writing your own Jenkins scripts either in the Jenkins UI or as Jenkinsfile over a multi-stage CD workflow.Created by: Coursera Project Network more