Juniata Classifieds>Juniata Online Courses>Decision Making and Reinforcement Learning

Decision Making and Reinforcement Learning

About this Course

This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. We first model simple decision problems as multi-armed bandit problems in and discuss several approaches to evaluate feedback. We will then model decision problems as finite Markov decision processes (MDPs), and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, modeled by POMDPs and then solved by online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. An emphasis on algorithms and examples will be a key part of this course.

Created by: Columbia University


Related Online Courses

This course examines how the digestive system processes food, absorbs nutrients, and eliminates waste. Learners will explore the roles of the stomach, intestines, liver, pancreas, and gallbladder,... more
This course provides a foundational understanding of human anatomy and physiology, exploring how body structures and systems work together to maintain health. Learners will develop essential... more
This comprehensive Agile, Scrum, and Project Management specialization equips you with the skills to lead successful projects and excel in Agile environments. Through focused modules, you\'ll... more
Where have you experienced biology today? Journey through the science of life through the lens of our daily lives. This specialization is intended to bridge the gap between traditional biology... more
In this course, you will understand the influence of the angle of attack and speed on the lift. Then we will focus on hazards and limitations, like stall, spiral dive, or flutter. You will... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL