Markov Decision Processes: Foundations of Reinforcement Learning

About
Privacy
Disclaimer

Graph Chatbot

Related lectures (30)

Page 3 of 3

Coin Change Problem

Explores the coin change problem, comparing greedy and dynamic programming algorithms for optimal solutions.

Simplex Algorithm: Basics

Introduces the Simplex algorithm for solving flow problems and handling negative cost cycles.

Reinforcement Learning for Pacman

Covers the application of reinforcement learning to teach Pacman to play autonomously by trial and error.

Dynamic Programming: Steinitz Sequence

Explores dynamic programming with the Steinitz sequence to optimize solutions efficiently.

Model-Free Prediction in Reinforcement Learning: Key Methods

Covers model-free prediction methods in reinforcement learning, focusing on Monte Carlo and Temporal Differences for estimating value functions without transition dynamics knowledge.

Markov Chains and Algorithm Applications

Covers the application of Markov chains and algorithms for function optimization and graph colorings.

Optimization Principles

Covers optimization principles, including linear optimization, networks, and concrete research examples in transportation.

Cutset Formulation: MST Problem

Explores the cutset formulation for the MST Problem and Gomory Cutting Planes method.

Dynamic Programming: Rod Cutting and Change Making

Explores dynamic programming through rod cutting and change making optimization problems.

Reinforcement Learning: Basics and Applications

Covers the basics of reinforcement learning, including Markov Decision Processes and policy gradient methods, and explores real-world applications and recent advances.