Skip to main content

Search

Show all results for

Home

Lecture

Temporal difference learning

About
Privacy
Disclaimer

Copyright © 2026 EPFL, all rights reserved

Graph Chatbot

Description

This lecture covers the theory of Reinforcement Learning, the Exploration/Exploitation dilemma, Temporal Difference Learning, Eligibility Traces, and strategies for Continuous State/Action Spaces. It also introduces the Q-Learning algorithm, optimal paths, and the Bellman equation for multi-step horizons.

Official source

https://mediaspace.epfl.ch/media/0_mtvnsxur

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Related lectures (30)

Infinite-Horizon Problems: Formulation & Complexity

Covers infinite-horizon problems in Applied Probability and Stochastic Processes.

Interactive Lecture: Reinforcement Learning

Explores advanced reinforcement learning topics, including policies, value functions, Bellman recursion, and on-policy TD control.

Markov Decision Processes: Dynamic Programming Techniques

Discusses Markov Decision Processes and dynamic programming techniques for solving optimal policies in various scenarios.

Introduction to Reinforcement Learning: Concepts and Applications

Introduces reinforcement learning, covering its concepts, applications, and key algorithms.

Reinforcement Learning: One-step Horizon (Bandit Problems)

Covers Bandit Problems in Reinforcement Learning, focusing on one-step horizon games and Q-values.