Lecture

Temporal difference learning

Description

This lecture covers the theory of reinforcement learning: the exploration/exploitation dilemma, temporal difference learning, eligibility traces, and strategies for continuous state and action spaces. It also introduces the Q-learning algorithm, optimal paths, and the Bellman equation for multi-step horizons.
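As a rough illustration of how temporal difference learning, Q-learning, and the exploration/exploitation trade-off fit together, here is a minimal tabular sketch. It is not taken from the lecture: the chain environment, the function names (`epsilon_greedy`, `q_learning_update`, `train_chain`), and the values of `alpha`, `gamma`, and `epsilon` are all assumptions chosen for the example.

```python
import random


def epsilon_greedy(q_row, epsilon, rng):
    """Explore with probability epsilon, otherwise exploit the best-known action."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    best = max(q_row)
    # Break ties at random so the untrained agent does not get stuck on one action.
    return rng.choice([a for a, q in enumerate(q_row) if q == best])


def q_learning_update(Q, s, a, r, s_next, done, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning step in place and return the TD error."""
    # Bellman target: immediate reward plus the discounted best next value.
    target = r if done else r + gamma * max(Q[s_next])
    td_error = target - Q[s][a]
    Q[s][a] += alpha * td_error
    return td_error


def train_chain(n_states=5, episodes=500, epsilon=0.1, seed=0):
    """Hypothetical toy task: walk a chain, reward 1 on reaching the right end."""
    rng = random.Random(seed)
    Q = [[0.0, 0.0] for _ in range(n_states)]  # actions: 0 = left, 1 = right
    for _ in range(episodes):
        s = 0
        while s != n_states - 1:
            a = epsilon_greedy(Q[s], epsilon, rng)
            s_next = max(s - 1, 0) if a == 0 else s + 1
            done = s_next == n_states - 1
            r = 1.0 if done else 0.0
            q_learning_update(Q, s, a, r, s_next, done)
            s = s_next
    return Q
```

Calling `train_chain()` returns a table of action values; reading off the action with the largest value in each state gives the learned path toward the rewarding end of the chain. Eligibility traces and continuous state/action spaces are left out of this sketch.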
