Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture introduces Temporal Difference Learning (TD learning) and the standard TD(0) algorithm for estimating values in reinforcement learning. It covers the concept of V-values, state-values, and the Bellman equation for value consistency of neighboring states. The lecture also explains how TD methods explore the graph over time, compare values at neighboring time steps, and update values based on 'temporal difference'.