Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture covers the theory of Reinforcement Learning, the Exploration/Exploitation dilemma, Temporal Difference Learning, and Eligibility Traces, focusing on updating previous action values along the trajectory. The SARSA algorithm is presented, along with its initialization and update rules. Additional reading material is recommended for further understanding.