This lecture covers the application of reinforcement learning to teaching Pacman to play autonomously, with a focus on policy gradient methods and Markov decision processes. It discusses the challenges encountered, such as the large parameter space, and proposes solutions such as log-linear parametrization and vectorization.
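
As an illustration of the log-linear parametrization mentioned above, the following is a minimal sketch of a softmax policy whose action scores are linear in a feature vector, together with one REINFORCE-style policy gradient update. The summary does not specify the lecture's features, update rule, or hyperparameters, so the function names, the two-dimensional features, and the learning rate are all assumptions made for illustration. The sketch uses plain Python lists and does not show the vectorized form.

```python
import math

def action_probs(theta, features):
    """Softmax over the linear scores theta . phi(s, a), one feature vector per action."""
    scores = [sum(t * f for t, f in zip(theta, phi)) for phi in features]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def grad_log_prob(theta, features, action):
    """Gradient of log pi(a | s): phi(s, a) minus the expected feature vector under pi."""
    probs = action_probs(theta, features)
    dim = len(theta)
    expected = [sum(p * phi[i] for p, phi in zip(probs, features)) for i in range(dim)]
    return [features[action][i] - expected[i] for i in range(dim)]

def reinforce_step(theta, features, action, ret, lr=0.1):
    """One update: theta + lr * return * grad log pi(a | s). lr is an assumed value."""
    g = grad_log_prob(theta, features, action)
    return [t + lr * ret * gi for t, gi in zip(theta, g)]

# Hypothetical features for two actions in one state: [bias, moves toward food].
features = [[1.0, 1.0], [1.0, 0.0]]
theta = [0.0, 0.0]

# Action 0 was taken and the episode return was positive, so its probability should rise.
new_theta = reinforce_step(theta, features, action=0, ret=1.0)
```

With this parametrization the number of learned parameters equals the number of features rather than the number of state-action pairs, which is one common way to handle a large parameter space.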