Lecture

Reinforcement Learning: Policy Gradient and Actor-Critic Methods

Description

This lecture covers policy gradient and actor-critic methods in reinforcement learning. It opens with reinforcement learning in deep artificial neural networks and explains two algorithms: REINFORCE with baseline and actor-critic. The instructor contrasts model-based and model-free reinforcement learning and stresses that understanding the distinction matters both for applying reinforcement learning techniques and for reading research papers. The lecture reviews Q-values and V-values and introduces eligibility traces for policy gradients. The actor-critic method is presented as a combination of policy gradient and temporal difference learning, and its advantages are highlighted. The session closes with a summary of deep reinforcement learning techniques, including eligibility traces and the role of learning two neural networks: the actor and the critic. The lecture prepares students to apply these methods and to study the field further.
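To make the combination of policy gradient and temporal difference learning concrete, here is a minimal sketch of a one-step actor-critic update. It is not taken from the lecture. It assumes a tabular softmax actor and a tabular critic instead of two neural networks, omits eligibility traces, and uses a hypothetical four-state chain environment. The function names, step sizes, and discount factor are illustrative choices.

```python
import math
import random


def softmax(prefs):
    """Turn action preferences into probabilities (numerically stable)."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def actor_critic_step(theta, v, s, a, r, s_next, done,
                      alpha_actor=0.1, alpha_critic=0.1, gamma=0.9):
    """One-step actor-critic update for a tabular actor `theta` and critic `v`.

    Returns the TD error that drives both updates.
    """
    # Critic: the TD error compares the bootstrapped target with the current V-value.
    target = r if done else r + gamma * v[s_next]
    delta = target - v[s]
    v[s] += alpha_critic * delta

    # Actor: policy gradient step. For a softmax policy,
    # d log pi(a|s) / d theta[s][b] = 1[a == b] - pi(b|s).
    probs = softmax(theta[s])
    for b in range(len(theta[s])):
        grad_log = (1.0 if b == a else 0.0) - probs[b]
        theta[s][b] += alpha_actor * delta * grad_log
    return delta


def run_episode(theta, v, rng, n_states=4, max_steps=50):
    """Hypothetical toy chain: action 1 moves right, action 0 moves left.

    Reaching the rightmost state ends the episode with reward 1.
    """
    s = 0
    for _ in range(max_steps):
        probs = softmax(theta[s])
        a = 1 if rng.random() < probs[1] else 0
        s_next = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
        done = s_next == n_states - 1
        r = 1.0 if done else 0.0
        actor_critic_step(theta, v, s, a, r, s_next, done)
        if done:
            break
        s = s_next


if __name__ == "__main__":
    rng = random.Random(0)
    n_states = 4
    theta = [[0.0, 0.0] for _ in range(n_states)]  # actor: action preferences
    v = [0.0] * n_states                           # critic: V-values
    for _ in range(500):
        run_episode(theta, v, rng)
    # Probability of moving right in each non-terminal state after training.
    print([round(softmax(t)[1], 2) for t in theta[:-1]])
```

In this sketch the critic's TD error plays the role that the return minus a baseline plays in REINFORCE with baseline, which is why the actor can update after every step rather than waiting for the end of an episode.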


Official source

https://mediaspace.epfl.ch/media/0_4nl5mrzf

