Lecture

Linear Programming Techniques in Reinforcement Learning

Description

This lecture introduces the linear programming (LP) approach to reinforcement learning (RL), presenting it as an alternative, convex viewpoint on solving Markov decision processes (MDPs). It begins by revisiting the RL setup and the challenges of traditional methods, such as the need for approximate dynamic programming and the limitations of existing algorithms. The instructor discusses the Bellman optimality equation and its role in characterizing optimal policies. The lecture then develops the primal and dual LP formulations of an MDP: the primal optimizes over value functions, while the dual optimizes over occupancy measures. The occupancy measure is defined and visualized, illustrating its role in recovering both the value function and the optimal policy. The lecture also covers the REPS algorithm, which applies proximal-point methods to the dual LP, and discusses its effectiveness in practical applications such as robotics. The session concludes with a summary of the LP approach's advantages and challenges, setting the stage for later discussions of policy gradient methods.
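To make the dual LP concrete, the sketch below solves a tiny, made-up 2-state, 2-action MDP with `scipy.optimize.linprog`. It uses the standard dual formulation: maximize expected reward over the occupancy measure d(s, a), subject to the flow constraints sum_a d(s', a) = mu(s') + gamma * sum_{s,a} P(s'|s, a) d(s, a) and d >= 0. All numerical values are illustrative assumptions, not from the lecture.

```python
import numpy as np
from scipy.optimize import linprog

# Hypothetical 2-state, 2-action MDP; all numbers are made up for illustration.
n_s, n_a = 2, 2
gamma = 0.9
# P[s, a, s'] -- transition probabilities
P = np.array([[[0.8, 0.2], [0.1, 0.9]],
              [[0.5, 0.5], [0.3, 0.7]]])
# r[s, a] -- rewards
r = np.array([[1.0, 0.0],
              [0.0, 2.0]])
mu = np.array([0.5, 0.5])  # initial-state distribution

# Dual LP over the occupancy measure d(s, a):
#   max   sum_{s,a} d(s,a) * r(s,a)
#   s.t.  sum_a d(s',a) - gamma * sum_{s,a} P(s'|s,a) d(s,a) = mu(s'),  d >= 0
# Flatten d to a vector of length n_s * n_a (index = s * n_a + a).
c = -r.flatten()  # linprog minimizes, so negate the objective
A_eq = np.zeros((n_s, n_s * n_a))
for sp in range(n_s):
    for s in range(n_s):
        for a in range(n_a):
            A_eq[sp, s * n_a + a] = (1.0 if s == sp else 0.0) - gamma * P[s, a, sp]
b_eq = mu

res = linprog(c, A_eq=A_eq, b_eq=b_eq, bounds=[(0, None)] * (n_s * n_a))
d = res.x.reshape(n_s, n_a)  # optimal occupancy measure

# The optimal policy is recovered by normalizing the occupancy measure per state.
pi = d / d.sum(axis=1, keepdims=True)
print("occupancy measure d(s,a):\n", d)
print("policy pi(a|s):\n", pi)
```

Note that the total occupancy mass sums to 1/(1 - gamma), and at an LP vertex the recovered policy is deterministic, which illustrates why the dual view directly yields optimal policies.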
