Lecture

Reinforcement Learning: SARSA Algorithm

Description

This lecture covers SARSA, an on-policy temporal-difference algorithm used in reinforcement learning. The name comes from the sequence 'state-action-reward-state-action', the five quantities (s, a, r, s', a') that enter each Q-value update. The lecture explains the iterative update of Q-values in multistep environments, compares the SARSA update with the Bellman equation, and works through examples of applying SARSA in a one-dimensional environment. It also discusses the convergence of SARSA and why exploration matters in reinforcement learning.
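As a rough illustration of the update described above, here is a minimal sketch of tabular SARSA on a one-dimensional corridor. The environment (five positions, reward 1 at the right end), the epsilon-greedy exploration rule, and all names and parameter values (`step`, `train`, `alpha`, `gamma`, `epsilon`) are assumptions made for this example and are not taken from the lecture. The update it applies is the standard one, Q(s, a) ← Q(s, a) + alpha * (r + gamma * Q(s', a') - Q(s, a)), where a' is the action the current policy actually takes in s'.

```python
import random

N_STATES = 5          # assumed corridor: positions 0..4, position 4 is the goal
ACTIONS = (-1, +1)    # action index 0 = step left, index 1 = step right


def step(state, action_idx):
    """Hypothetical 1-D environment: move, stop at the left wall, reward 1 at the goal."""
    next_state = min(max(state + ACTIONS[action_idx], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def epsilon_greedy(q, state, epsilon, rng):
    """Explore with probability epsilon, otherwise act greedily (ties broken at random)."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    best = max(q[state])
    return rng.choice([a for a, v in enumerate(q[state]) if v == best])


def sarsa_update(q, s, a, r, s_next, a_next, alpha, gamma):
    """One SARSA step: move Q(s, a) toward r + gamma * Q(s', a')."""
    td_target = r + gamma * q[s_next][a_next]
    q[s][a] += alpha * (td_target - q[s][a])


def train(episodes=200, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # Q-table, one row per state
    for _ in range(episodes):
        s = 0
        a = epsilon_greedy(q, s, epsilon, rng)
        done = False
        while not done:
            s_next, r, done = step(s, a)
            # On-policy: the next action comes from the same epsilon-greedy policy.
            # The goal row of q is never updated, so it contributes 0 to the target.
            a_next = epsilon_greedy(q, s_next, epsilon, rng)
            sarsa_update(q, s, a, r, s_next, a_next, alpha, gamma)
            s, a = s_next, a_next
    return q


if __name__ == "__main__":
    for state, row in enumerate(train()):
        print(state, row)
```

Because the bootstrap term uses the action a' that the agent really takes next, exploratory moves feed back into the learned values; this is what makes the method on-policy. Setting `epsilon` to 0 in this sketch removes exploration, and learning then depends entirely on the random tie-breaking among untried actions.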

