Lecture

Theory of reinforcement learning: Grid examples

Description

This lecture covers the theory of reinforcement learning with a focus on grid examples, explaining concepts such as expected rewards, Q-values, and Q-learning. The instructor demonstrates how to estimate Q-values over trials and iteratively update them using a learning rate.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.