Lecture

Reinforcement Learning: Q-Learning

Description

This lecture covers Q-Learning, a model-free reinforcement learning algorithm. It explains how Q-Learning estimates action values, why learning stops once the estimates have converged, and how the method compares to Monte Carlo estimation. The application to Tic-Tac-Toe is discussed with examples and quizzes.
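
As a rough illustration of the idea summarized above, here is a minimal sketch of tabular Q-Learning using the standard update Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a)). It is not taken from the lecture: the toy chain environment, the function names (q_update, train_chain), the parameter values, and the stopping rule (stop after an episode in which no estimate changed by more than a small tolerance) are all assumptions chosen to keep the example short. A Tic-Tac-Toe version would follow the same pattern with board positions as states.

```python
import random
from collections import defaultdict


def q_update(Q, state, action, reward, next_state, next_actions,
             alpha=0.5, gamma=0.9):
    """Apply one Q-Learning update; return the absolute change in Q[(state, action)]."""
    # Value of the best action in the next state (0.0 if next_state is terminal).
    best_next = max((Q[(next_state, a)] for a in next_actions), default=0.0)
    target = reward + gamma * best_next
    delta = alpha * (target - Q[(state, action)])
    Q[(state, action)] += delta
    return abs(delta)


def train_chain(n_states=4, episodes=5000, tol=1e-6, seed=0):
    """Hypothetical toy task: walk along states 0..n_states-1 with actions -1/+1.

    Reaching the last state ends the episode with reward 1; all other rewards are 0.
    """
    rng = random.Random(seed)
    Q = defaultdict(float)  # unseen (state, action) pairs start at 0.0
    actions = (-1, +1)
    for _ in range(episodes):
        state, max_change = 0, 0.0
        while state != n_states - 1:
            # Purely random behaviour; Q-Learning still learns greedy values (off-policy).
            action = rng.choice(actions)
            next_state = min(max(state + action, 0), n_states - 1)
            done = next_state == n_states - 1
            reward = 1.0 if done else 0.0
            change = q_update(Q, state, action, reward, next_state,
                              () if done else actions)
            max_change = max(max_change, change)
            state = next_state
        # Assumed stopping rule: no estimate moved by more than tol in this episode.
        if max_change < tol:
            break
    return Q


if __name__ == "__main__":
    Q = train_chain()
    for s in range(3):
        print(s, round(Q[(s, -1)], 3), round(Q[(s, +1)], 3))
```

In this sketch the estimates for moving right approach 1, 0.9, and 0.81 as the distance to the goal grows, reflecting the discount factor gamma = 0.9. Unlike a Monte Carlo estimate, which waits for the full return at the end of an episode, each update here bootstraps from the current estimate of the next state.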
