Skip to main content

Search

Show all results for

Home

Lecture

Reinforcement Learning: Q-Learning

About
Privacy
Disclaimer

Copyright © 2026 EPFL, all rights reserved

Graph Chatbot

Description

This lecture covers Q-Learning, a model-free reinforcement learning algorithm. It explains how Q-Learning estimates action values, stops at convergence, and compares to Monte Carlo Estimation. The application to Tic-Tac-Toe is discussed with examples and quizzes.

This video is available exclusively on Mediaspace for a restricted audience. Please log in to MediaSpace to access it if you have the necessary permissions.

Watch on Mediaspace

Official source

https://mediaspace.epfl.ch/media/0_r1ynys4u

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Related lectures (29)

Learning Agents: Exploration-Exploitation Tradeoff

Explores the exploration-exploitation tradeoff in learning unknown effects of actions using multi-armed bandits and Q-learning.

Deep Learning Agents: Reinforcement Learning

Explores Deep Learning Agents in Reinforcement Learning, emphasizing neural network approximations and challenges in training multiagent systems.

Autonomous Vehicles: Intelligence and Perception

Explores intelligence, perception, and AI applications in autonomous vehicles, emphasizing rational thinking and social intelligence.

Collective Learning Dynamics: Similarity Exploitation

Delves into collective learning dynamics with similarity exploitation, covering structured learning, adaptive frameworks, modeling, simulation, and experimental results.

Perception: Data-Driven Approaches

Explores perception in deep learning for autonomous vehicles, covering image classification, optimization methods, and the role of representation in machine learning.