Lecture

Exploration and exploitation

Description

This lecture covers the theory of Reinforcement Learning, focusing on the Exploration/Exploitation dilemma, Temporal Difference Learning, and Eligibility Traces in continuous state/action spaces. It discusses the challenges of estimating reward probabilities and the strategies to balance exploration to estimate rewards and exploitation to maximize rewards.

In MOOCs (3)
Neuro Robotics
At the same time, several different tutorials on available data and data tools, such as those from the Allen Institute for Brain Science, provide you with in-depth knowledge on brain atlases, gene exp
Neurorobotics
The MOOC on Neuro-robotics focuses on teaching advanced learners to design and construct a virtual robot and test its performance in a simulation using the HBP robotics platform. Learners will learn t
Neurorobotics
The MOOC on Neuro-robotics focuses on teaching advanced learners to design and construct a virtual robot and test its performance in a simulation using the HBP robotics platform. Learners will learn t
About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.
Related lectures (24)
Deep Learning Agents: Reinforcement Learning
Explores Deep Learning Agents in Reinforcement Learning, emphasizing neural network approximations and challenges in training multiagent systems.
Reinforcement Learning: Q-Learning
Covers Q-Learning in reinforcement learning, exploring action values, policies, and the societal impact of algorithms.
TD Learning: Temporal Difference Learning
Covers Temporal Difference Learning, V-values, state-values, and TD methods in reinforcement learning.
Reinforcement Learning: BackUp Diagrams
Introduces the BackUp diagram as a key graphic representation in reinforcement learning.
Reinforcement Learning: Basics and Applications
Covers the basics of reinforcement learning, including Markov Decision Processes and policy gradient methods, and explores real-world applications and recent advances.
Show more

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.