**Are you an EPFL student looking for a semester project?**

Work with us on data science and visualisation projects, and deploy your project as an app on top of GraphSearch.

Lecture# Continuous Reinforcement Learning: Advanced Machine Learning

Description

This lecture covers the challenges of continuous-state reinforcement learning, such as the curse of dimensionality and the need for function approximation to estimate the value function. It explains how to learn the value function using Monte-Carlo and Temporal Difference methods, and how to update the value function through roll-outs. The lecture also delves into function approximation for the value function, providing examples of parametrizing the value function and choosing features. It discusses the transition from the value function to the policy, and introduces policy gradients as an alternative approach. Additionally, it explores Policy Gradients and Policy learning by Weighted Exploration with the Returns (POWER) for reinforcement learning, including human demonstration for imitation learning. The lecture concludes with examples of reinforcement learning policies after multiple trials.

Login to watch the video

Official source

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Related concepts (47)

Reinforcement learning

Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents ought to take actions in an environment in order to maximize the notion of cumulative reward. Reinforcement learning is one of three basic machine learning paradigms, alongside supervised learning and unsupervised learning. Reinforcement learning differs from supervised learning in not needing labelled input/output pairs to be presented, and in not needing sub-optimal actions to be explicitly corrected.

Deep reinforcement learning

Deep reinforcement learning (deep RL) is a subfield of machine learning that combines reinforcement learning (RL) and deep learning. RL considers the problem of a computational agent learning to make decisions by trial and error. Deep RL incorporates deep learning into the solution, allowing agents to make decisions from unstructured input data without manual engineering of the state space. Deep RL algorithms are able to take in very large inputs (e.g.

Q-learning

Q-learning is a model-free reinforcement learning algorithm to learn the value of an action in a particular state. It does not require a model of the environment (hence "model-free"), and it can handle problems with stochastic transitions and rewards without requiring adaptations. For any finite Markov decision process (FMDP), Q-learning finds an optimal policy in the sense of maximizing the expected value of the total reward over any and all successive steps, starting from the current state.

Deep learning

Deep learning is part of a broader family of machine learning methods, which is based on artificial neural networks with representation learning. The adjective "deep" in deep learning refers to the use of multiple layers in the network. Methods used can be either supervised, semi-supervised or unsupervised.

Monte Carlo method

Monte Carlo methods, or Monte Carlo experiments, are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. The underlying concept is to use randomness to solve problems that might be deterministic in principle. They are often used in physical and mathematical problems and are most useful when it is difficult or impossible to use other approaches. Monte Carlo methods are mainly used in three problem classes: optimization, numerical integration, and generating draws from a probability distribution.

Related lectures (13)

Acquiring Data for Learning

Explores training robots through reinforcement learning and learning from demonstration, highlighting challenges in human-robot interaction and data collection.

Reinforcement Learning: Basics and Applications

Covers the basics of reinforcement learning, including trial-and-error learning, Q-learning, deep RL, and applications in gaming and planning.

Self-supervised Learning for Autonomous Vehicles

Explores self-supervised learning for autonomous vehicles, deriving labels from data itself and discussing its applications and challenges.

Introduction to Machine Learning

Provides an overview of Machine Learning, including historical context, key tasks, and real-world applications.

Deep Learning for Autonomous Vehicles: Learning

Explores learning in deep learning for autonomous vehicles, covering predictive models, RNN, ImageNet, and transfer learning.