Reinforcement learning

About
Privacy
Disclaimer

Graph Chatbot

Related lectures (29)

Page 3 of 3

Explores safe learning in robotics, covering the state of the art, open challenges, and vision in the field, emphasizing the importance of interdisciplinary collaboration.

Interactive Lecture: Reinforcement Learning

Explores advanced reinforcement learning topics, including policies, value functions, Bellman recursion, and on-policy TD control.

Reinforcement Learning: Reward-based Learning

Explores artificial neural networks, reward information in the brain, animal conditioning, deep reinforcement learning, and a quiz on rewards.

Reinforcement Learning: SARSA Algorithm

Explores the SARSA algorithm for reinforcement learning, focusing on updating Q-values and the importance of exploration in learning by rewards.

Modern NLP: From GPT to ChatGPT

Explores the evolution of modern NLP from GPT-2 to GPT-3, emphasizing in-context learning and the development of ChatGPT.

Reinforcement Learning: Basics and Applications

Covers the basics of reinforcement learning, including Markov Decision Processes and policy gradient methods, and explores real-world applications and recent advances.

Subtracting the mean reward via the value function

Covers the significance of subtracting the mean reward in policy gradient methods for deep reinforcement learning, reducing noise in the stochastic gradient.

Reinforcement Learning: Exploration, Credit Assignment, TRPO, PPO

Delves into Reinforcement Learning problems, TRPO, PPO, and limitations in RL.

Policy Gradient Methods: Multiple Time Steps

Explores Policy Gradient methods over multiple time steps, focusing on updating policy parameters to maximize rewards.