Model-Free Prediction in Reinforcement Learning: Key Methods

About
Privacy
Disclaimer

Graph Chatbot

Related lectures (30)

Page 3 of 3

Landscape and Generalisation in Deep Learning

Explores the challenges and insights of deep learning, focusing on loss landscape, generalization, and feature learning.

Principled Reinforcement Learning with Human Feedback

Delves into Reinforcement Learning with Human Feedback, discussing convergence of estimators and introducing a pessimistic approach for improved performance.

Advanced Machine Learning: Discrete Reinforcement Learning

Introduces the basics of Reinforcement Learning, covering discrete states, actions, policies, value functions, MDPs, and optimal policies.

Gradient-Based Algorithms in High-Dimensional Learning

Provides insights on gradient-based algorithms, deep learning mysteries, and the challenges of non-convex problems.

Policy Gradient Methods in Reinforcement Learning

Covers policy gradient methods in reinforcement learning, focusing on optimization techniques and practical applications like the cartpole problem.

Dynamic Programming: Optimal Control

Explores Dynamic Programming for optimal control, focusing on stability, stationary policy, and recursive solutions.

Gradient Descent on Two-Layer ReLU Neural Networks

Analyzes gradient descent on two-layer ReLU neural networks, exploring global convergence, regularization, implicit bias, and statistical efficiency.

Reinforcement Learning: Non-Stationary Policies and OPPO

Covers finite horizon reinforcement learning, non-stationary policies, and the optimistic variant of Proximal Policy Optimization (OPPO).

Asset Selling Problem

Explores the Asset Selling Problem to maximize long-term reward without a deadline.

Exploration Bias

Explores regularization, learning algorithms, and subgaussian assumptions in machine learning.