Skip to main content
Graph
Search
fr
|
en
Login
Search
All
Categories
Concepts
Courses
Lectures
MOOCs
People
Practice
Publications
Startups
Units
Show all results for
Home
Lecture
Model-Free Prediction in Reinforcement Learning: Key Methods
Graph Chatbot
Related lectures (30)
Previous
Page 3 of 3
Next
Landscape and Generalisation in Deep Learning
Explores the challenges and insights of deep learning, focusing on loss landscape, generalization, and feature learning.
Principled Reinforcement Learning with Human Feedback
Delves into Reinforcement Learning with Human Feedback, discussing convergence of estimators and introducing a pessimistic approach for improved performance.
Advanced Machine Learning: Discrete Reinforcement Learning
Introduces the basics of Reinforcement Learning, covering discrete states, actions, policies, value functions, MDPs, and optimal policies.
Gradient-Based Algorithms in High-Dimensional Learning
Provides insights on gradient-based algorithms, deep learning mysteries, and the challenges of non-convex problems.
Policy Gradient Methods in Reinforcement Learning
Covers policy gradient methods in reinforcement learning, focusing on optimization techniques and practical applications like the cartpole problem.
Dynamic Programming: Optimal Control
Explores Dynamic Programming for optimal control, focusing on stability, stationary policy, and recursive solutions.
Gradient Descent on Two-Layer ReLU Neural Networks
Analyzes gradient descent on two-layer ReLU neural networks, exploring global convergence, regularization, implicit bias, and statistical efficiency.
Reinforcement Learning: Non-Stationary Policies and OPPO
Covers finite horizon reinforcement learning, non-stationary policies, and the optimistic variant of Proximal Policy Optimization (OPPO).
Asset Selling Problem
Explores the Asset Selling Problem to maximize long-term reward without a deadline.
Exploration Bias
Explores regularization, learning algorithms, and subgaussian assumptions in machine learning.