Skip to main content
Graph
Search
fr
|
en
Login
Search
All
Categories
Concepts
Courses
Lectures
MOOCs
People
Practice
Publications
Startups
Units
Show all results for
Home
Lecture
Reinforcement Learning: BackUp Diagrams
Graph Chatbot
Related lectures (29)
Previous
Page 2 of 3
Next
Policy Gradient Methods: Direct Action Learning in Reinforcement Learning
Covers policy gradient methods, focusing on direct action learning and optimizing rewards in reinforcement learning.
The Hidden Convex Optimization Landscape of Deep Neural Networks
Explores the hidden convex optimization landscape of deep neural networks, showcasing the transition from non-convex to convex models.
Statistical Physics in Machine Learning: Understanding Deep Learning
Explores the application of statistical physics in understanding deep learning with a focus on neural networks and machine learning challenges.
Neural Networks Optimization
Explores neural networks optimization, including backpropagation, batch normalization, weight initialization, and hyperparameter search strategies.
Reinforcement Learning: Basics and Applications
Covers the basics of reinforcement learning, including trial-and-error learning, Q-learning, deep RL, and applications in gaming and planning.
Neural Networks: Learning Features & Linear Prediction
Explores neural networks' ability to learn features and make linear predictions, emphasizing the importance of data quantity for effective performance.
Introduction to Machine Learning
Provides an overview of Machine Learning, including historical context, key tasks, and real-world applications.
Machine Learning for Solving PDEs: Random Feature Method
Explores the Random Feature Method for solving PDEs using machine learning algorithms to approximate high-dimensional functions efficiently.
General Introduction into Artificial Neural Networks: part 3
Covers learning by rewards in deep reinforcement learning without math details.
First steps toward deep reinforcement learning
Explores the shift to deep reinforcement learning through neural networks for direct policy learning, bypassing Q-values and V-values.