Skip to main content
Graph
Search
fr
en
Login
Search
All
Categories
Concepts
Courses
Lectures
MOOCs
People
Practice
Publications
Startups
Units
Show all results for
Home
Concept
Reinforcement learning from human feedback
Graph Chatbot
Related lectures (24)
Login to filter by course
Login to filter by course
Reset
Previous
Page 3 of 3
Next
Reinforcement Learning: TD Learning and SARSA Variants
Discusses reinforcement learning, focusing on temporal difference learning and SARSA algorithm variations.
First steps toward deep reinforcement learning
Explores the shift to deep reinforcement learning through neural networks for direct policy learning, bypassing Q-values and V-values.
General Introduction into Artificial Neural Networks: part 3
Covers learning by rewards in deep reinforcement learning without math details.
Deep Q-Learning: DeepRL1.1
Covers Deep Q-learning in deep neural networks, its application in games, backpropagation, Q-values, and V-values.