Skip to main content
Graph
Search
fr
en
Login
Search
All
Categories
Concepts
Courses
Lectures
MOOCs
People
Practice
Publications
Startups
Units
Show all results for
Home
Concept
Q-learning
Applied sciences
Information engineering
Machine learning
Reinforcement learning
Graph Chatbot
Related lectures (29)
Login to filter by course
Login to filter by course
Reset
Previous
Page 2 of 3
Next
Reinforcement Learning: Basics and Applications
Covers the basics of reinforcement learning, including trial-and-error learning, Q-learning, deep RL, and applications in gaming and planning.
Reinforcement Learning Basics
Introduces the basics of reinforcement learning, including Q-learning and epsilon-greedy policies.
Convolutional Neural Networks
Introduces Convolutional Neural Networks (CNNs) for autonomous vehicles, covering architecture, applications, and regularization techniques.
Multilayer Neural Networks: Deep Learning
Covers the fundamentals of multilayer neural networks and deep learning.
Self-supervised Learning for Autonomous Vehicles
Explores self-supervised learning for autonomous vehicles, deriving labels from data itself and discussing its applications and challenges.
Autonomous Vehicles: Intelligence and Perception
Explores intelligence, perception, and AI applications in autonomous vehicles, emphasizing rational thinking and social intelligence.
Reinforcement Learning: Q-Learning
Covers Q-Learning, a model-free reinforcement learning algorithm, and its application to Tic-Tac-Toe with examples and quizzes.
Introduction to Reinforcement Learning: Concepts and Applications
Introduces reinforcement learning, covering its concepts, applications, and key algorithms.
Reinforcement Learning: SARSA Algorithm
Explores the SARSA algorithm for reinforcement learning, focusing on updating Q-values and the importance of exploration in learning by rewards.
Mini-Batches in On- and Off-Policy Deep Reinforcement Learning
Explains the significance of mini-batches in Deep Reinforcement Learning and the differences between on-policy and off-policy methods.