Skip to main content
Graph
Search
fr
|
en
Login
Search
All
Categories
Concepts
Courses
Lectures
MOOCs
People
Practice
Publications
Startups
Units
Show all results for
Home
Lecture
Temporal difference learning
Graph Chatbot
Related lectures (30)
Previous
Page 1 of 3
Next
Infinite-Horizon Problems: Formulation & Complexity
Covers infinite-horizon problems in Applied Probability and Stochastic Processes.
Interactive Lecture: Reinforcement Learning
Explores advanced reinforcement learning topics, including policies, value functions, Bellman recursion, and on-policy TD control.
Markov Decision Processes: Dynamic Programming Techniques
Discusses Markov Decision Processes and dynamic programming techniques for solving optimal policies in various scenarios.
Introduction to Reinforcement Learning: Concepts and Applications
Introduces reinforcement learning, covering its concepts, applications, and key algorithms.
Reinforcement Learning: One-step Horizon (Bandit Problems)
Covers Bandit Problems in Reinforcement Learning, focusing on one-step horizon games and Q-values.
Policy Iteration and Linear Programming in MDPs
Discusses policy iteration and linear programming methods for solving Markov Decision Processes.
Advanced Machine Learning: Discrete Reinforcement Learning
Introduces the basics of Reinforcement Learning, covering discrete states, actions, policies, value functions, MDPs, and optimal policies.
Optimal Marketing Strategy
Covers decision-making in marketing based on customer behavior for optimal strategies.
Reinforcement Learning: Eligibility Traces
Explores Reinforcement Learning, focusing on updating previous action values along the trajectory using the SARSA algorithm.
Deep Learning Agents: Reinforcement Learning
Explores Deep Learning Agents in Reinforcement Learning, emphasizing neural network approximations and challenges in training multiagent systems.