Skip to main content
Graph
Search
fr
en
Login
Search
All
Categories
Concepts
Courses
Lectures
MOOCs
People
Practice
Publications
Startups
Units
Show all results for
Home
Lecture
Temporal difference learning
Graph Chatbot
Related lectures (30)
Previous
Page 1 of 3
Next
Infinite-Horizon Problems: Formulation & Complexity
Covers infinite-horizon problems in Applied Probability and Stochastic Processes.
Interactive Lecture: Reinforcement Learning
Explores advanced reinforcement learning topics, including policies, value functions, Bellman recursion, and on-policy TD control.
Markov Decision Processes: Dynamic Programming Techniques
Discusses Markov Decision Processes and dynamic programming techniques for solving optimal policies in various scenarios.
Introduction to Reinforcement Learning: Concepts and Applications
Introduces reinforcement learning, covering its concepts, applications, and key algorithms.
Reinforcement Learning: One-step Horizon (Bandit Problems)
Covers Bandit Problems in Reinforcement Learning, focusing on one-step horizon games and Q-values.
Policy Iteration and Linear Programming in MDPs
Discusses policy iteration and linear programming methods for solving Markov Decision Processes.
Advanced Machine Learning: Discrete Reinforcement Learning
Introduces the basics of Reinforcement Learning, covering discrete states, actions, policies, value functions, MDPs, and optimal policies.
Optimal Marketing Strategy
Covers decision-making in marketing based on customer behavior for optimal strategies.
Reinforcement Learning: Eligibility Traces
Explores Reinforcement Learning, focusing on updating previous action values along the trajectory using the SARSA algorithm.
Deep Learning Agents: Reinforcement Learning
Explores Deep Learning Agents in Reinforcement Learning, emphasizing neural network approximations and challenges in training multiagent systems.