Lectures related to Temporal difference learning

Infinite-Horizon Problems: Formulation & Complexity

Covers infinite-horizon problems in Applied Probability and Stochastic Processes.

Interactive Lecture: Reinforcement Learning

Explores advanced reinforcement learning topics, including policies, value functions, Bellman recursion, and on-policy TD control.

Markov Decision Processes: Dynamic Programming Techniques

Discusses Markov Decision Processes and dynamic programming techniques for computing optimal policies in various scenarios.

Introduction to Reinforcement Learning: Concepts and Applications

Introduces reinforcement learning, covering its concepts, applications, and key algorithms.

Reinforcement Learning: One-step Horizon (Bandit Problems)

Covers bandit problems in reinforcement learning, focusing on one-step horizon games and Q-values.

Policy Iteration and Linear Programming in MDPs

Discusses policy iteration and linear programming methods for solving Markov Decision Processes.

Advanced Machine Learning: Discrete Reinforcement Learning

Introduces the basics of Reinforcement Learning, covering discrete states, actions, policies, value functions, MDPs, and optimal policies.

Optimal Marketing Strategy

Covers marketing decisions driven by customer behavior, with the aim of finding optimal strategies.

Reinforcement Learning: Eligibility Traces

Explores reinforcement learning, focusing on how the SARSA algorithm updates the values of earlier actions along the trajectory.

Deep Learning Agents: Reinforcement Learning

Explores deep learning agents in reinforcement learning, emphasizing neural network approximations and the challenges of training multi-agent systems.

Reinforcement Learning: Q-Learning

Introduces Q-Learning, Deep Q-Learning, the REINFORCE algorithm, and Monte-Carlo Tree Search in reinforcement learning, culminating in AlphaGo Zero.

Reinforcement Learning: Q-Learning

Covers Q-Learning in reinforcement learning, exploring action values, policies, and the societal impact of algorithms.

Policy Gradient and Actor-Critic Methods: Eligibility Traces Explained

Discusses policy gradient and actor-critic methods, focusing on eligibility traces and their application in reinforcement learning tasks.

Reinforcement Learning: Policy Gradient and Actor-Critic Methods

Provides an overview of reinforcement learning, focusing on policy gradient and actor-critic methods for deep artificial neural networks.

Asset Selling Problem

Explores the Asset Selling Problem to maximize long-term reward without a deadline.

Continuous Reinforcement Learning: Advanced Machine Learning

Explores continuous-state reinforcement learning challenges, value function estimation, policy gradients, and Policy learning by Weighted Exploration.

Introduction to Reinforcement Learning: Key Concepts and Applications

Introduces reinforcement learning, covering its definitions, applications, and theoretical foundations, while outlining the course structure and objectives.

Model-Free Prediction in Reinforcement Learning: Key Methods

Covers model-free prediction methods in reinforcement learning, focusing on Monte Carlo and temporal-difference methods, which estimate value functions without knowledge of the transition dynamics.

Reinforcement Learning: TD Learning and SARSA Variants

Discusses reinforcement learning, focusing on temporal difference learning and variants of the SARSA algorithm.

Markov Decision Processes: Foundations of Reinforcement Learning

Covers Markov Decision Processes, their structure, and their role in reinforcement learning.