Skip to main content
Graph
Search
fr
|
en
Login
Search
All
Categories
Concepts
Courses
Lectures
MOOCs
People
Practice
Publications
Startups
Units
Show all results for
Home
Lecture
Value Iteration Acceleration: PID and Operator Splitting
Graph Chatbot
Related lectures (31)
Previous
Page 1 of 4
Next
Markov Decision Processes: Foundations of Reinforcement Learning
Covers Markov Decision Processes, their structure, and their role in reinforcement learning.
Introduction to Reinforcement Learning: Key Concepts and Applications
Introduces reinforcement learning, covering its definitions, applications, and theoretical foundations, while outlining the course structure and objectives.
Controlled Stochastic Processes
Explores controlled stochastic processes, focusing on analysis, behavior, and optimization, using dynamic programming to solve real-world problems.
Infinite-Horizon Problems: Formulation & Complexity
Covers infinite-horizon problems in Applied Probability and Stochastic Processes.
Asset Selling Problem
Explores the Asset Selling Problem to maximize long-term reward without a deadline.
Optimal Marketing Strategy
Covers decision-making in marketing based on customer behavior for optimal strategies.
Markov Decision Processes: Dynamic Programming Techniques
Discusses Markov Decision Processes and dynamic programming techniques for solving optimal policies in various scenarios.
Jacobi and Gauss-Seidel methods
Explains the Jacobi and Gauss-Seidel methods for solving linear systems iteratively.
Policy Iteration and Linear Programming in MDPs
Discusses policy iteration and linear programming methods for solving Markov Decision Processes.
Convergence Analysis: Iterative Methods
Covers the convergence analysis of iterative methods and the conditions for convergence.