Markov Chain ConvergenceExplores Markov chain convergence, emphasizing invariant distribution, Law of Large Numbers, and mean rewards computation.
Reinforcement Learning for PacmanExplores applying reinforcement learning to teach Pacman to play autonomously using policy gradient methods and Markov decision processes.