Explores safe learning in robotics, covering the state of the art, open challenges, and vision in the field, emphasizing the importance of interdisciplinary collaboration.
Covers the basics of reinforcement learning, including Markov Decision Processes and policy gradient methods, and explores real-world applications and recent advances.
Covers the significance of subtracting the mean reward in policy gradient methods for deep reinforcement learning, reducing noise in the stochastic gradient.