Skip to main content
Lecture

How to change the policy with a gradient method