Skip to main content
Lecture

Policy Estimation: The Log-Likelihood Trick