Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture introduces policy gradient methods using a simple example of a single neuron with binary output. The lecture covers how to associate actions with stimuli, optimize rewards directly, and change neuron weights to maximize expected rewards.