Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture covers the relationship between policy gradient algorithms and V values, explaining how V values can be used to speed up the convergence of the algorithms through active critic networks. It also discusses the calculation of V values in a separate network and the potential sharing of neurons with the actual network.