Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture presents a quiz on the exploration vs. exploitation dilemma using the softmax policy, discussing the importance of Q value differences and the impact of the beta parameter on action selection after iterative updates.