Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture covers the concept of multi-arm bandits, focusing on the Upper Confidence Bound algorithm to balance exploration and exploitation. Topics include confidence intervals, regret analysis, and the trade-off between exploration and exploitation.