Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture presents a quiz on the implementation of Monte-Carlo methods, focusing on estimating the total return in a network with a thousand states and four action choices in each state, including a single terminal state. The quiz challenges the audience to determine the number of return variables to open and allocate in an episode, emphasizing the exploration of the graph to estimate new return variables along the way.