Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
Covers MuZero, a model that learns to predict rewards and actions iteratively, achieving state-of-the-art performance in board games and Atari video games.
Compares model-based and model-free reinforcement learning, highlighting the advantages of the former in adapting to reward changes and planning future actions.
Discusses advanced reinforcement learning techniques, focusing on deep and robust methods, including actor-critic frameworks and adversarial learning strategies.