Bayesian multitask inverse reinforcement learning

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

We generalise the problem of inverse reinforcement learning to multiple tasks, from multiple demonstrations. Each one may represent one expert trying to solve a different task, or as different experts trying to solve the same task. Our main contribution is to formalise the problem as statistical preference elicitation, via a number of structured priors, whose form captures our biases about the relatedness of different tasks or expert policies. In doing so, we introduce a prior on policy optimality, which is more natural to specify. We show that our framework allows us not only to learn to efficiently from multiple experts but to also effectively differentiate between the goals of each. Possible applications include analysing the intrinsic motivations of subjects in behavioural experiments and learning from multiple teachers.

Bayesian multitask inverse reinforcement learning

Graph Chatbot

Chat with Graph Search

Incorporating Projective Geometry into Deep Learning

Seeking the new, learning from the unexpected: Computational models of surprise and novelty in the brain

Social Opinion Formation and Decision Making Under Communication Trends

Incorporating Projective Geometry into Deep Learning

Seeking the new, learning from the unexpected: Computational models of surprise and novelty in the brain

Social Opinion Formation and Decision Making Under Communication Trends