One-shot learning and behavioral eligibility traces in sequential decision making
Graph Chatbot
Chat with Graph Search
Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
School routes are paths where children learn, gain independence, forge their identity and interact with other beings – be they human or non-human. Many aspects that determine this journey are of social and cultural nature. But the spatial shaping of the wa ...
This paper studies the operation of multi-agent networks engaged in multi-task decision problems under the paradigm of simultaneous learning and adaptation. Two scenarios are considered:one in which a decision must be taken among multiple states of nature ...
In practice, most operational activity-based models have focused on single-day analyses. This common simplifying assumption significantly limits the models' behavioural realism, as they cannot adequately capture the dynamics and processes involved in the s ...
We introduce contextual stochastic bilevel optimization (CSBO) -- a stochastic bilevel optimization framework with the lower-level problem minimizing an expectation conditioned on some contextual information and the upper-level decision variable. This fram ...
An animals' ability to learn how to make decisions based on sensory evidence is often well described by Reinforcement Learning (RL) frameworks. These frameworks, however, typically apply to event-based representations and lack the explicit and fine-grained ...
We consider a learning system based on the conventional multiplicative weight ( MW) rule that combines experts' advice to predict a sequence of true outcomes. It is assumed that one of the experts is malicious and aims to impose the maximum loss on the sys ...
With the rising focus on academic safety, there has been an effort to improve the academic safety climate and develop lab-specific risk assessment tools. Despite the progress made in recent years, there is still a deficit of reliable data statistics on saf ...
There is a need for a tool that facilitates safety decision-making in the academic environment. As this environment is very different from that of industry or other public sectors, there is no information available on the factors that influence the decisio ...
It is natural for humans to judge the outcome of a decision under uncertainty as a percentage of an ex-post optimal performance. We propose a robust decision-making framework based on a relative performance index. It is shown that if the decision maker's p ...
The reuse of structural components in new buildings has great potential to reduce the environmental impacts of the construction sector but remains uncommon practice. An obstacle to its wider implementation is the lack of robust assessment methods and decis ...