One-shot learning and behavioral eligibility traces in sequential decision making
Related publications (52)
We introduce contextual stochastic bilevel optimization (CSBO) -- a stochastic bilevel optimization framework with the lower-level problem minimizing an expectation conditioned on some contextual information and the upper-level decision variable. This fram ...
With the rising focus on academic safety, there has been an effort to improve the academic safety climate and develop lab-specific risk assessment tools. Despite the progress made in recent years, there is still a deficit of reliable data statistics on saf ...
School routes are paths where children learn, gain independence, forge their identity and interact with other beings – be they human or non-human. Many aspects that determine this journey are of a social and cultural nature. But the spatial shaping of the wa ...
It is natural for humans to judge the outcome of a decision under uncertainty as a percentage of an ex-post optimal performance. We propose a robust decision-making framework based on a relative performance index. It is shown that if the decision maker's p ...
There is a need for a tool that facilitates safety decision-making in the academic environment. As this environment is very different from that of industry or other public sectors, there is no information available on the factors that influence the decisio ...
We consider a learning system based on the conventional multiplicative weight (MW) rule that combines experts' advice to predict a sequence of true outcomes. It is assumed that one of the experts is malicious and aims to impose the maximum loss on the sys ...
In practice, most operational activity-based models have focused on single-day analyses. This common simplifying assumption significantly limits the models' behavioural realism, as they cannot adequately capture the dynamics and processes involved in the s ...
This paper studies the operation of multi-agent networks engaged in multi-task decision problems under the paradigm of simultaneous learning and adaptation. Two scenarios are considered: one in which a decision must be taken among multiple states of nature ...
An animal's ability to learn how to make decisions based on sensory evidence is often well described by Reinforcement Learning (RL) frameworks. These frameworks, however, typically apply to event-based representations and lack the explicit and fine-grained ...
The reuse of structural components in new buildings has great potential to reduce the environmental impacts of the construction sector but remains uncommon practice. An obstacle to its wider implementation is the lack of robust assessment methods and decis ...