Learning continuous-time working memory tasks with on-policy neural reinforcement learning
Related publications (45)
Graph Chatbot
Chat with Graph Search
Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
There are important individual differences in adaptation and reactivity to stressful challenges. Being subjected to strict social confinement is a distressful psychological experience leading to reduced emotional well-being, but it is not known how it can ...
Background: Previous studies on possible memory deficits in 22q11DS often focused on quantifying the information memorized, whereas learning processes have been mostly overlooked. Furthermore, methodological differences in task design have made verbal and ...
In the past few years, the importance of electric mobility has increased in response to growing concerns about climate change. However, limited cruising range and sparse charging infrastructure could restrain a massive deployment of electric vehicles (EVs) ...
Objective.Mobile Brain/Body Imaging (MoBI) frameworks allowed the research community to find evidence of cortical involvement at walking initiation and during locomotion. However, the decoding of gait patterns from brain signals remains an open challenge. ...
Fueled by recent advances in deep neural networks, reinforcement learning (RL) has been in the limelight because of many recent breakthroughs in artificial intelligence, including defeating humans in games (e.g., chess, Go, StarCraft), self-driving cars, s ...
We establish probabilistic small data global well-posedness of the energy-critical Maxwell-Klein-Gordon equation relative to the Coulomb gauge for scaling super-critical random initial data. The proof relies on an induction on frequency procedure and a mod ...
Auditory perception is an essential part of a robotic system in Human-Robot Interaction (HRI), and creating an artificial auditory perception system that is on par with human has been a long-standing goal for researchers. In fact, this is a challenging res ...
Touch is commonly used to mediate human-machine interactions, notably in the setting of Digital Musical Instruments (DMIs), where touch screens are prevalent. The lack of rich haptic feedback has an impact on the richness and quality of the interaction. Pi ...
In reinforcement learning (RL), an agent makes sequential decisions to maximise the reward it can obtain from an environment. During learning, the actual and expected outcomes are compared to tell whether a decision was good or bad. The difference between ...
Previous single-site neurostimulation experiments have unsuccessfully attempted to shift decision-making away from habitual control, a fast, inflexible cognitive strategy, towards goal-directed control, a flexible, though computationally expensive strategy ...