Logical Team Q-learning: An approach towards factored policies in cooperative MARL
Related publications (34)
Graph Chatbot
Chat with Graph Search
Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.
DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
Mountain regions provide essential ecosystem goods and services (EGS) for both mountain dwellers and people living outside these areas. Global change endangers the capacity of mountain ecosystems to provide key services. The Mountland project focused on th ...
Smart specialisation is a policy concept that has enjoyed a short but very exciting life! Elaborated by a group of academic “experts” in 2008, it very quickly made a significant impact on the policy audience, particularly in Europe. Such a success story in ...
Industrial symbiosis (IS) emerged as a self-organizing business strategy among firms that are willing to cooperate to improve their economic and environmental performance. The adoption of such cooperative strategies relates to increasing costs of waste man ...
The objective of the thesis is to deepen the understanding of the interplay between ICT-enabled innovation and governance by providing evidence of the changes they are producing on governance processes and policy making mechanisms. Moreover, the thesis aim ...
We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous work on Bayesian inverse reinforcement learning and allows us to obtain a pos ...
In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinforcement learning problems. While utility bounds are known to exist for this ...
Cohesiveness in teams is an essential part of ensuring the smooth running of task-oriented groups. Research in social psychology and management has shown that good cohesion in groups can be correlated with team effectiveness or productivity, so automatical ...
Cohesiveness in teams is an essential part of ensuring the smooth running of task-oriented groups. Research in social psychology and management has shown that good cohesion in groups can be correlated with team effectiveness or productivity so automaticall ...
In order to cope with the challenges of climate change, fundamental changes are needed in established systems of service provision and consumption. A key rationale for climate policy making is that it induces firms and other actors to develop new ‘climate- ...
Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schemes without value functions, which focus on policy representation using class ...