Graph Exploration for Effective Multiagent Q-Learning

This article proposes an exploration technique for multiagent reinforcement learning (MARL) with graph-based communication among agents. We assume that the individual rewards received by the agents are independent of the actions by the other agents, while their policies are coupled. In the proposed framework, neighboring agents collaborate to estimate the uncertainty about the state-action space in order to execute more efficient explorative behavior. Different from existing works, the proposed algorithm does not require counting mechanisms and can be applied to continuous-state environments without requiring complex conversion techniques. Moreover, the proposed scheme allows agents to communicate in a fully decentralized manner with minimal information exchange. And for continuous-state scenarios, each agent needs to exchange only a single parameter vector. The performance of the algorithm is verified with theoretical results for discrete-state scenarios and with experiments for the continuous ones.

Graph Exploration for Effective Multiagent Q-Learning

Graph Chatbot

Chat with Graph Search

The connection of the acyclic disconnection and feedback arc sets - On an open problem of Figueroa et al.

Beyond Spectral Gap: The Role of the Topology in Decentralized Learning

Equivariant Neural Architectures for Representing and Generating Graphs

The connection of the acyclic disconnection and feedback arc sets - On an open problem of Figueroa et al.

Beyond Spectral Gap: The Role of the Topology in Decentralized Learning

Equivariant Neural Architectures for Representing and Generating Graphs