Bandit Online Learning of Nash Equilibria in Monotone Games

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

We address online bandit learning of Nash equilibria in multi-agent convex games. We propose an algorithm whereby each agent uses only obtained values of her cost function at each joint played action, lacking any information of the functional form of her cost or other agents' costs or strategies. In contrast to past work where convergent algorithms required strong monotonicity, we prove that the algorithm converges to a Nash equilibrium under mere monotonicity assumption. The proposed algorithm extends the applicability of bandit learning in several games including zero-sum convex games with possibly unbounded action spaces, mixed extension of finite-action zero-sum games, as well as convex games with linear coupling constraints.

Bandit Online Learning of Nash Equilibria in Monotone Games

Graph Chatbot

Chattez avec Graph Search

New Perspectives on Regularization and Computation in Optimal Transport-Based Distributionally Robust Optimization

Equilibria in Network Constrained Energy Markets

Ride-hail vehicle routing (RIVER) as a congestion game

Equilibria in Network Constrained Energy Markets

Ride-hail vehicle routing (RIVER) as a congestion game

New Perspectives on Regularization and Computation in Optimal Transport-Based Distributionally Robust Optimization