Learning to reach the Pareto optimal Nash equilibrium as a team

被引：0

作者：

Verbeeck, K ^{[1
]}

Nowé, A ^{[1
]}

Lenaerts, T ^{[1
]}

Parent, J ^{[1
]}

机构：

[1] Free Univ Brussels, COMO, B-1050 Brussels, Belgium

来源：

AL 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2002年 / 2557卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Coordination is an important issue in multi-agent systems when agents want to maximize their revenue. Often coordination is achieved through communication, however communication has its price. We are interested in finding an approach where the communication between the agents is kept low, and a global optimal behavior can still be found. In this paper we report on an efficient approach that allows independent reinforcement learning agents to reach a Pareto optimal Nash equilibrium with limited communication. The communication happens at regular time steps and is basically a signal for the agents to start an exploration phase. During each exploration phase, some agents exclude their current best action so as to give the team the opportunity to look for a possibly better Nash equilibrium. This technique of reducing the action space by exclusions was only recently introduced for finding periodical policies in games of conflicting interests. Here, we explore this technique in repeated common interest games with deterministic or stochastic outcomes.

引用

页码：407 / 418

页数：12

共 50 条

[31] Optimal equilibrium solution algorithm for non-inferior Nash strategies in multi-team game systems
Yu, Guo-Lin
Liu, San-Yang
Li, Bing-Jie
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2007, 24 (05): : 785 - 789
[32] Nash-based criteria for selection of Pareto Optimal PI controller
Sabina Sanchez, Helem
Vilanova, Ramon
2013 17TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2013, : 465 - 472
[33] Pareto-based evolutionary multiobjective approaches and the generalized Nash equilibrium problem
Lung, Rodica Ioana
Gasko, Noemi
Suciu, Mihai Alexandru
JOURNAL OF HEURISTICS, 2020, 26 (04) : 561 - 584
[34] Generic Stability of the Weakly Pareto-Nash Equilibrium with Strategy Transformational Barriers
Liu, Luping
Jia, Wensheng
Zhou, Li
JOURNAL OF FUNCTION SPACES, 2022, 2022
[35] Pareto-based evolutionary multiobjective approaches and the generalized Nash equilibrium problem
Rodica Ioana Lung
Noémi Gaskó
Mihai Alexandru Suciu
Journal of Heuristics, 2020, 26 : 561 - 584
[36] STEADY-STATE LEARNING AND NASH EQUILIBRIUM
FUDENBERG, D
LEVINE, DK
ECONOMETRICA, 1993, 61 (03) : 547 - 573
[37] When does learning lead to Nash equilibrium?
Börgers, T
OPERATIONS RESEARCH PROCEEDINGS 1999, 2000, : 176 - 202
[38] Learning to learn, pattern recognition, and Nash equilibrium
Sonsino, D
GAMES AND ECONOMIC BEHAVIOR, 1997, 18 (02) : 286 - 331
[39] Probably Approximately Correct Nash Equilibrium Learning
Fele, Filiberto
Margellos, Kostas
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (09) : 4238 - 4245
[40] Decentralized Interference Channels with Noisy Feedback Possess Pareto Optimal Nash Equilibria
Perlaza, Samir M.
Tandon, Ravi
Poor, H. Vincent
2014 6TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING (ISCCSP), 2014, : 408 - 411

← 1 2 3 4 5 →