Learning to reach the Pareto optimal Nash equilibrium as a team

被引:0
|
作者
Verbeeck, K [1 ]
Nowé, A [1 ]
Lenaerts, T [1 ]
Parent, J [1 ]
机构
[1] Free Univ Brussels, COMO, B-1050 Brussels, Belgium
来源
AL 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2002年 / 2557卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Coordination is an important issue in multi-agent systems when agents want to maximize their revenue. Often coordination is achieved through communication, however communication has its price. We are interested in finding an approach where the communication between the agents is kept low, and a global optimal behavior can still be found. In this paper we report on an efficient approach that allows independent reinforcement learning agents to reach a Pareto optimal Nash equilibrium with limited communication. The communication happens at regular time steps and is basically a signal for the agents to start an exploration phase. During each exploration phase, some agents exclude their current best action so as to give the team the opportunity to look for a possibly better Nash equilibrium. This technique of reducing the action space by exclusions was only recently introduced for finding periodical policies in games of conflicting interests. Here, we explore this technique in repeated common interest games with deterministic or stochastic outcomes.
引用
收藏
页码:407 / 418
页数:12
相关论文
共 50 条
  • [31] Optimal equilibrium solution algorithm for non-inferior Nash strategies in multi-team game systems
    Yu, Guo-Lin
    Liu, San-Yang
    Li, Bing-Jie
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2007, 24 (05): : 785 - 789
  • [32] Nash-based criteria for selection of Pareto Optimal PI controller
    Sabina Sanchez, Helem
    Vilanova, Ramon
    2013 17TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2013, : 465 - 472
  • [33] Pareto-based evolutionary multiobjective approaches and the generalized Nash equilibrium problem
    Lung, Rodica Ioana
    Gasko, Noemi
    Suciu, Mihai Alexandru
    JOURNAL OF HEURISTICS, 2020, 26 (04) : 561 - 584
  • [34] Generic Stability of the Weakly Pareto-Nash Equilibrium with Strategy Transformational Barriers
    Liu, Luping
    Jia, Wensheng
    Zhou, Li
    JOURNAL OF FUNCTION SPACES, 2022, 2022
  • [35] Pareto-based evolutionary multiobjective approaches and the generalized Nash equilibrium problem
    Rodica Ioana Lung
    Noémi Gaskó
    Mihai Alexandru Suciu
    Journal of Heuristics, 2020, 26 : 561 - 584
  • [36] STEADY-STATE LEARNING AND NASH EQUILIBRIUM
    FUDENBERG, D
    LEVINE, DK
    ECONOMETRICA, 1993, 61 (03) : 547 - 573
  • [37] When does learning lead to Nash equilibrium?
    Börgers, T
    OPERATIONS RESEARCH PROCEEDINGS 1999, 2000, : 176 - 202
  • [38] Learning to learn, pattern recognition, and Nash equilibrium
    Sonsino, D
    GAMES AND ECONOMIC BEHAVIOR, 1997, 18 (02) : 286 - 331
  • [39] Probably Approximately Correct Nash Equilibrium Learning
    Fele, Filiberto
    Margellos, Kostas
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (09) : 4238 - 4245
  • [40] Decentralized Interference Channels with Noisy Feedback Possess Pareto Optimal Nash Equilibria
    Perlaza, Samir M.
    Tandon, Ravi
    Poor, H. Vincent
    2014 6TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING (ISCCSP), 2014, : 408 - 411