Learning to reach the Pareto optimal Nash equilibrium as a team

被引:0
|
作者
Verbeeck, K [1 ]
Nowé, A [1 ]
Lenaerts, T [1 ]
Parent, J [1 ]
机构
[1] Free Univ Brussels, COMO, B-1050 Brussels, Belgium
来源
AL 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2002年 / 2557卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Coordination is an important issue in multi-agent systems when agents want to maximize their revenue. Often coordination is achieved through communication, however communication has its price. We are interested in finding an approach where the communication between the agents is kept low, and a global optimal behavior can still be found. In this paper we report on an efficient approach that allows independent reinforcement learning agents to reach a Pareto optimal Nash equilibrium with limited communication. The communication happens at regular time steps and is basically a signal for the agents to start an exploration phase. During each exploration phase, some agents exclude their current best action so as to give the team the opportunity to look for a possibly better Nash equilibrium. This technique of reducing the action space by exclusions was only recently introduced for finding periodical policies in games of conflicting interests. Here, we explore this technique in repeated common interest games with deterministic or stochastic outcomes.
引用
收藏
页码:407 / 418
页数:12
相关论文
共 50 条
  • [41] Pareto efficiency as alternative to Nash equilibrium in competitive discrete location under delivered pricing
    Pelegrin, Blas
    Fernandez, Pascual
    Garcia, Maria Dolores
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2024, 75 (04) : 731 - 741
  • [42] Relationship between Nash Equilibria and Pareto Optimal Solutions for Games of Pure Coordination
    Das, Rohini
    Goswami, Sayan
    Konar, Amit
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [43] OPTIMAL REACH ESTIMATION AND METRIC LEARNING
    Aamari, Eddie
    Berenfeld, Clement
    Levrard, Clement
    ANNALS OF STATISTICS, 2023, 51 (03): : 1086 - 1108
  • [44] Equilibrium Models and Managerial Team Learning
    Fuglseth, Anna Mette
    Gronhaug, Kjell
    ENERGY, NATURAL RESOURCES AND ENVIRONMENTAL ECONOMICS, 2010, : 101 - 114
  • [45] Pareto Optimal Allocation and Price Equilibrium for a Duopoly with Negative Externality
    Pablo Dorta-González
    Dolores-Rosa Santos-Peñate
    Rafael Suárez-Vega
    Annals of Operations Research, 2002, 116 : 129 - 152
  • [46] Pareto optimal allocation and price equilibrium for a duopoly with negative externality
    Dorta-González, P
    Santos-Peñate, DR
    Suárez-Vega, R
    ANNALS OF OPERATIONS RESEARCH, 2002, 116 (1-4) : 129 - 152
  • [47] Pareto-Optimal Active Learning with Cost
    Adams, Stephen
    Cody, Tyler
    Beling, Peter A.
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 1519 - 1526
  • [48] COURNOT-NASH EQUILIBRIUM AND OPTIMAL TRANSPORT IN A DYNAMIC SETTING
    Acciaio, Beatrice
    Veraguas, Julio Backhoff
    Jia, Junchao
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2021, 59 (03) : 2273 - 2300
  • [49] On simulation of optimal strategies and Nash equilibrium in the financial market context
    Jonas Mockus
    Journal of Global Optimization, 2010, 48 : 129 - 143
  • [50] On simulation of optimal strategies and Nash equilibrium in the financial market context
    Mockus, Jonas
    JOURNAL OF GLOBAL OPTIMIZATION, 2010, 48 (01) : 129 - 143