Learning to reach the Pareto optimal Nash equilibrium as a team

被引:0
|
作者
Verbeeck, K [1 ]
Nowé, A [1 ]
Lenaerts, T [1 ]
Parent, J [1 ]
机构
[1] Free Univ Brussels, COMO, B-1050 Brussels, Belgium
来源
AL 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2002年 / 2557卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Coordination is an important issue in multi-agent systems when agents want to maximize their revenue. Often coordination is achieved through communication, however communication has its price. We are interested in finding an approach where the communication between the agents is kept low, and a global optimal behavior can still be found. In this paper we report on an efficient approach that allows independent reinforcement learning agents to reach a Pareto optimal Nash equilibrium with limited communication. The communication happens at regular time steps and is basically a signal for the agents to start an exploration phase. During each exploration phase, some agents exclude their current best action so as to give the team the opportunity to look for a possibly better Nash equilibrium. This technique of reducing the action space by exclusions was only recently introduced for finding periodical policies in games of conflicting interests. Here, we explore this technique in repeated common interest games with deterministic or stochastic outcomes.
引用
收藏
页码:407 / 418
页数:12
相关论文
共 50 条
  • [21] PARETO-OPTIMAL NASH EQUILIBRIA ARE COMPETITIVE IN A REPEATED ECONOMY
    KURZ, M
    HART, S
    JOURNAL OF ECONOMIC THEORY, 1982, 28 (02) : 320 - 346
  • [22] Essential components of the set of weakly Pareto-Nash equilibrium points
    Yang, H
    Yu, J
    APPLIED MATHEMATICS LETTERS, 2002, 15 (05) : 553 - 560
  • [23] THE PARETO-OPTIMALITY OF NASH EQUILIBRIUM IN DYNAMIC CONTROLLED SYSTEMS WITH CONFLICT
    MAMEDOV, MB
    USSR COMPUTATIONAL MATHEMATICS AND MATHEMATICAL PHYSICS, 1990, 30 (04): : 16 - 24
  • [24] The Optimal Nash Equilibrium Strategies Under Competition
    孟力
    王崇喜
    汪定伟
    张爱玲
    JournalofShanghaiJiaotongUniversity, 2004, (04) : 91 - 96
  • [25] The Optimal Nash Equilibrium Strategies under the Competition
    Meng Li
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON PUBLIC ADMINISTRATION (4TH), VOL II, 2008, : 610 - 615
  • [26] Recency, consistent learning, and Nash equilibrium
    Fudenberg, Drew
    Levine, David K.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 : 10826 - 10829
  • [27] The optimal Nash equilibrium strategies under the competition
    Meng Li
    Wang Chong-xi
    Wang Ding-wei
    Zhang Ai-ling
    Proceedings of 2004 Chinese Control and Decision Conference, 2004, : 714 - +
  • [28] Learning, hypothesis testing, and Nash equilibrium
    Foster, DP
    Young, HP
    GAMES AND ECONOMIC BEHAVIOR, 2003, 45 (01) : 73 - 96
  • [29] Reinforcement Learning for Nash Equilibrium Generation
    Cittern, David
    Edalat, Abbas
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 1727 - 1728
  • [30] RATIONAL LEARNING LEADS TO NASH EQUILIBRIUM
    KALAI, E
    LEHRER, E
    ECONOMETRICA, 1993, 61 (05) : 1019 - 1045