Learning to reach the Pareto optimal Nash equilibrium as a team

Cited by: 0

Authors:

Verbeeck, K [1]

Nowé, A [1]

Lenaerts, T [1]

Parent, J [1]

Affiliations:

[1] Free Univ Brussels, COMO, B-1050 Brussels, Belgium

Source:

AI 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2002 / Vol. 2557

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Coordination is an important issue in multi-agent systems when agents want to maximize their revenue. Coordination is often achieved through communication; however, communication has its price. We are interested in finding an approach in which communication between the agents is kept low and a globally optimal behavior can still be found. In this paper we report on an efficient approach that allows independent reinforcement learning agents to reach a Pareto optimal Nash equilibrium with limited communication. The communication happens at regular time steps and is basically a signal for the agents to start an exploration phase. During each exploration phase, some agents exclude their current best action so as to give the team the opportunity to look for a possibly better Nash equilibrium. This technique of reducing the action space by exclusions was only recently introduced for finding periodical policies in games of conflicting interests. Here, we explore this technique in repeated common interest games with deterministic or stochastic outcomes.
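
The exploration-phase idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not specify the learning rule, the phase length, or which agents exclude actions. The sketch assumes two agents, a deterministic common interest game given as a payoff matrix, an epsilon-greedy running-average learner, and that every agent excludes its current best action at each synchronisation signal, with exclusions accumulating. All names (`run_phase`, `learn_as_team`, `steps`, `eps`) are hypothetical.

```python
import random


def run_phase(payoff, allowed, rng, steps=500, eps=0.2):
    """One exploration phase: each agent learns independently
    over its currently allowed actions (assumed learner: epsilon-greedy
    with running-average value estimates)."""
    q = [{a: 0.0 for a in acts} for acts in allowed]  # per-agent value estimates
    n = [{a: 0 for a in acts} for acts in allowed]    # per-agent visit counts
    for _ in range(steps):
        joint = []
        for i in range(2):
            actions = sorted(allowed[i])
            if rng.random() < eps:
                joint.append(rng.choice(actions))                    # explore
            else:
                joint.append(max(actions, key=lambda a: q[i][a]))    # exploit
        reward = payoff[joint[0]][joint[1]]  # common payoff, seen by both agents
        for i, a in enumerate(joint):
            n[i][a] += 1
            q[i][a] += (reward - q[i][a]) / n[i][a]
    # Each agent's current best action at the end of the phase.
    greedy = tuple(max(sorted(allowed[i]), key=lambda a: q[i][a]) for i in range(2))
    return greedy, payoff[greedy[0]][greedy[1]]


def learn_as_team(payoff, max_phases=10, seed=0):
    """Run synchronised exploration phases with action exclusion.

    Returns the best joint action found, its payoff, and the per-phase history.
    """
    rng = random.Random(seed)
    allowed = [set(range(len(payoff))), set(range(len(payoff[0])))]
    history = []
    while len(history) < max_phases and all(allowed):
        greedy, value = run_phase(payoff, allowed, rng)
        history.append((greedy, value))
        # Synchronisation signal (the only communication): every agent
        # excludes its current best action before the next phase.
        # Assumption: all agents exclude, and exclusions accumulate.
        for i in range(2):
            allowed[i].discard(greedy[i])
    # Both agents saw the same payoffs, so each can pick the best phase on its own.
    best_joint, best_value = max(history, key=lambda h: h[1])
    return best_joint, best_value, history
```

Because the agents share the payoff and the phase boundaries, each can independently select the phase with the highest payoff and replay its own action from that phase. The sketch gives no guarantee that the joint action it returns is a Pareto optimal Nash equilibrium.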

Citation

Pages: 407 - 418

Page count: 12

50 items in total

[21] PARETO-OPTIMAL NASH EQUILIBRIA ARE COMPETITIVE IN A REPEATED ECONOMY
KURZ, M
HART, S
JOURNAL OF ECONOMIC THEORY, 1982, 28 (02) : 320 - 346
[22] Essential components of the set of weakly Pareto-Nash equilibrium points
Yang, H
Yu, J
APPLIED MATHEMATICS LETTERS, 2002, 15 (05) : 553 - 560
[23] THE PARETO-OPTIMALITY OF NASH EQUILIBRIUM IN DYNAMIC CONTROLLED SYSTEMS WITH CONFLICT
MAMEDOV, MB
USSR COMPUTATIONAL MATHEMATICS AND MATHEMATICAL PHYSICS, 1990, 30 (04) : 16 - 24
[24] The Optimal Nash Equilibrium Strategies Under Competition
孟力
王崇喜
汪定伟
张爱玲
Journal of Shanghai Jiaotong University, 2004, (04) : 91 - 96
[25] The Optimal Nash Equilibrium Strategies under the Competition
Meng Li
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON PUBLIC ADMINISTRATION (4TH), VOL II, 2008, : 610 - 615
[26] Recency, consistent learning, and Nash equilibrium
Fudenberg, Drew
Levine, David K.
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 : 10826 - 10829
[27] The optimal Nash equilibrium strategies under the competition
Meng Li
Wang Chong-xi
Wang Ding-wei
Zhang Ai-ling
Proceedings of 2004 Chinese Control and Decision Conference, 2004, : 714 - +
[28] Learning, hypothesis testing, and Nash equilibrium
Foster, DP
Young, HP
GAMES AND ECONOMIC BEHAVIOR, 2003, 45 (01) : 73 - 96
[29] Reinforcement Learning for Nash Equilibrium Generation
Cittern, David
Edalat, Abbas
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 1727 - 1728
[30] RATIONAL LEARNING LEADS TO NASH EQUILIBRIUM
KALAI, E
LEHRER, E
ECONOMETRICA, 1993, 61 (05) : 1019 - 1045
