Mean Field Multi-Agent Reinforcement Learning

Cited by: 0

Authors:

Yang, Yaodong [1]

Luo, Rui [1]

Li, Minne [1]

Zhou, Ming [2]

Zhang, Weinan [2]

Wang, Jun [1]

Affiliations:

[1] UCL, London, England

[2] Shanghai Jiao Tong University, Shanghai, People's Republic of China

Source:

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80 | 2018 / Volume 80

Keywords:

GAMES;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Existing multi-agent reinforcement learning methods are typically limited to a small number of agents. When the number of agents grows large, learning becomes intractable due to the curse of dimensionality and the exponential growth of agent interactions. In this paper, we present Mean Field Reinforcement Learning, where the interactions within the population of agents are approximated by those between a single agent and the average effect of the overall population or neighboring agents. The interplay between the two entities is mutually reinforced: the learning of the individual agent's optimal policy depends on the dynamics of the population, while the dynamics of the population change according to the collective patterns of the individual policies. We develop practical mean field Q-learning and mean field Actor-Critic algorithms and analyze the convergence of the solution to Nash equilibrium. Experiments on Gaussian squeeze, the Ising model, and battle games demonstrate the learning effectiveness of our mean field approaches. In addition, we report the first result to solve the Ising model via model-free reinforcement learning methods.


Pages: 10

