Regional Cooperative Multi-agent Q-learning Based on Potential Field

Cited by: 3
Authors
Liu, Liang [1]
Li, Longshu [1]
Affiliations
[1] Anhui Univ, Key Lab Intelligent Comp & Signal Proc, Hefei 230039, Peoples R China
DOI
10.1109/ICNC.2008.173
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A growing number of artificial intelligence researchers are focusing on reinforcement learning (RL)-based multi-agent systems (MAS). Multi-agent learning problems can in principle be solved by treating the joint actions of the agents as single actions and applying single-agent Q-learning. However, the number of joint actions is exponential in the number of agents, rendering this approach infeasible for most problems. In this paper we investigate a regional cooperative representation of the Q-function, based on a potential field, that considers joint actions only in those states in which coordination is actually required. In all other states single-agent Q-learning is applied. This offers a compact state-action value representation without compromising much in terms of solution quality. We performed experiments in the RoboCup 2D simulation, which is an ideal testing platform for multi-agent systems, and compared our algorithm with other multi-agent reinforcement learning algorithms, with promising results.
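The split described in the abstract (joint-action values only where coordination is required, single-agent Q-learning everywhere else) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The abstract does not say how the potential field marks coordinated states, so the `needs_coordination` predicate is a hypothetical stand-in for it. The action set, the learning rate and discount, and the equal split of a joint value among agents are likewise assumptions made for illustration.

```python
import itertools
from collections import defaultdict

ACTIONS = ("left", "right", "stay")  # hypothetical per-agent action set
N_AGENTS = 2
ALPHA, GAMMA = 0.5, 0.9              # illustrative learning rate and discount


def joint_action_count(n_agents, n_actions):
    """Joint actions grow exponentially: n_actions ** n_agents."""
    return n_actions ** n_agents


class RegionalQ:
    """Joint Q-table in coordinated states, per-agent Q-tables elsewhere."""

    def __init__(self, needs_coordination):
        # Hypothetical predicate standing in for the potential-field test.
        self.needs_coordination = needs_coordination
        self.q_joint = defaultdict(float)  # (state, joint_action) -> value
        self.q_single = [defaultdict(float) for _ in range(N_AGENTS)]

    def agent_value(self, state, i):
        """Greedy value of `state` from agent i's point of view."""
        if self.needs_coordination(state):
            best = max(
                self.q_joint[(state, ja)]
                for ja in itertools.product(ACTIONS, repeat=N_AGENTS)
            )
            return best / N_AGENTS  # assumption: equal share of joint value
        return max(self.q_single[i][(state, a)] for a in ACTIONS)

    def update(self, state, joint_action, rewards, next_state):
        """One Q-learning step; `rewards` holds one reward per agent."""
        if self.needs_coordination(state):
            # Coordinated state: a single update on the joint action.
            target = sum(rewards) + GAMMA * sum(
                self.agent_value(next_state, i) for i in range(N_AGENTS)
            )
            key = (state, joint_action)
            self.q_joint[key] += ALPHA * (target - self.q_joint[key])
        else:
            # Uncoordinated state: each agent learns independently.
            for i in range(N_AGENTS):
                target = rewards[i] + GAMMA * self.agent_value(next_state, i)
                key = (state, joint_action[i])
                self.q_single[i][key] += ALPHA * (target - self.q_single[i][key])
```

With three actions per agent, `joint_action_count(11, 3)` already gives 177147 joint actions for an eleven-player team, which is why the joint table is kept only for the states flagged by the predicate.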
Pages: 535-539
Page count: 5