Reinforcement distribution in a team of cooperative Q-learning agents

被引:4
|
作者
Abbasi, Zahra [1 ]
Abbasi, Mohammad Ali [2 ]
机构
[1] Islamic Azad Univ, Parand Branch, Tehran, Iran
[2] Univ Tehran, Fac Engn, Dept Elect & Comp Engn, Tehran 14174, Iran
关键词
agent learning; evolution; and adaptation; multiagent systems; cooperative distributed problem solving; coordination; cooperation; and teamwork; multiagent learning;
D O I
10.1109/SNPD.2008.154
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a Q-learning multi-agent group, agents cooperate each other to perform their assigned task during their learning for increasing the team performance. If the role of each agent clearly specified-which is a very hard task for a supervisor agent- the team will learn more efficiently. Indeed, in this cage each agent reinforced according to its real effect on the team Performance. Assuming an identical role for all agents is the most prevalent technique of current researchers to escape the modeling complexities. But we believe this is not the optimum method for reinforcement distribution. The main goal of this research is to find an indirect evaluation method which evaluates the role of each agent in the team and distributes the reinforcement signal accordingly. The expertness of each agent is used as a criterion to estimate the effect of each agent's action on the team performance. Random and equal reinforcement signal distribution methods are also used in order to evaluate expertness-based reinforcement sharing. In addition, a new test bed, called EPIDEM, is developed to evaluate the proposed methods. The results show, the distribution of the reinforcement signals based on the proposed method improves the team learning speed.
引用
收藏
页码:154 / +
页数:3
相关论文
共 50 条
  • [41] Multi-goal Q-learning of cooperative teams
    Li, Jing
    Sheng, Zhaohan
    Ng, KwanChew
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) : 1565 - 1574
  • [42] Distributed lazy Q-learning for cooperative mobile robots
    Touzet, Claude F.
    International Journal of Advanced Robotic Systems, 2004, 1 (01) : 5 - 13
  • [43] Q-Learning for Content Placement in Wireless Cooperative Caching
    Yang, Zhong
    Liu, Yuanwei
    Chen, Yue
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [44] Safe Q-Learning Approaches for Human-in-Loop Reinforcement Learning
    Veerabathraswamy, Swathi
    Bhatt, Nirav
    2023 NINTH INDIAN CONTROL CONFERENCE, ICC, 2023, : 16 - 21
  • [45] Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics
    Weissenbacher, Matthias
    Sinha, Samarth
    Garg, Animesh
    Kawahara, Yoshinobu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [46] Comparing NARS and Reinforcement Learning: An Analysis of ONA and Q-Learning Algorithms
    Beikmohammadi, Ali
    Magnusson, Sindri
    ARTIFICIAL GENERAL INTELLIGENCE, AGI 2023, 2023, 13921 : 21 - 31
  • [47] Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
    Wilson, Callum
    Riccardi, Annalisa
    OPTIMIZATION AND ENGINEERING, 2023, 24 (01) : 223 - 255
  • [48] Autonomous Driving in Roundabout Maneuvers Using Reinforcement Learning with Q-Learning
    Garcia Cuenca, Laura
    Puertas, Enrique
    Fernandez Andres, Javier
    Aliane, Nourdine
    ELECTRONICS, 2019, 8 (12)
  • [49] Multi-Agent Reinforcement Learning - An Exploration Using Q-Learning
    Graham, Caoimhin
    Bell, David
    Luo, Zhihui
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 293 - 298
  • [50] An inverse reinforcement learning framework with the Q-learning mechanism for the metaheuristic algorithm
    Zhao, Fuqing
    Wang, Qiaoyun
    Wang, Ling
    KNOWLEDGE-BASED SYSTEMS, 2023, 265