Reinforcement distribution in a team of cooperative Q-learning agents

被引：4

作者：

Abbasi, Zahra ^{[1
]}

Abbasi, Mohammad Ali ^{[2
]}

机构：

[1] Islamic Azad Univ, Parand Branch, Tehran, Iran

[2] Univ Tehran, Fac Engn, Dept Elect & Comp Engn, Tehran 14174, Iran

来源：

PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING | 2008年

关键词：

agent learning; evolution; and adaptation; multiagent systems; cooperative distributed problem solving; coordination; cooperation; and teamwork; multiagent learning;

D O I：

10.1109/SNPD.2008.154

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In a Q-learning multi-agent group, agents cooperate each other to perform their assigned task during their learning for increasing the team performance. If the role of each agent clearly specified-which is a very hard task for a supervisor agent- the team will learn more efficiently. Indeed, in this cage each agent reinforced according to its real effect on the team Performance. Assuming an identical role for all agents is the most prevalent technique of current researchers to escape the modeling complexities. But we believe this is not the optimum method for reinforcement distribution. The main goal of this research is to find an indirect evaluation method which evaluates the role of each agent in the team and distributes the reinforcement signal accordingly. The expertness of each agent is used as a criterion to estimate the effect of each agent's action on the team performance. Random and equal reinforcement signal distribution methods are also used in order to evaluate expertness-based reinforcement sharing. In addition, a new test bed, called EPIDEM, is developed to evaluate the proposed methods. The results show, the distribution of the reinforcement signals based on the proposed method improves the team learning speed.

引用

页码：154 / +

页数：3

共 50 条

[41] Multi-goal Q-learning of cooperative teams
Li, Jing
Sheng, Zhaohan
Ng, KwanChew
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) : 1565 - 1574
[42] Distributed lazy Q-learning for cooperative mobile robots
Touzet, Claude F.
International Journal of Advanced Robotic Systems, 2004, 1 (01) : 5 - 13
[43] Q-Learning for Content Placement in Wireless Cooperative Caching
Yang, Zhong
Liu, Yuanwei
Chen, Yue
2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
[44] Safe Q-Learning Approaches for Human-in-Loop Reinforcement Learning
Veerabathraswamy, Swathi
Bhatt, Nirav
2023 NINTH INDIAN CONTROL CONFERENCE, ICC, 2023, : 16 - 21
[45] Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics
Weissenbacher, Matthias
Sinha, Samarth
Garg, Animesh
Kawahara, Yoshinobu
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[46] Comparing NARS and Reinforcement Learning: An Analysis of ONA and Q-Learning Algorithms
Beikmohammadi, Ali
Magnusson, Sindri
ARTIFICIAL GENERAL INTELLIGENCE, AGI 2023, 2023, 13921 : 21 - 31
[47] Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
Wilson, Callum
Riccardi, Annalisa
OPTIMIZATION AND ENGINEERING, 2023, 24 (01) : 223 - 255
[48] Autonomous Driving in Roundabout Maneuvers Using Reinforcement Learning with Q-Learning
Garcia Cuenca, Laura
Puertas, Enrique
Fernandez Andres, Javier
Aliane, Nourdine
ELECTRONICS, 2019, 8 (12)
[49] Multi-Agent Reinforcement Learning - An Exploration Using Q-Learning
Graham, Caoimhin
Bell, David
Luo, Zhihui
RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 293 - 298
[50] An inverse reinforcement learning framework with the Q-learning mechanism for the metaheuristic algorithm
Zhao, Fuqing
Wang, Qiaoyun
Wang, Ling
KNOWLEDGE-BASED SYSTEMS, 2023, 265

← 1 2 3 4 5 →