Reinforcement distribution in a team of cooperative Q-learning agents

被引：4

作者：

Abbasi, Zahra ^{[1
]}

Abbasi, Mohammad Ali ^{[2
]}

机构：

[1] Islamic Azad Univ, Parand Branch, Tehran, Iran

[2] Univ Tehran, Fac Engn, Dept Elect & Comp Engn, Tehran 14174, Iran

来源：

PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING | 2008年

关键词：

agent learning; evolution; and adaptation; multiagent systems; cooperative distributed problem solving; coordination; cooperation; and teamwork; multiagent learning;

D O I：

10.1109/SNPD.2008.154

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In a Q-learning multi-agent group, agents cooperate each other to perform their assigned task during their learning for increasing the team performance. If the role of each agent clearly specified-which is a very hard task for a supervisor agent- the team will learn more efficiently. Indeed, in this cage each agent reinforced according to its real effect on the team Performance. Assuming an identical role for all agents is the most prevalent technique of current researchers to escape the modeling complexities. But we believe this is not the optimum method for reinforcement distribution. The main goal of this research is to find an indirect evaluation method which evaluates the role of each agent in the team and distributes the reinforcement signal accordingly. The expertness of each agent is used as a criterion to estimate the effect of each agent's action on the team performance. Random and equal reinforcement signal distribution methods are also used in order to evaluate expertness-based reinforcement sharing. In addition, a new test bed, called EPIDEM, is developed to evaluate the proposed methods. The results show, the distribution of the reinforcement signals based on the proposed method improves the team learning speed.

引用

页码：154 / +

页数：3

共 50 条

[21] Adaptable Conservative Q-Learning for Offline Reinforcement Learning
Qiu, Lyn
Li, Xu
Liang, Lenghan
Sun, Mingming
Yan, Junchi
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 200 - 212
[22] Cooperative Q-learning: the knowledge sharing issue
Ahmadabadi, MN
Asadpour, M
Nakano, E
ADVANCED ROBOTICS, 2001, 15 (08) : 815 - 832
[23] Cooperative Q-Learning Based on Maturity of the Policy
Yang, Mao
Tian, Yantao
Liu, Xiaomei
2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 1352 - 1356
[24] Two-Step Deep Reinforcement Q-Learning based Relay Selection in Cooperative WPCNs
Tolebi, Gulnur
Tsiftsis, Theodoros A.
Nauryzbayev, Galymzhan
2023 INTERNATIONAL BALKAN CONFERENCE ON COMMUNICATIONS AND NETWORKING, BALKANCOM, 2023,
[25] A task distribution based Q-learning algorithm for multi- agent team coordination
Sun, Qiao, 1600, Transport and Telecommunication Institute, Lomonosova street 1, Riga, LV-1019, Latvia (18):
[26] An Efficient Hardware Implementation of Reinforcement Learning: The Q-Learning Algorithm
Spano, Sergio
Cardarilli, Gian Carlo
Di Nunzio, Luca
Fazzolari, Rocco
Giardino, Daniele
Matta, Marco
Nannarelli, Alberto
Re, Marco
IEEE ACCESS, 2019, 7 : 186340 - 186351
[27] Q-learning based Reinforcement Learning Approach for Lane Keeping
Feher, Arpad
Aradi, Szilard
Becsi, Tamas
2018 18TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI), 2018, : 31 - 35
[28] Parallel Implementation of Reinforcement Learning Q-Learning Technique for FPGA
Da Silva, Lucileide M. D.
Torquato, Matheus F.
Fernandes, Marcelo A. C.
IEEE ACCESS, 2019, 7 : 2782 - 2798
[29] Concurrent Q-learning: Reinforcement learning for dynamic goals and environments
Ollington, RB
Vamplew, PW
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2005, 20 (10) : 1037 - 1052
[30] Swarm Reinforcement Learning Method Based on Hierarchical Q-Learning
Kuroe, Yasuaki
Takeuchi, Kenya
Maeda, Yutaka
2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,

← 1 2 3 4 5 →