Learning sequences of compatible actions among agents

被引：4

作者：

Polat, F ^{[1
]}

Abul, O ^{[1
]}

机构：

[1] Middle E Tech Univ, Dept Comp Engn, TR-06531 Ankara, Turkey

来源：

ARTIFICIAL INTELLIGENCE REVIEW | 2002年 / 17卷 / 01期

关键词：

bucket brigade learning; multiagent learning; multiagent systems; Q-learning; reinforcement learning;

D O I：

10.1023/A:1015009422110

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Action coordination in multiagent systems is a difficult task especially in dynamic environments. If the environment possesses cooperation, least communication, incompatibility and local information constraints, the task becomes even more difficult. Learning compatible action sequences to achieve a designated goal under these constraints is studied in this work. Two new multiagent learning algorithms called QACE and NoCommQACE are developed. To improve the performance of the QACE and NoCommQACE algorithms four heuristics, state iteration, means-ends analysis, decreasing reward and do-nothing, are developed. The proposed algorithms are tested on the blocks world domain and the performance results are reported.

引用

页码：21 / 37

页数：17

共 50 条

[21] Evolving cooperative actions among heterogeneous agents by an evolutionary programming method
Fujinaga, T
Moriwaki, K
Inuzuka, N
Itoh, H
SIMULATED EVOLUTION AND LEARNING, 1999, 1585 : 231 - 239
[22] Q-Learning of Spatial Actions for Hierarchical Planner of Cognitive Agents
Kiselev, Gleb
Panov, Aleksandr
INTERACTIVE COLLABORATIVE ROBOTICS, ICR 2020, 2020, 12336 : 160 - 169
[23] Applying Learning Analytics to Detect Sequences of Actions and Common Errors in a Geometry Game
Gomez, Manuel J.
Ruiperez-Valiente, Jose A.
Martinez, Pedro A.
Kim, Yoon Jeon
SENSORS, 2021, 21 (04) : 1 - 16
[24] Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences
VidalMata, Rosaura G.
Scheirer, Walter J.
Kukleva, Anna
Cox, David
Kuehne, Hilde
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1237 - 1246
[25] Representation Learning of Logic Words by an RNN: From Word Sequences to Robot Actions
Yamada, Tatsuro
Murata, Shingo
Arie, Hiroaki
Ogata, Tetsuya
FRONTIERS IN NEUROROBOTICS, 2017, 11
[26] Graded group actions and generalized H-actions compatible with gradings
Gordienko, A. S.
LINEAR ALGEBRA AND ITS APPLICATIONS, 2024, 682 : 96 - 121
[27] Graded group actions and generalized H-actions compatible with gradings
Gordienko, A.S.
Linear Algebra and Its Applications, 2024, 682 : 96 - 121
[28] Learning initial trust among interacting agents
Rettinger, Achim
Nickles, Matthias
Tresp, Volker
COOPERATIVE INFORMATION AGENTS XI, PROCEEDINGS, 2007, 4676 : 313 - +
[29] On the average complexity for the verification of compatible sequences
Koukouvinos, Christos
Pillwein, Veronika
Simos, Dimitris E.
Zafeirakopoulos, Zafeirakis
INFORMATION PROCESSING LETTERS, 2011, 111 (17) : 825 - 830
[30] Compatible sequences and a slow Winkler percolation
Gács, P
COMBINATORICS PROBABILITY & COMPUTING, 2004, 13 (06): : 815 - 856

← 1 2 3 4 5 →