Learning sequences of compatible actions among agents

被引:4
|
作者
Polat, F [1 ]
Abul, O [1 ]
机构
[1] Middle E Tech Univ, Dept Comp Engn, TR-06531 Ankara, Turkey
关键词
bucket brigade learning; multiagent learning; multiagent systems; Q-learning; reinforcement learning;
D O I
10.1023/A:1015009422110
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action coordination in multiagent systems is a difficult task especially in dynamic environments. If the environment possesses cooperation, least communication, incompatibility and local information constraints, the task becomes even more difficult. Learning compatible action sequences to achieve a designated goal under these constraints is studied in this work. Two new multiagent learning algorithms called QACE and NoCommQACE are developed. To improve the performance of the QACE and NoCommQACE algorithms four heuristics, state iteration, means-ends analysis, decreasing reward and do-nothing, are developed. The proposed algorithms are tested on the blocks world domain and the performance results are reported.
引用
收藏
页码:21 / 37
页数:17
相关论文
共 50 条
  • [21] Evolving cooperative actions among heterogeneous agents by an evolutionary programming method
    Fujinaga, T
    Moriwaki, K
    Inuzuka, N
    Itoh, H
    SIMULATED EVOLUTION AND LEARNING, 1999, 1585 : 231 - 239
  • [22] Q-Learning of Spatial Actions for Hierarchical Planner of Cognitive Agents
    Kiselev, Gleb
    Panov, Aleksandr
    INTERACTIVE COLLABORATIVE ROBOTICS, ICR 2020, 2020, 12336 : 160 - 169
  • [23] Applying Learning Analytics to Detect Sequences of Actions and Common Errors in a Geometry Game
    Gomez, Manuel J.
    Ruiperez-Valiente, Jose A.
    Martinez, Pedro A.
    Kim, Yoon Jeon
    SENSORS, 2021, 21 (04) : 1 - 16
  • [24] Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences
    VidalMata, Rosaura G.
    Scheirer, Walter J.
    Kukleva, Anna
    Cox, David
    Kuehne, Hilde
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1237 - 1246
  • [25] Representation Learning of Logic Words by an RNN: From Word Sequences to Robot Actions
    Yamada, Tatsuro
    Murata, Shingo
    Arie, Hiroaki
    Ogata, Tetsuya
    FRONTIERS IN NEUROROBOTICS, 2017, 11
  • [26] Graded group actions and generalized H-actions compatible with gradings
    Gordienko, A. S.
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2024, 682 : 96 - 121
  • [27] Graded group actions and generalized H-actions compatible with gradings
    Gordienko, A.S.
    Linear Algebra and Its Applications, 2024, 682 : 96 - 121
  • [28] Learning initial trust among interacting agents
    Rettinger, Achim
    Nickles, Matthias
    Tresp, Volker
    COOPERATIVE INFORMATION AGENTS XI, PROCEEDINGS, 2007, 4676 : 313 - +
  • [29] On the average complexity for the verification of compatible sequences
    Koukouvinos, Christos
    Pillwein, Veronika
    Simos, Dimitris E.
    Zafeirakopoulos, Zafeirakis
    INFORMATION PROCESSING LETTERS, 2011, 111 (17) : 825 - 830
  • [30] Compatible sequences and a slow Winkler percolation
    Gács, P
    COMBINATORICS PROBABILITY & COMPUTING, 2004, 13 (06): : 815 - 856