Learning sequences of compatible actions among agents

Cited by: 4
Authors
Polat, F [1]
Abul, O [1]
Affiliations
[1] Middle East Technical University, Department of Computer Engineering, TR-06531 Ankara, Turkey
Keywords
bucket brigade learning; multiagent learning; multiagent systems; Q-learning; reinforcement learning
DOI
10.1023/A:1015009422110
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Action coordination in multiagent systems is a difficult task, especially in dynamic environments. If the environment possesses cooperation, least communication, incompatibility and local information constraints, the task becomes even more difficult. Learning compatible action sequences to achieve a designated goal under these constraints is studied in this work. Two new multiagent learning algorithms called QACE and NoCommQACE are developed. To improve the performance of the QACE and NoCommQACE algorithms, four heuristics (state iteration, means-ends analysis, decreasing reward and do-nothing) are developed. The proposed algorithms are tested on the blocks world domain, and the performance results are reported.
Pages: 21 - 37
Page count: 17
Related papers
50 in total
  • [1] Learning Sequences of Compatible Actions Among Agents
    Faruk Polat
    Osman Abul
    Artificial Intelligence Review, 2002, 17 : 21 - 37
  • [2] Learning Relational Grammars from Sequences of Actions
    Vargas-Govea, Blanca
    Morales, Eduardo F.
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 892 - 900
  • [3] Learning communicative actions of conflicting human agents
    Galitsky, Boris A.
    Kuznetsov, Sergei O.
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2008, 20 (04) : 277 - 317
  • [4] Learning Semantic Role Labeling from Compatible Label Sequences
    Li, Tao
    Kazeminejad, Ghazaleh
    Brown, Susan W.
    Palmer, Martha
    Srikumar, Vivek
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023 : 15561 - 15572
  • [5] AGENTS COOPERATIVE ACTIONS GENERATING OPTIMAL ROBOT HAND MOTION SEQUENCES
    IMAMURA, S
    SAKAKIBARA, H
    HORIE, Y
    ENOMOTO, S
    JOURNAL OF MECHANICAL ENGINEERING LABORATORY, 1993, 47 (06) : 227 - 236
  • [6] Agents' cooperative actions generating optimal robot hand motion sequences
    Imamura, Satoshi
    Sakakibara, Hisashi
    Horie, Yumiko
    Enomoto, Susumu
    Kikai Gijutsu Kenkyusho Shoho/Journal of Mechanical Engineering Laboratory, 1993, 47 (06) : 227 - 236
  • [7] Increasing the efficiency of cooperation among agents by sharing actions
    Iwata, K
    Miyazaki, M
    Ito, N
    Ishii, N
    SOFTWARE ENGINEERING RESEARCH AND APPLICATIONS, 2004, 3026 : 279 - 289
  • [8] Composing Synergistic Macro Actions for Reinforcement Learning Agents
    Chen, Yu-Ming
    Chang, Kaun-Yu
    Liu, Chien
    Hsiao, Tsu-Ching
    Hong, Zhang-Wei
    Lee, Chun-Yi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) : 7251 - 7258
  • [9] From movements to actions: Two mechanisms for learning action sequences
    Endress, Ansgar D.
    Wood, Justin N.
    COGNITIVE PSYCHOLOGY, 2011, 63 (03) : 141 - 171
  • [10] Are People Successful at Learning Sequences of Actions on a Perceptual Matching Task?
    Yakushijin, Reiko
    Jacobs, Robert A.
    COGNITIVE SCIENCE, 2011, 35 (05) : 939 - 962