Learning sequences of compatible actions among agents

Cited by: 4

Authors:

Polat, F [1]

Abul, O [1]

Affiliations:

[1] Middle E Tech Univ, Dept Comp Engn, TR-06531 Ankara, Turkey

Source:

ARTIFICIAL INTELLIGENCE REVIEW | 2002 / Vol. 17 / No. 01

Keywords:

bucket brigade learning; multiagent learning; multiagent systems; Q-learning; reinforcement learning;

DOI:

10.1023/A:1015009422110

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Action coordination in multiagent systems is a difficult task, especially in dynamic environments. If the environment possesses cooperation, least communication, incompatibility and local information constraints, the task becomes even more difficult. Learning compatible action sequences to achieve a designated goal under these constraints is studied in this work. Two new multiagent learning algorithms called QACE and NoCommQACE are developed. To improve the performance of the QACE and NoCommQACE algorithms, four heuristics (state iteration, means-ends analysis, decreasing reward and do-nothing) are developed. The proposed algorithms are tested on the blocks world domain and the performance results are reported.

Pages: 21 - 37

Page count: 17

50 items in total

[1] Learning Sequences of Compatible Actions Among Agents
Faruk Polat
Osman Abul
Artificial Intelligence Review, 2002, 17 : 21 - 37
[2] Learning Relational Grammars from Sequences of Actions
Vargas-Govea, Blanca
Morales, Eduardo F.
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 892 - 900
[3] Learning communicative actions of conflicting human agents
Galitsky, Boris A.
Kuznetsov, Sergei O.
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2008, 20 (04) : 277 - 317
[4] Learning Semantic Role Labeling from Compatible Label Sequences
Li, Tao
Kazeminejad, Ghazaleh
Brown, Susan W.
Palmer, Martha
Srikumar, Vivek
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 15561 - 15572
[5] AGENTS COOPERATIVE ACTIONS GENERATING OPTIMAL ROBOT HAND MOTION SEQUENCES
IMAMURA, S
SAKAKIBARA, H
HORIE, Y
ENOMOTO, S
JOURNAL OF MECHANICAL ENGINEERING LABORATORY, 1993, 47 (06) : 227 - 236
[6] Agents' cooperative actions generating optimal robot hand motion sequences
Imamura, Satoshi
Sakakibara, Hisashi
Horie, Yumiko
Enomoto, Susumu
Kikai Gijutsu Kenkyusho Shoho/Journal of Mechanical Engineering Laboratory, 1993, 47 (06) : 227 - 236
[7] Increasing the efficiency of cooperation among agents by sharing actions
Iwata, K
Miyazaki, M
Ito, N
Ishii, N
SOFTWARE ENGINEERING RESEARCH AND APPLICATIONS, 2004, 3026 : 279 - 289
[8] Composing Synergistic Macro Actions for Reinforcement Learning Agents
Chen, Yu-Ming
Chang, Kaun-Yu
Liu, Chien
Hsiao, Tsu-Ching
Hong, Zhang-Wei
Lee, Chun-Yi
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) : 7251 - 7258
[9] From movements to actions: Two mechanisms for learning action sequences
Endress, Ansgar D.
Wood, Justin N.
COGNITIVE PSYCHOLOGY, 2011, 63 (03) : 141 - 171
[10] Are People Successful at Learning Sequences of Actions on a Perceptual Matching Task?
Yakushijin, Reiko
Jacobs, Robert A.
COGNITIVE SCIENCE, 2011, 35 (05) : 939 - 962
