Cooperative Q-learning: the knowledge sharing issue

Cited by: 11
Authors
Ahmadabadi, MN [1]
Asadpour, M
Nakano, E
Affiliations
[1] Univ Tehran, Fac Engn, Dept Elect & Comp Engn, Robot & AI Lab, Tehran, Iran
[2] Inst Studies Theoret Phys & Math, Sch Intelligent Syst, Tehran, Iran
[3] Tohoku Univ, GSIS, Adv Robot Lab, Sendai, Miyagi 980, Japan
Keywords
learning; cooperation; expertness; knowledge sharing
DOI
10.1163/156855301317198142
Chinese Library Classification number
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
A group of cooperative, homogeneous Q-learning agents can work together to learn faster and gain more knowledge. To do so, each learner agent must be able to evaluate the expertness and intelligence level of the other agents, and to assess the knowledge and information it gets from them. In addition, the learner needs a suitable method to combine its own knowledge with what it gains from the other agents according to their relative expertness. In this paper, some expertness measuring criteria are introduced. Also, a new cooperative learning method called weighted strategy sharing (WSS) is introduced. In WSS, each agent assigns a weight to its teammates' knowledge based on their expertness and uses that knowledge accordingly. WSS and the expertness criteria are tested on two simulated systems: hunter-prey and object-pushing.
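The abstract does not give the WSS update rule, so the following is only a minimal sketch of the general idea of expertness-weighted knowledge combination. It assumes that each agent's knowledge is a Q-table, that expertness is a non-negative scalar per agent, and that weights are simply proportional to expertness. The function name `combine_q_tables` and the example values are hypothetical and are not taken from the paper.

```python
def combine_q_tables(q_tables, expertness):
    """Blend several Q-tables into one, weighting each by its agent's expertness.

    Assumption (not from the paper): weight_i = expertness_i / sum(expertness).
    q_tables is a list of tables, each indexed as table[state][action].
    """
    total = sum(expertness)
    if total <= 0:
        # No usable expertness signal: fall back to equal weights.
        weights = [1.0 / len(q_tables)] * len(q_tables)
    else:
        weights = [e / total for e in expertness]

    n_states = len(q_tables[0])
    n_actions = len(q_tables[0][0])
    return [
        [
            sum(w * q[s][a] for w, q in zip(weights, q_tables))
            for a in range(n_actions)
        ]
        for s in range(n_states)
    ]


# Hypothetical example: two agents, one state, two actions.
q_agent_a = [[1.0, 0.0]]
q_agent_b = [[0.0, 2.0]]
combined = combine_q_tables([q_agent_a, q_agent_b], expertness=[3.0, 1.0])
print(combined)
```

With these illustrative values, the more expert agent contributes three quarters of the blended estimate. The paper's own expertness criteria and weighting scheme may differ from this proportional rule.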
Pages: 815-832
Number of pages: 18