Efficient exploration by switching agents according to degree of convergence of learning on Heterogeneous Multi-Agent Reinforcement Learning in Single Robot

被引:1
|
作者
Narita, Riku [1 ]
Matsushima, Tatsufumi [2 ]
Kurashige, Kentarou [1 ]
机构
[1] Muroran Inst Technol, Div Informat & Elect Engn, Muroran, Hokkaido, Japan
[2] Panasonic Its Co Ltd, Dev Ctr 1, Sect 1, Yokohama, Kanagawa, Japan
关键词
Reinforcement Learning; MARL; Explore;
D O I
10.1109/SSCI50451.2021.9659982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, a robot is required to perform autonomously in complex environment. Some researchers use reinforcement learning that learns actions autonomously according to environment. Reinforcement learning requires exploratory actions, but in conventional reinforcement learning it was random. Random exploratory actions are inefficient and takes a lot of time to learn. To prevent inefficient exploratory actions, we proposed a method that uses Heterogeneous Multi-Agent Reinforcement Learning system (HMARL) in previous research. HMARL enables efficient exploratory actions by using multiple agents with heterogeneous learning spaces. HMARL system is a system that performs exploratory actions using the learning of multiple agents. In addition, HMARL needs an index that autonomously selects an agent from among all the agents inside heterogeneous learning space. We propose a method to select an agent using the degree of convergence of the learning of the agents in HMARL based on the TD errors. As a result, efficient exploratory actions by multiple agents with different learning spaces was achieved. Then, experiment to compare the proposed method and the method of previous research was conducted. From experimental results, the usefulness of the proposed method has been demonstrated.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Heterogeneous Observation Aggregation Network for Multi-agent Reinforcement Learning
    Hu, Tianyi
    Ai, Xiaolin
    Pu, Zhiqiang
    Qiu, Tenghai
    Yi, Jianqiang
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [32] Learning to Share in Multi-Agent Reinforcement Learning
    Yi, Yuxuan
    Li, Ge
    Wang, Yaowei
    Lu, Zongqing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [33] Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
    Zhang, Kaiqing
    Yang, Zhuoran
    Liu, Han
    Zhang, Tong
    Basar, Tamer
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [34] Coordinated Reinforcement Learning Agents in a Multi-Agent Virtual Environment
    Sause, William
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1, 2013, : 227 - 230
  • [35] Social satisficing: Multi-agent reinforcement learning with satisficing agents
    Uragami, Daisuke
    Sonota, Noriaki
    Takahashi, Tatsuji
    BIOSYSTEMS, 2024, 243
  • [36] Coordination Between Individual Agents in Multi-Agent Reinforcement Learning
    Zhang, Yang
    Yang, Qingyu
    An, Dou
    Zhang, Chengwei
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11387 - 11394
  • [37] A Multi-Agent Reinforcement Learning Approach for Efficient Client Selection in Federated Learning
    Zhang, Sai Qian
    Lin, Jieyu
    Zhang, Qi
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9091 - 9099
  • [38] Determining the Applicability of Advice for Efficient Multi-Agent Reinforcement Learning
    Wang, Yuchen
    Ren, Fenghui
    Zhang, Minjie
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 343 - 351
  • [39] Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning
    Liu, Guanlin
    Lai, Lifeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [40] Efficient Communications in Multi-Agent Reinforcement Learning for Mobile Applications
    Lv, Zefang
    Xiao, Liang
    Du, Yousong
    Zhu, Yunjun
    Han, Shuai
    Liu, Yong-Jin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (09) : 12440 - 12454