Obstacle avoidance of multi mobile robots based on behavior decomposition reinforcement learning

被引:3
|
作者
Zu, Linan [1 ]
Yang, Peng [1 ]
Chen, Lingling [1 ]
Zhang, Xueping [1 ]
Tian, Yantao [2 ]
机构
[1] Hebei Univ Technol, Sch Elect Engn & Automat, Tianjin 300130, Peoples R China
[2] Jilin Univ, Coll Commun Engn, Changchun 130025, Peoples R China
基金
中国国家自然科学基金;
关键词
reinforcement learning; Q-learning; obstacle avoidance; behavior decomposition;
D O I
10.1109/ROBIO.2007.4522303
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A reinforcement learning method based on behavior decomposition was proposed for obstacle avoidance of multi mobile robots. It decomposed the complicated behaviors into a series of simple sub-behaviors which were learned independently. The learning structures, parameters and reinforcement functions of every behavior are designed. Then, the fusion for learning results of all behaviors was optimized by learning. This learning algorithm could reduce the status space and predigest the design of reinforcement functions so as to improve the learning speed and the veracity of learning results. Finally, this learning method was adopted to realize the self-adaptation action fusion of mobile robots in the task of obstacle avoidance. And its efficiency was validated by simulation results.
引用
收藏
页码:1018 / +
页数:2
相关论文
共 50 条
  • [31] A method for local obstacle avoidance of mobile robots
    Ko, NY
    Kim, SC
    Cho, HK
    MOBILE ROBOT TECHNOLOGY, PROCEEDINGS, 2001, : 215 - 220
  • [32] Dynamic obstacle avoidance method for mobile robots
    Zhang H.
    Miao C.
    Tang Y.
    Yan X.
    Shi Y.
    Yu Y.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2022, 48 (06): : 1013 - 1021
  • [33] Obstacle avoidance control of nonholonomic mobile robots
    Niu, Wenbin
    Wang, Chaoli
    Li, Qingsong
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2007, 14 : 1462 - 1466
  • [34] Target tracking and obstacle avoidance for mobile robots
    Chancharoen, R
    Sangveraphunsiri, V
    Navakulsirinart, T
    Thanawittayakorn, W
    Boonsanongsupa, W
    Meesaplak, A
    IEEE ICIT' 02: 2002 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS I AND II, PROCEEDINGS, 2002, : 13 - 17
  • [35] The Multi-Dimensional Actions Control Approach for Obstacle Avoidance Based on Reinforcement Learning
    Wu, Menghao
    Gao, Yanbin
    Wang, Pengfei
    Zhang, Fan
    Liu, Zhejun
    SYMMETRY-BASEL, 2021, 13 (08):
  • [36] Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning
    Ji X.
    Hai J.
    Luo W.
    Lin C.
    Xiong Y.
    Ou Z.
    Wen J.
    Journal of Shanghai Jiaotong University (Science), 2021, 26 (05) : 680 - 685
  • [37] Path Planning of Mobile Robot in Dynamic Obstacle Avoidance Environment Based on Deep Reinforcement Learning
    Zhang, Qingfeng
    Ma, Wenpeng
    Zheng, Qingchun
    Zhai, Xiaofan
    Zhang, Wenqian
    Zhang, Tianchang
    Wang, Shuo
    IEEE ACCESS, 2024, 12 : 189136 - 189152
  • [38] Cooperative behavior acquisition in multi mobile robots environment by reinforcement learning based on state vector estimation
    Uchibe, E
    Asada, M
    Hosoda, K
    1998 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-4, 1998, : 1558 - 1563
  • [39] Sliding mode based obstacle avoidance and target tracking for mobile robots
    Yannier, S
    Sabanovic, A
    Onat, A
    Bastan, A
    ISIE 2005: PROCEEDINGS OF THE IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS 2005, VOLS 1- 4, 2005, : 1489 - 1493
  • [40] Environmental modeling and obstacle avoidance of mobile robots based on laser radar
    Yang, Ming, 2000, Press of Tsinghua University, China (40):