Reinforcement learning-driven dynamic obstacle avoidance for mobile robot trajectory tracking

Cited by: 4
Authors
Xiao, Hanzhen [1]
Chen, Canghao [1]
Zhang, Guidong [1]
Chen, C. L. Philip [2, 3]
Affiliations
[1] Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
[2] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[3] Pazhou Lab, Ctr Affect Comp & Gen Models, Guangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Reinforcement learning; Obstacle avoidance; Q-Learning; Trajectory tracking; Mobile robot; Navigation
DOI
10.1016/j.knosys.2024.111974
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we propose a trajectory tracking method based on optimized Q-Learning (QL) with real-time obstacle avoidance capability for controlling wheeled mobile robots in dynamic local environments. Based on the observation data and the state of the robot, the designed reinforcement learning (RL) method determines the obstacle avoidance action during trajectory tracking while using controllers to maintain action precision. A simple observation space data processing method (OSDPM) transforms the raw data from the onboard lidar into a dimensionality-reduced index vector that contains information about the robot's surroundings, which lets QL quickly map the robot's current observation state to the corresponding state in the Q-Table. To improve the iteration and decision efficiency of the RL method, we optimize the Q-Table structure based on the type of data used. Finally, simulation results verify the effectiveness of the OSDPM and the obstacle avoidance ability of the RL method in unknown local environments.
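The idea of mapping a lidar scan to a table state can be illustrated with a minimal sketch. This is not the paper's OSDPM or its optimized Q-Table, whose details are not given in the abstract. It assumes a hypothetical scheme in which the scan is split into equal angular sectors, the nearest return in each sector is binned into a distance level, and the resulting tuple keys a standard tabular Q-Learning update. The sector count, distance thresholds, action names, and learning parameters are all illustrative assumptions.

```python
import random
from collections import defaultdict

# All constants below are hypothetical choices, not values from the paper.
N_SECTORS = 4            # number of angular sectors the scan is split into
BIN_EDGES = (0.5, 1.5)   # distance thresholds in metres, giving 3 levels
ACTIONS = ("track", "turn_left", "turn_right")


def lidar_to_index_vector(ranges, n_sectors=N_SECTORS, bin_edges=BIN_EDGES):
    """Reduce a raw scan to one discrete distance level per sector."""
    if len(ranges) < n_sectors:
        raise ValueError("scan has fewer beams than sectors")
    size = len(ranges) // n_sectors
    levels = []
    for s in range(n_sectors):
        start = s * size
        # The last sector absorbs any leftover beams.
        end = len(ranges) if s == n_sectors - 1 else start + size
        nearest = min(ranges[start:end])
        # Level 0 = closest band, len(bin_edges) = farthest band.
        levels.append(sum(nearest >= edge for edge in bin_edges))
    return tuple(levels)


def make_q_table():
    """Q-table keyed by index vector; unseen states start at zero."""
    return defaultdict(lambda: [0.0] * len(ACTIONS))


def choose_action(q_table, state, epsilon=0.1, rng=random):
    """Epsilon-greedy selection; returns an index into ACTIONS."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    values = q_table[state]
    return max(range(len(ACTIONS)), key=values.__getitem__)


def q_update(q_table, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """Standard one-step Q-Learning update; returns the new Q-value."""
    td_target = reward + gamma * max(q_table[next_state])
    q_table[state][action] += alpha * (td_target - q_table[state][action])
    return q_table[state][action]
```

Because the index vector is a small tuple of integers, looking up the current state is a single dictionary access, which is one plausible reading of how a reduced observation can be matched quickly to a table state.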
Pages: 11
Related papers
50 in total
  • [21] Deep Reinforcement Learning of Map-Based Obstacle Avoidance for Mobile Robot Navigation
    Chen G.
    Pan L.
    Chen Y.
    Xu P.
    Wang Z.
    Wu P.
    Ji J.
    Chen X.
    SN Computer Science, 2021, 2 (6)
  • [22] Embedding Obstacle Avoidance to Trajectory Tracking for Unicycle Mobile Robots
    Resende, Cassius Z.
    Carelli, Ricardo
    Bastos-Filho, Teodiano F.
    Sarcinelli-Filho, Mario
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 2228 - 2233
  • [23] Trajectory Tracking and Optimal Obstacle Avoidance of Mobile Agent based on Data-driven Control
    Xing Shaomin
    Guan Xinping
    Luo Xiaoyuan
    PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 4619 - 4623
  • [24] Design and Implementation of the Trajectory Tracking and Dynamic Obstacle Avoidance of Wheeled Mobile Robot Based on T-S Fuzzy Model
    Lin, Hung-Yi
    Tsai, Shun-Hung
    Chen, Kuan-Yo
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2023, 25 (06) : 2423 - 2438
  • [25] A reinforcement learning approach to obstacle avoidance of mobile robots
    Macek, K
    Petrovic, I
    Peric, N
    7TH INTERNATIONAL WORKSHOP ON ADVANCED MOTION CONTROL, PROCEEDINGS, 2002, : 462 - 466
  • [26] Robust Tracking Control of Mobile Robot Formation with Obstacle Avoidance
    Yang, Tiantian
    Liu, Zhiyuan
    Chen, Hong
    Pei, Run
    JOURNAL OF CONTROL SCIENCE AND ENGINEERING, 2007, 2007
  • [27] Sound source tracking considering obstacle avoidance for a mobile robot
    Uchiyama, Naoki
    Sano, Shigenori
    Yamamoto, Akihiro
    ROBOTICA, 2010, 28 : 1057 - 1064
  • [28] Target Tracking and Obstacle Avoidance for Mobile Robot based on Kinect
    Li, Mengxin
    Yin, Jiadi
    PROCEEDINGS OF THE 2015 3RD INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND INFORMATION TECHNOLOGY APPLICATIONS, 2015, 35 : 1116 - 1120
  • [29] Motion Control and Trajectory Planning for Obstacle Avoidance of the Mobile Parallel Robot Driven by Three Tracked Vehicles
    Shentu, Shuzhan
    Xie, Fugui
    Liu, Xin-Jun
    Gong, Zhao
    ROBOTICA, 2021, 39 (06) : 1037 - 1050
  • [30] Trajectory generation for a mobile robot by reinforcement learning
    Shimizu, M
    Fujita, M
    Miyamoto, H
    PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON AUTONOMOUS MINIROBOTS FOR RESEARCH AND EDUTAINMENT (AMIRE 2005), 2006, : 119 - +