Mobile robot navigation based on intrinsic reward mechanism with TD3 algorithm

被引:0
|
作者
Yang, Jianan [1 ]
Liu, Yu [1 ]
Zhang, Jie [2 ]
Guan, Yong [1 ]
Shao, Zhenzhou [1 ]
机构
[1] Capital Normal Univ, Coll Informat Engn, 105 West Third Ring Rd North, Beijing 100048, Peoples R China
[2] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Mobile robots; deep reinforcement learning; intrinsic reward; curiosity; random enhancement;
D O I
10.1177/17298806241292893
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep reinforcement learning methods have been applied to mobile robot navigation to find the optimal path to the target. The rewards are usually given when the task is completed, which may lead to the local optima during the training procedure. It seriously affects the training efficiency and navigation performance of the mobile robot. To this end, this paper proposes an intrinsic reward mechanism with intrinsic curiosity module and randomness enhanced module, combining the TD3 (twin-delayed deep deterministic policy gradient) reinforcement learning algorithm for mobile robot navigation. It effectively resolves the issue of slow convergence caused by sparse rewards in continuous action spaces. It also encourages mobile robots to explore unknown areas and reduces the occurrence of local optima. The experimental results show that the proposed navigation method significantly improves the training efficiency of mobile robots. Out of 1000 test episodes, only 3 exceeded the maximum step limit. This approach significantly reduces the occurrence of local optima. Furthermore, it increases the success rate to an impressive 83.5%, outperforms the existing navigation methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] On a Senor Based Navigation for a Mobile Robot
    Noborio, Hiroshi
    Journal of Robotics and Mechatronics, 1996, 8 (01): : 2 - 14
  • [42] Genetically optimized TD3 algorithm for efficient access control in the internet of vehicles
    Al-Atawi, Abdullah A.
    WIRELESS NETWORKS, 2024, 30 (09) : 7581 - 7601
  • [43] Navigation Simulation of a Mecanum Wheel Mobile Robot Based on an Improved A* Algorithm in Unity3D
    Li, Yunwang
    Dai, Sumei
    Shi, Yong
    Zhao, Lala
    Ding, Minghua
    SENSORS, 2019, 19 (13)
  • [44] Mobile Robot Navigation in Unknown Dynamic Environment Based on Ant Colony Algorithm
    Zeng Bi
    Yang Yimin
    Xu Yisan
    PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009, : 98 - 102
  • [45] Development of laser rangefinder-based SLAM algorithm for mobile robot navigation
    Misono, Yusuke
    Goto, Yoshitaka
    Tarutoko, Yuki
    Kobayashi, Kazuyuki
    Watanabe, Kajiro
    PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8, 2007, : 389 - 393
  • [46] Algorithm for Autonomous Navigation of Mobile Robot Measurements Based on Beidou/Laser Radar
    Li, Dan
    He, Guotian
    Wu, Canfeng
    Wang, Tianmiao
    2017 2ND ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS (ACIRS), 2017, : 305 - 309
  • [47] An RSSI-Based Navigation Algorithm for a Mobile Robot in Wireless Sensor Networks
    de Carvalho, Antonio R., Jr.
    Ribas, Afonso D.
    da Camara Neto, Vilar F.
    Nakamura, Eduardo Freire
    Figueiredo, Carlos Mauricio
    37TH ANNUAL IEEE CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2012), 2012, : 308 - 311
  • [48] Research on application of genetic algorithm for intelligent mobile robot navigation based on dynamic
    Yang, Shiqiang
    Fu, Weiping
    Li, Dexin
    Wang, Wen
    2007 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS, VOLS 1-6, 2007, : 898 - 902
  • [49] Mobile robot navigation based on turning point algorithm and sliding mode controller
    Hassani, Imen
    Maalej, Imen
    Rekik, Chokri
    2018 15TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS AND DEVICES (SSD), 2018, : 1380 - 1385
  • [50] Aε*-DFS:: an algorithm for minimizing search effort in sensor based mobile robot navigation
    Shmoulian, L
    Rimon, E
    1998 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-4, 1998, : 356 - 362