Deep Deterministic Policy Gradient for Navigation of Mobile Robots

被引:10
|
作者
de Jesus, Junior Costa [1 ]
Bottega, Jair Augusto [2 ]
de Souza Leite Cuadros, Marco Antonio [3 ]
Tello Gamarra, Daniel Fernando [4 ]
机构
[1] Fed Univ Rio Grande, Rio Grande, RS, Brazil
[2] Univ Fed Santa Maria, Santa Maria, RS, Brazil
[3] Fed Inst Espirito Santo, Serra, ES, Brazil
[4] Univ Fed Santa Maria, Proc Dept Elect, Santa Maria, RS, Brazil
关键词
Deep Deterministic Policy Gradient; Deep Reinforcement Learning; Navigation for Mobile Robots;
D O I
10.3233/JIFS-191711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article describes the use of the Deep Deterministic Policy Gradient network, a deep reinforcement learning algorithm, for mobile robot navigation. The neural network structure has as inputs laser range findings, angular and linear velocities of the robot, and position and orientation of the mobile robot with respect to a goal position. The outputs of the network will be the angular and linear velocities used as control signals for the robot. The experiments demonstrated that deep reinforcement learning's techniques that uses continuous actions, are efficient for decision-making in a mobile robot. Nevertheless, the design of the reward functions constitutes an important issue in the performance of deep reinforcement learning algorithms. In order to show the performance of the Deep Reinforcement Learning algorithm, we have applied successfully the proposed architecture in simulated environments and in experiments with a real robot.
引用
收藏
页码:349 / 361
页数:13
相关论文
共 50 条
  • [41] Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms
    Zhang, Haifei
    Xu, Jian
    Zhang, Jian
    Liu, Quan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [42] NETWORK ARCHITECTURE REASONING VIA DEEP DETERMINISTIC POLICY GRADIENT
    Liu, Huidong
    Du, Fang
    Tang, Xiaofen
    Liu, Hao
    Yu, Zhenhua
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [43] A Method of Attitude Control Based on Deep Deterministic Policy Gradient
    Zhang, Jian
    Wu, Fengge
    Zhao, Junsuo
    Xu, Fanjiang
    COGNITIVE SYSTEMS AND SIGNAL PROCESSING, PT II, 2019, 1006 : 197 - 207
  • [44] Dynamical Motor Control Learned with Deep Deterministic Policy Gradient
    Shi, Haibo
    Sun, Yaoru
    Li, Jie
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
  • [45] BYZANTINE-ROBUST FEDERATED DEEP DETERMINISTIC POLICY GRADIENT
    Lin, Qifeng
    Ling, Qing
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4013 - 4017
  • [46] Target tracking strategy using deep deterministic policy gradient
    You, Shixun
    Diao, Ming
    Gao, Lipeng
    Zhang, Fulong
    Wang, Huan
    APPLIED SOFT COMPUTING, 2020, 95
  • [47] Duplicated Replay Buffer for Asynchronous Deep Deterministic Policy Gradient
    Motehayeri, Seyed Mohammad Seyed
    Baghi, Vahid
    Miandoab, Ehsan Maani
    Moeini, Ali
    2021 26TH INTERNATIONAL COMPUTER CONFERENCE, COMPUTER SOCIETY OF IRAN (CSICC), 2021,
  • [48] Optimal Trade Execution Based on Deep Deterministic Policy Gradient
    Ye, Zekun
    Deng, Weijie
    Zhou, Shuigeng
    Xu, Yi
    Guan, Jihong
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 638 - 654
  • [49] Compensation Control of UAV Based on Deep Deterministic Policy Gradient
    Xu, Zijun
    Qi, Juntong
    Wang, Mingming
    Wu, Chong
    Yang, Guang
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2289 - 2296
  • [50] Deep Deterministic Policy Gradient Artificial Intelligence for Radar Applications
    Reininger, Taylor J.
    Smith, Graeme E.
    2022 IEEE RADAR CONFERENCE (RADARCONF'22), 2022,