Deep Deterministic Policy Gradient for Navigation of Mobile Robots

被引：10

作者：

de Jesus, Junior Costa ^{[1
]}

Bottega, Jair Augusto ^{[2
]}

de Souza Leite Cuadros, Marco Antonio ^{[3
]}

Tello Gamarra, Daniel Fernando ^{[4
]}

机构：

[1] Fed Univ Rio Grande, Rio Grande, RS, Brazil

[2] Univ Fed Santa Maria, Santa Maria, RS, Brazil

[3] Fed Inst Espirito Santo, Serra, ES, Brazil

[4] Univ Fed Santa Maria, Proc Dept Elect, Santa Maria, RS, Brazil

来源：

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2021年 / 40卷 / 01期

关键词：

Deep Deterministic Policy Gradient; Deep Reinforcement Learning; Navigation for Mobile Robots;

D O I：

10.3233/JIFS-191711

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This article describes the use of the Deep Deterministic Policy Gradient network, a deep reinforcement learning algorithm, for mobile robot navigation. The neural network structure has as inputs laser range findings, angular and linear velocities of the robot, and position and orientation of the mobile robot with respect to a goal position. The outputs of the network will be the angular and linear velocities used as control signals for the robot. The experiments demonstrated that deep reinforcement learning's techniques that uses continuous actions, are efficient for decision-making in a mobile robot. Nevertheless, the design of the reward functions constitutes an important issue in the performance of deep reinforcement learning algorithms. In order to show the performance of the Deep Reinforcement Learning algorithm, we have applied successfully the proposed architecture in simulated environments and in experiments with a real robot.

引用

页码：349 / 361

页数：13

共 50 条

[41] Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms
Zhang, Haifei
Xu, Jian
Zhang, Jian
Liu, Quan
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[42] NETWORK ARCHITECTURE REASONING VIA DEEP DETERMINISTIC POLICY GRADIENT
Liu, Huidong
Du, Fang
Tang, Xiaofen
Liu, Hao
Yu, Zhenhua
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[43] A Method of Attitude Control Based on Deep Deterministic Policy Gradient
Zhang, Jian
Wu, Fengge
Zhao, Junsuo
Xu, Fanjiang
COGNITIVE SYSTEMS AND SIGNAL PROCESSING, PT II, 2019, 1006 : 197 - 207
[44] Dynamical Motor Control Learned with Deep Deterministic Policy Gradient
Shi, Haibo
Sun, Yaoru
Li, Jie
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
[45] BYZANTINE-ROBUST FEDERATED DEEP DETERMINISTIC POLICY GRADIENT
Lin, Qifeng
Ling, Qing
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4013 - 4017
[46] Target tracking strategy using deep deterministic policy gradient
You, Shixun
Diao, Ming
Gao, Lipeng
Zhang, Fulong
Wang, Huan
APPLIED SOFT COMPUTING, 2020, 95
[47] Duplicated Replay Buffer for Asynchronous Deep Deterministic Policy Gradient
Motehayeri, Seyed Mohammad Seyed
Baghi, Vahid
Miandoab, Ehsan Maani
Moeini, Ali
2021 26TH INTERNATIONAL COMPUTER CONFERENCE, COMPUTER SOCIETY OF IRAN (CSICC), 2021,
[48] Optimal Trade Execution Based on Deep Deterministic Policy Gradient
Ye, Zekun
Deng, Weijie
Zhou, Shuigeng
Xu, Yi
Guan, Jihong
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 638 - 654
[49] Compensation Control of UAV Based on Deep Deterministic Policy Gradient
Xu, Zijun
Qi, Juntong
Wang, Mingming
Wu, Chong
Yang, Guang
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2289 - 2296
[50] Deep Deterministic Policy Gradient Artificial Intelligence for Radar Applications
Reininger, Taylor J.
Smith, Graeme E.
2022 IEEE RADAR CONFERENCE (RADARCONF'22), 2022,

← 1 2 3 4 5 →