An End-to-End Deep Reinforcement Learning Model Based on Proximal Policy Optimization Algorithm for Autonomous Driving of Off-Road Vehicle

被引:0
|
作者
Wang, Yiquan [1 ,2 ]
Wang, Jingguo [2 ]
Yang, Yu [1 ]
Li, Zhaodong [1 ]
Zhao, Xijun [1 ]
机构
[1] China North Artificial Intelligence & Innovat Res, Beijing, Peoples R China
[2] Jiuquan Satellite Launch Ctr, Jiuquan, Gansu, Peoples R China
关键词
Reinforcement Learning; End-to-End; UGV; Wild Environment; GROUND VEHICLE;
D O I
10.1007/978-981-99-0479-2_248
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most conventional unmanned vehicle control algorithms require human adjustment of parameters and design of precise rules, thus failing to adapt quickly to multiple situations when facing complex environments in the wild. To address these problems, this paper adopts an end-to-end deep reinforcement learning model based on proximal policy optimization algorithm to control the steering, speed and braking of an unmanned vehicle, allowing it to autonomously learn motion control strategies from perceptionmap in un-known environments. A novel environment simulator which contains variable passable areas and obstacles is also proposed to support agents to achieve target reward. The proposed agent model has been proved to receive the highest reward over SAC and has the ability to overcome the complexity of the wild environment generated by the simulator.
引用
收藏
页码:2692 / 2704
页数:13
相关论文
共 50 条
  • [21] End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
    Ruan, Xiaogang
    Li, Peng
    Zhu, Xiaoqing
    Yu, Hejie
    Yu, Naigong
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [22] End-to-End Driving in a Realistic Racing Game with Deep Reinforcement Learning
    Perot, Etienne
    Jaritz, Maximilian
    Toromanoff, Marin
    de Charette, Raoul
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 474 - 475
  • [23] Eagle: End-to-end Deep Reinforcement Learning based Autonomous Control of PTZ Cameras
    Sandha, Sandeep Singh
    Balaji, Bharathan
    Garcia, Luis
    Srivastava, Mani
    PROCEEDINGS 8TH ACM/IEEE CONFERENCE ON INTERNET OF THINGS DESIGN AND IMPLEMENTATION, IOTDI 2023, 2023, : 144 - 157
  • [24] Agile Autonomous Driving using End-to-End Deep Imitation Learning
    Pan, Yunpeng
    Cheng, Ching-An
    Saigol, Kamil
    Lee, Keuntaek
    Yan, Xinyan
    Theodorou, Evangelos A.
    Boots, Byron
    ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,
  • [25] End-to-End Autonomous Navigation Based on Deep Reinforcement Learning with a Survival Penalty Function
    Jeng, Shyr-Long
    Chiang, Chienhsun
    SENSORS, 2023, 23 (20)
  • [26] End-to-end deep learning for reverse driving trajectory of autonomous bulldozer
    You, Ke
    Ding, Lieyun
    Jiang, Yutian
    Wu, Zhangang
    Zhou, Cheng
    KNOWLEDGE-BASED SYSTEMS, 2022, 252
  • [27] End-to-End Deep Reinforcement Learning for Image-Based UAV Autonomous Control
    Zhao, Jiang
    Sun, Jiaming
    Cai, Zhihao
    Wang, Longhong
    Wang, Yingxun
    APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [28] Towards End-to-End Chase in Urban Autonomous Driving Using Reinforcement Learning
    Kolomanski, Michal
    Sakhai, Mustafa
    Nowak, Jakub
    Wielgosz, Maciej
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 3, 2023, 544 : 408 - 426
  • [29] Towards End-to-End Escape in Urban Autonomous Driving Using Reinforcement Learning
    Sakhai, Mustafa
    Wielgosz, Maciej
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2023, 2024, 823 : 21 - 40
  • [30] Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data
    So, John
    Xie, Amber
    Jung, Sunggoo
    Edlund, Jeffrey
    Thakker, Rohan
    Agha-mohammadi, Ali
    Abbeel, Pieter
    James, Stephen
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1871 - 1881