An End-to-End Deep Reinforcement Learning Model Based on Proximal Policy Optimization Algorithm for Autonomous Driving of Off-Road Vehicle

被引:0
|
作者
Wang, Yiquan [1 ,2 ]
Wang, Jingguo [2 ]
Yang, Yu [1 ]
Li, Zhaodong [1 ]
Zhao, Xijun [1 ]
机构
[1] China North Artificial Intelligence & Innovat Res, Beijing, Peoples R China
[2] Jiuquan Satellite Launch Ctr, Jiuquan, Gansu, Peoples R China
关键词
Reinforcement Learning; End-to-End; UGV; Wild Environment; GROUND VEHICLE;
D O I
10.1007/978-981-99-0479-2_248
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most conventional unmanned vehicle control algorithms require human adjustment of parameters and design of precise rules, thus failing to adapt quickly to multiple situations when facing complex environments in the wild. To address these problems, this paper adopts an end-to-end deep reinforcement learning model based on proximal policy optimization algorithm to control the steering, speed and braking of an unmanned vehicle, allowing it to autonomously learn motion control strategies from perceptionmap in un-known environments. A novel environment simulator which contains variable passable areas and obstacles is also proposed to support agents to achieve target reward. The proposed agent model has been proved to receive the highest reward over SAC and has the ability to overcome the complexity of the wild environment generated by the simulator.
引用
收藏
页码:2692 / 2704
页数:13
相关论文
共 50 条
  • [41] End-to-end Deep Reinforcement Learning Based Coreference Resolution
    Fei, Hongliang
    Li, Xu
    Li, Dingcheng
    Li, Ping
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 660 - 665
  • [42] An End-to-End Deep Reinforcement Learning Framework for Electric Vehicle Routing Problem
    Wang, Mengqin
    Wei, Yanling
    Huang, Xueliang
    Gao, Shan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (20): : 33671 - 33682
  • [43] A Reinforcement Learning based End-to-End Algorithm for Confrontation Problem
    Wang, Siqiang
    Yao, Haodi
    Yao, Yu
    He, Fenghua
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 2594 - 2598
  • [44] End-to-End Deep Policy Feedback-Based Reinforcement Learning Method for Quantization in DNNs
    Babu, R. Logesh
    Gurumoorthy, Sasikumar
    Parameshachari, B. D.
    Nelson, S. Christalin
    Hua, Qiaozhi
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (13)
  • [45] End-to-end autonomous driving based on the convolution neural network model
    Zhao, Yuanfang
    Chen, Yunli
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 419 - 423
  • [46] Learning a Deep Neural Net Policy for End-to-End Control of Autonomous Vehicles
    Rausch, Viktor
    Hansen, Andreas
    Solowjow, Eugen
    Liu, Chang
    Kreuzer, Edwin
    Hedrick, J. Karl
    2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 4914 - 4919
  • [47] Early Failure Detection of Deep End-to-End Control Policy by Reinforcement Learning
    Lee, Keuntaek
    Saigol, Kamil
    Theodorou, Evangelos A.
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 8543 - 8549
  • [48] End-to-end deep learning-based autonomous driving control for high-speed environment
    Cheol-jin Kim
    Myung-jae Lee
    Kyu-hong Hwang
    Young-guk Ha
    The Journal of Supercomputing, 2022, 78 : 1961 - 1982
  • [49] End-to-end deep learning-based autonomous driving control for high-speed environment
    Kim, Cheol-jin
    Lee, Myung-jae
    Hwang, Kyu-hong
    Ha, Young-guk
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (02): : 1961 - 1982
  • [50] An End-to-End Deep Reinforcement Learning Based Modular Task Allocation Framework for Autonomous Mobile Systems
    Ma, Song
    Ruan, Jingqing
    Du, Yali
    Bucknall, Richard
    Liu, Yuanchang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 1519 - 1533