Path planning via reinforcement learning with closed-loop motion control and field tests

Cited by: 0
Authors
Feher, Arpad [1]
Domina, Adam [2]
Bardos, Adam [2]
Aradi, Szilard [1]
Becsi, Tamas [1]
Affiliations
[1] Budapest Univ Technol & Econ, Fac Transportat Engn & Vehicle Engn, Dept Control Transportat & Vehicle Syst, Muegyet Rkp 3, H-1111 Budapest, Hungary
[2] Budapest Univ Technol & Econ, Dept Automot Technol, Fac Transportat Engn & Vehicle Engn, Muegyetem Rkp 3, H-1111 Budapest, Hungary
Keywords
Vehicle dynamics; Advanced driver assistance systems; Machine learning; Reinforcement learning; Model predictive control; ACTIVE STEERING CONTROL; MODEL; SIMULATION; VEHICLES;
DOI
10.1016/j.engappai.2024.109870
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Performing evasive maneuvers with highly automated vehicles is a challenging task. The algorithm must fulfill safety constraints and complete the task while keeping the car in a controllable state. Furthermore, considering all aspects of vehicle dynamics, the path generation problem is numerically complex. Hence its classical solutions can hardly meet real-time requirements. On the other hand, single reinforcement learning based approaches could only handle this problem as a simple driving task and would not provide feasibility information on the whole task's horizon. Therefore, this paper presents a hierarchical method for obstacle avoidance of an automated vehicle to overcome this issue, where the geometric path generation is provided by a single-step continuous Reinforcement Learning agent, while a model-predictive controller deals with lateral control to perform a double lane change maneuver. As the agent plays the optimization role in this architecture, it is trained in various scenarios to provide the necessary parameters for a geometric path generator in a one-step neural network output. During the training, the controller that follows the track evaluates the feasibility of the generated path, whose performance metrics provide feedback to the agent so it can further improve its performance. The framework can train an agent for a given problem with various parameters. As a use case, it is presented as a static obstacle avoidance maneuver. The proposed framework was tested on an automotive proving ground with the geometric constraints of the ISO-3888-2 test. The results proved its real-time capability and performance compared to human drivers' abilities.
Pages: 13