Path Planning and Tracking Control for Parking via Soft Actor-Critic Under Non-Ideal Scenarios

被引:18
|
作者
Tang, Xiaolin [1 ]
Yang, Yuyou [1 ]
Liu, Teng [1 ,2 ,3 ]
Lin, Xianke [4 ]
Yang, Kai [1 ]
Li, Shen [5 ]
机构
[1] Chongqing Univ, Coll Mech & Vehicle Engn, Chongqing 400044, Peoples R China
[2] Chongqing Univ Three Gorges Hosp, Three Gorges Hosp, Clin Res Ctr, Wanzhou 404000, Peoples R China
[3] Chongqing Univ, Three Gorges Hosp, Med Pathol Ctr, Wanzhou 404000, Peoples R China
[4] Ontario Tech Univ, Dept Automot & Mechatron Engn, Oshawa, ON L1G 0C5, Canada
[5] Tsinghua Univ, Sch Civil Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Automatic parking; control strategy; parking deviation (APS); soft actor-critic (SAC);
D O I
10.1109/JAS.2023.123975
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Parking in a small parking lot within limited space poses a difficult task. It often leads to deviations between the final parking posture and the target posture. These deviations can lead to partial occupancy of adjacent parking lots, which poses a safety threat to vehicles parked in these parking lots. However, previous studies have not addressed this issue. In this paper, we aim to evaluate the impact of parking deviation of existing vehicles next to the target parking lot (PDEVNTPL) on the automatic ego vehicle (AEV) parking, in terms of safety, comfort, accuracy, and efficiency of parking. A segmented parking training framework (SPTF) based on soft actor-critic (SAC) is proposed to improve parking performance. In the proposed method, the SAC algorithm incorporates strategy entropy into the objective function, to enable the AEV to learn parking strategies based on a more comprehensive understanding of the environment. Additionally, the SPTF simplifies complex parking tasks to maintain the high performance of deep reinforcement learning (DRL). The experimental results reveal that the PDEVNTPL has a detrimental influence on the AEV parking in terms of safety, accuracy, and comfort, leading to reductions of more than 27%, 54%, and 26% respectively. However, the SAC-based SPTF effectively mitigates this impact, resulting in a considerable increase in the parking success rate from 71% to 93%. Furthermore, the heading angle deviation is significantly reduced from 2.25 degrees to 0.43 degrees.
引用
收藏
页码:181 / 195
页数:15
相关论文
共 25 条
  • [21] An expert-demonstrated soft actor-critic based adaptive trajectory tracking control of Autonomous Underwater Vehicle with Long Short-Term Memory
    Wang, Yuxuan
    Hou, Yaochun
    Lai, Zhounian
    Cao, Linlin
    Hong, Weirong
    Wu, Dazhuan
    OCEAN ENGINEERING, 2025, 321
  • [22] Optimal dynamic thermal management for data center via soft actor-critic algorithm with dynamic control interval and combined-value state space
    Guo, Yuxiang
    Qu, Shengli
    Wang, Chuang
    Xing, Ziwen
    Duan, Kaiwen
    APPLIED ENERGY, 2024, 373
  • [23] UDE-Based Dynamic Surface Control for Quadrotor Drone Attitude Tracking under Non-Ideal Actuators
    Xu, Linxi
    Qin, Kaiyu
    Tang, Fan
    Shi, Mengji
    Lin, Boxian
    DRONES, 2023, 7 (05)
  • [24] Wavelet reduced order observer-based adaptive tracking control for a class of uncertain delayed non-linear systems subjected to actuator saturation using actor-critic architecture
    Sharma, Manish
    Verma, Ajay
    INTERNATIONAL JOURNAL OF AUTOMATION AND CONTROL, 2013, 7 (04) : 288 - 303