Path Planning and Tracking Control for Parking via Soft Actor-Critic Under Non-Ideal Scenarios

被引：18

作者：

Tang, Xiaolin ^{[1
]}

Yang, Yuyou ^{[1
]}

Liu, Teng ^{[1
,2
,3
]}

Lin, Xianke ^{[4
]}

Yang, Kai ^{[1
]}

Li, Shen ^{[5
]}

机构：

[1] Chongqing Univ, Coll Mech & Vehicle Engn, Chongqing 400044, Peoples R China

[2] Chongqing Univ Three Gorges Hosp, Three Gorges Hosp, Clin Res Ctr, Wanzhou 404000, Peoples R China

[3] Chongqing Univ, Three Gorges Hosp, Med Pathol Ctr, Wanzhou 404000, Peoples R China

[4] Ontario Tech Univ, Dept Automot & Mechatron Engn, Oshawa, ON L1G 0C5, Canada

[5] Tsinghua Univ, Sch Civil Engn, Beijing 100084, Peoples R China

来源：

IEEE-CAA JOURNAL OF AUTOMATICA SINICA | 2024年 / 11卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Automatic parking; control strategy; parking deviation (APS); soft actor-critic (SAC);

D O I：

10.1109/JAS.2023.123975

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Parking in a small parking lot within limited space poses a difficult task. It often leads to deviations between the final parking posture and the target posture. These deviations can lead to partial occupancy of adjacent parking lots, which poses a safety threat to vehicles parked in these parking lots. However, previous studies have not addressed this issue. In this paper, we aim to evaluate the impact of parking deviation of existing vehicles next to the target parking lot (PDEVNTPL) on the automatic ego vehicle (AEV) parking, in terms of safety, comfort, accuracy, and efficiency of parking. A segmented parking training framework (SPTF) based on soft actor-critic (SAC) is proposed to improve parking performance. In the proposed method, the SAC algorithm incorporates strategy entropy into the objective function, to enable the AEV to learn parking strategies based on a more comprehensive understanding of the environment. Additionally, the SPTF simplifies complex parking tasks to maintain the high performance of deep reinforcement learning (DRL). The experimental results reveal that the PDEVNTPL has a detrimental influence on the AEV parking in terms of safety, accuracy, and comfort, leading to reductions of more than 27%, 54%, and 26% respectively. However, the SAC-based SPTF effectively mitigates this impact, resulting in a considerable increase in the parking success rate from 71% to 93%. Furthermore, the heading angle deviation is significantly reduced from 2.25 degrees to 0.43 degrees.

引用

页码：181 / 195

页数：15

共 25 条

[21] An expert-demonstrated soft actor-critic based adaptive trajectory tracking control of Autonomous Underwater Vehicle with Long Short-Term Memory
Wang, Yuxuan
Hou, Yaochun
Lai, Zhounian
Cao, Linlin
Hong, Weirong
Wu, Dazhuan
OCEAN ENGINEERING, 2025, 321
[22] Optimal dynamic thermal management for data center via soft actor-critic algorithm with dynamic control interval and combined-value state space
Guo, Yuxiang
Qu, Shengli
Wang, Chuang
Xing, Ziwen
Duan, Kaiwen
APPLIED ENERGY, 2024, 373
[23] UDE-Based Dynamic Surface Control for Quadrotor Drone Attitude Tracking under Non-Ideal Actuators
Xu, Linxi
Qin, Kaiyu
Tang, Fan
Shi, Mengji
Lin, Boxian
DRONES, 2023, 7 (05)
[24] Wavelet reduced order observer-based adaptive tracking control for a class of uncertain delayed non-linear systems subjected to actuator saturation using actor-critic architecture
Sharma, Manish
Verma, Ajay
INTERNATIONAL JOURNAL OF AUTOMATION AND CONTROL, 2013, 7 (04) : 288 - 303
[25] Wavelet reduced order observer-based adaptive tracking control for a class of uncertain delayed non-linear systems subjected to actuator saturation using actor-critic architecture
Sharma, M. (er.mann24@gmail.com), 2013, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (07)

← 1 2 3 →