Decision making for autonomous vehicles in highway scenarios using Harmonic SK Deep SARSA

被引：9

作者：

Rais, Mohamed Saber ^{[1
]}

Boudour, Rachid ^{[1
]}

Zouaidia, Khouloud ^{[1
]}

Bougueroua, Lamine ^{[2
]}

机构：

[1] Badji Mokhtar Univ, Embedded Syst Lab, Annaba, Algeria

[2] Efrei Paris, Allianst Res Lab, Villejuif, France

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 03期

关键词：

Reinforcement learning; Deep learning; Human inspired meta-heuristics; Decision making; Autonomous vehicles;

D O I：

10.1007/s10489-022-03357-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The complexity of taking decisions for an autonomous vehicle (AV) to avoid road accident fatalities, provide safety, comfort, and reduce traffic raises the need for improvements in the field of decision making. To solve these challenges, many algorithms and techniques were applied, and the most common ones were reinforcement learning (RL) algorithms combined with deep learning techniques. Therefore, in this paper we proposed a novel extension of the popular "SARSA" (State-Action-Reward-State-Action) RL technique called "Harmonic SK Deep SARSA" that takes advantage of the stability which SARSA algorithm provides and uses the notion of similar and cumulative states saved in an alternative memory to enhance the stability of the algorithm and achieve remarkable performance that SARSA could not accomplish due to its on policy nature. Through the investigation of our novel extension the adaptability of the algorithm to unexpected situations during learning and to unforeseen changes in the environment was proved while reducing the computational load in the learning process and increasing the convergence rate that plays a key role in upgrading decision making application that require numerous real time consecutive decisions, including autonomous vehicles, industrial robots, gaming, aerial navigation... The novel algorithm was tested in a gym environment simulator called "Highway-env" with multiple highway situations (multiple lanes configurations, highway with dynamic number of lanes (from 4-lane to 2-lane, from 4-lane to 6-lane), merge) with numerous dynamic obstacles. For the purpose of comparison, we used a benchmark of cutting edge algorithms known for their prominent performance. The experimental results showed that the proposed algorithm outperformed the comparison algorithms in learning stability and performance that were validated by the following metrics: average loss value per episode, average accuracy per episode, maximum speed value reached per episode, average speed per episode, and the total reward per episode.

引用

页码：2488 / 2505

页数：18

共 50 条

[1] Decision making for autonomous vehicles in highway scenarios using Harmonic SK Deep SARSA
Mohamed Saber Rais
Rachid Boudour
Khouloud Zouaidia
Lamine Bougueroua
Applied Intelligence, 2023, 53 : 2488 - 2505
[2] Decision-Making Strategy on Highway for Autonomous Vehicles Using Deep Reinforcement Learning
Liao, Jiangdong
Liu, Teng
Tang, Xiaolin
Mu, Xingyu
Huang, Bing
Cao, Dongpu
IEEE ACCESS, 2020, 8 (08): : 177804 - 177814
[3] Decision-Making in Fallback Scenarios for Autonomous Vehicles: Deep Reinforcement Learning Approach
Lee, Cheonghwa
An, Dawn
APPLIED SCIENCES-BASEL, 2023, 13 (22):
[4] Decision-Making for Autonomous Vehicles in Random Task Scenarios at Unsignalized Intersection Using Deep Reinforcement Learning
Xiao, Wenxuan
Yang, Yuyou
Mu, Xinyu
Xie, Yi
Tang, Xiaolin
Cao, Dongpu
Liu, Teng
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 7812 - 7825
[5] Decision making of autonomous vehicles in lane change scenarios: Deep reinforcement learning approaches with risk awareness
Li, Guofa
Yang, Yifan
Li, Shen
Qu, Xingda
Lyu, Nengchao
Li, Shengbo Eben
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 134
[6] Cooperation-Aware Decision Making for Autonomous Vehicles in Merge Scenarios
Liu, Kaiwen
Li, Nan
Tseng, H. Eric
Kolmanovsky, Ilya
Girard, Anouck
Filev, Dimitar
2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 5006 - 5012
[7] A Survey of Localization Methods for Autonomous Vehicles in Highway Scenarios
Laconte, Johann
Kasmi, Abderrahim
Aufrere, Romuald
Vaidis, Maxime
Chapuis, Roland
SENSORS, 2022, 22 (01)
[8] Enhanced decision making in multi-scenarios for autonomous vehicles using alternative bidirectional Q network
Rais, Mohamed Saber
Zouaidia, Khouloud
Boudour, Rachid
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18): : 15981 - 15996
[9] Enhanced decision making in multi-scenarios for autonomous vehicles using alternative bidirectional Q network
Mohamed Saber Rais
Khouloud Zouaidia
Rachid Boudour
Neural Computing and Applications, 2022, 34 : 15981 - 15996
[10] Upgraded decision making in continuous domains for autonomous vehicles in high complexity scenarios using escalated DDPG
Zouaidia, Khouloud
Rais, Med Saber
Bougueroua, Lamine
APPLIED INTELLIGENCE, 2025, 55 (07)

← 1 2 3 4 5 →