Decision making for autonomous vehicles in highway scenarios using Harmonic SK Deep SARSA

被引:9
|
作者
Rais, Mohamed Saber [1 ]
Boudour, Rachid [1 ]
Zouaidia, Khouloud [1 ]
Bougueroua, Lamine [2 ]
机构
[1] Badji Mokhtar Univ, Embedded Syst Lab, Annaba, Algeria
[2] Efrei Paris, Allianst Res Lab, Villejuif, France
关键词
Reinforcement learning; Deep learning; Human inspired meta-heuristics; Decision making; Autonomous vehicles;
D O I
10.1007/s10489-022-03357-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The complexity of taking decisions for an autonomous vehicle (AV) to avoid road accident fatalities, provide safety, comfort, and reduce traffic raises the need for improvements in the field of decision making. To solve these challenges, many algorithms and techniques were applied, and the most common ones were reinforcement learning (RL) algorithms combined with deep learning techniques. Therefore, in this paper we proposed a novel extension of the popular "SARSA" (State-Action-Reward-State-Action) RL technique called "Harmonic SK Deep SARSA" that takes advantage of the stability which SARSA algorithm provides and uses the notion of similar and cumulative states saved in an alternative memory to enhance the stability of the algorithm and achieve remarkable performance that SARSA could not accomplish due to its on policy nature. Through the investigation of our novel extension the adaptability of the algorithm to unexpected situations during learning and to unforeseen changes in the environment was proved while reducing the computational load in the learning process and increasing the convergence rate that plays a key role in upgrading decision making application that require numerous real time consecutive decisions, including autonomous vehicles, industrial robots, gaming, aerial navigation... The novel algorithm was tested in a gym environment simulator called "Highway-env" with multiple highway situations (multiple lanes configurations, highway with dynamic number of lanes (from 4-lane to 2-lane, from 4-lane to 6-lane), merge) with numerous dynamic obstacles. For the purpose of comparison, we used a benchmark of cutting edge algorithms known for their prominent performance. The experimental results showed that the proposed algorithm outperformed the comparison algorithms in learning stability and performance that were validated by the following metrics: average loss value per episode, average accuracy per episode, maximum speed value reached per episode, average speed per episode, and the total reward per episode.
引用
收藏
页码:2488 / 2505
页数:18
相关论文
共 50 条
  • [1] Decision making for autonomous vehicles in highway scenarios using Harmonic SK Deep SARSA
    Mohamed Saber Rais
    Rachid Boudour
    Khouloud Zouaidia
    Lamine Bougueroua
    Applied Intelligence, 2023, 53 : 2488 - 2505
  • [2] Decision-Making Strategy on Highway for Autonomous Vehicles Using Deep Reinforcement Learning
    Liao, Jiangdong
    Liu, Teng
    Tang, Xiaolin
    Mu, Xingyu
    Huang, Bing
    Cao, Dongpu
    IEEE ACCESS, 2020, 8 (08): : 177804 - 177814
  • [3] Decision-Making in Fallback Scenarios for Autonomous Vehicles: Deep Reinforcement Learning Approach
    Lee, Cheonghwa
    An, Dawn
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [4] Decision-Making for Autonomous Vehicles in Random Task Scenarios at Unsignalized Intersection Using Deep Reinforcement Learning
    Xiao, Wenxuan
    Yang, Yuyou
    Mu, Xinyu
    Xie, Yi
    Tang, Xiaolin
    Cao, Dongpu
    Liu, Teng
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 7812 - 7825
  • [5] Decision making of autonomous vehicles in lane change scenarios: Deep reinforcement learning approaches with risk awareness
    Li, Guofa
    Yang, Yifan
    Li, Shen
    Qu, Xingda
    Lyu, Nengchao
    Li, Shengbo Eben
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 134
  • [6] Cooperation-Aware Decision Making for Autonomous Vehicles in Merge Scenarios
    Liu, Kaiwen
    Li, Nan
    Tseng, H. Eric
    Kolmanovsky, Ilya
    Girard, Anouck
    Filev, Dimitar
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 5006 - 5012
  • [7] A Survey of Localization Methods for Autonomous Vehicles in Highway Scenarios
    Laconte, Johann
    Kasmi, Abderrahim
    Aufrere, Romuald
    Vaidis, Maxime
    Chapuis, Roland
    SENSORS, 2022, 22 (01)
  • [8] Enhanced decision making in multi-scenarios for autonomous vehicles using alternative bidirectional Q network
    Rais, Mohamed Saber
    Zouaidia, Khouloud
    Boudour, Rachid
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18): : 15981 - 15996
  • [9] Enhanced decision making in multi-scenarios for autonomous vehicles using alternative bidirectional Q network
    Mohamed Saber Rais
    Khouloud Zouaidia
    Rachid Boudour
    Neural Computing and Applications, 2022, 34 : 15981 - 15996
  • [10] Upgraded decision making in continuous domains for autonomous vehicles in high complexity scenarios using escalated DDPG
    Zouaidia, Khouloud
    Rais, Med Saber
    Bougueroua, Lamine
    APPLIED INTELLIGENCE, 2025, 55 (07)