Decision making for autonomous vehicles in highway scenarios using Harmonic SK Deep SARSA

被引:9
|
作者
Rais, Mohamed Saber [1 ]
Boudour, Rachid [1 ]
Zouaidia, Khouloud [1 ]
Bougueroua, Lamine [2 ]
机构
[1] Badji Mokhtar Univ, Embedded Syst Lab, Annaba, Algeria
[2] Efrei Paris, Allianst Res Lab, Villejuif, France
关键词
Reinforcement learning; Deep learning; Human inspired meta-heuristics; Decision making; Autonomous vehicles;
D O I
10.1007/s10489-022-03357-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The complexity of taking decisions for an autonomous vehicle (AV) to avoid road accident fatalities, provide safety, comfort, and reduce traffic raises the need for improvements in the field of decision making. To solve these challenges, many algorithms and techniques were applied, and the most common ones were reinforcement learning (RL) algorithms combined with deep learning techniques. Therefore, in this paper we proposed a novel extension of the popular "SARSA" (State-Action-Reward-State-Action) RL technique called "Harmonic SK Deep SARSA" that takes advantage of the stability which SARSA algorithm provides and uses the notion of similar and cumulative states saved in an alternative memory to enhance the stability of the algorithm and achieve remarkable performance that SARSA could not accomplish due to its on policy nature. Through the investigation of our novel extension the adaptability of the algorithm to unexpected situations during learning and to unforeseen changes in the environment was proved while reducing the computational load in the learning process and increasing the convergence rate that plays a key role in upgrading decision making application that require numerous real time consecutive decisions, including autonomous vehicles, industrial robots, gaming, aerial navigation... The novel algorithm was tested in a gym environment simulator called "Highway-env" with multiple highway situations (multiple lanes configurations, highway with dynamic number of lanes (from 4-lane to 2-lane, from 4-lane to 6-lane), merge) with numerous dynamic obstacles. For the purpose of comparison, we used a benchmark of cutting edge algorithms known for their prominent performance. The experimental results showed that the proposed algorithm outperformed the comparison algorithms in learning stability and performance that were validated by the following metrics: average loss value per episode, average accuracy per episode, maximum speed value reached per episode, average speed per episode, and the total reward per episode.
引用
收藏
页码:2488 / 2505
页数:18
相关论文
共 50 条
  • [21] Prediction Based Decision Making for Autonomous Highway Driving
    Yildirim, Mustafa
    Mozaffari, Sajjad
    McCutcheon, Luc
    Dianati, Mehrdad
    Tamaddoni-Nezhad, Alireza
    Fallah, Saber
    arXiv, 2022,
  • [22] Prediction Based Decision Making for Autonomous Highway Driving
    Yildirim, Mustafa
    Mozaffari, Sajjad
    McCutcheon, Luc
    Dianati, Mehrdad
    Tamaddoni-Nezhad, Alireza
    Fallah, Saber
    IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC, 2022, 2022-October : 138 - 145
  • [23] Highway Traffic Modeling and Decision Making for Autonomous Vehicle Using Reinforcement Learning
    You, Changxi
    Lu, Jianbo
    Filev, Dimitar
    Tsiotras, Panagiotis
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1227 - 1232
  • [24] Planning and Decision-Making for Autonomous Vehicles
    Schwarting, Wilko
    Alonso-Mora, Javier
    Rus, Daniela
    ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 1, 2018, 1 : 187 - 210
  • [25] A Survey on Datasets for the Decision Making of Autonomous Vehicles
    Wang, Yuning
    Han, Zeyu
    Xing, Yining
    Xu, Shaobing
    Wang, Jianqiang
    IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2024, 16 (02) : 23 - 40
  • [26] An Overview of Decision-Making in Autonomous Vehicles
    Ghraizi, Dany
    Talj, Reine
    Francis, Clovis
    IFAC PAPERSONLINE, 2023, 56 (02):
  • [27] Robust Decision Making for Autonomous Vehicles at Highway On-Ramps: A Constrained Adversarial Reinforcement Learning Approach
    He, Xiangkun
    Lou, Baichuan
    Yang, Haohan
    Lv, Chen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (04) : 4103 - 4113
  • [28] Interpretable Decision-Making for Autonomous Vehicles at Highway On-Ramps With Latent Space Reinforcement Learning
    Wang, Huanjie
    Gao, Hongbo
    Yuan, Shihua
    Zhao, Hongfei
    Wang, Kelong
    Wang, Xiulai
    Li, Keqiang
    Li, Deyi
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (09) : 8707 - 8719
  • [29] Driving Tasks Transfer Using Deep Reinforcement Learning for Decision-Making of Autonomous Vehicles in Unsignalized Intersection
    Shu, Hong
    Liu, Teng
    Mu, Xingyu
    Cao, Dongpu
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) : 41 - 52
  • [30] Human-Guided Deep Reinforcement Learning for Optimal Decision Making of Autonomous Vehicles
    Wu, Jingda
    Yang, Haohan
    Yang, Lie
    Huang, Yi
    He, Xiangkun
    Lv, Chen
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, : 1 - 15