Severity Prediction of Highway Crashes in Saudi Arabia Using Machine Learning Techniques

被引:15
|
作者
Aldhari, Ibrahim [1 ]
Almoshaogeh, Meshal [1 ]
Jamal, Arshad [2 ]
Alharbi, Fawaz [1 ]
Alinizzi, Majed [1 ]
Haider, Husnain [1 ]
机构
[1] Qassim Univ, Coll Engn, Dept Civil Engn, Buraydah 51452, Saudi Arabia
[2] Imam Abdulrahman Bin Faisal Univ, Coll Engn, Transportat & Traff Engn Dept, POB 1982, Dammam 31451, Saudi Arabia
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 01期
关键词
traffic safety; severity prediction; machine learning; SHapley Additive exPlanations; SHAP; XGBoost; random forest; regression analysis; INJURY SEVERITY; TRAFFIC ACCIDENTS; IDENTIFICATION;
D O I
10.3390/app13010233
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Kingdom of Among the G20 countries, Saudi Arabia (KSA) is facing alarming traffic safety issues compared to other G-20 countries. Mitigating the burden of traffic accidents has been identified as a primary focus as part of vision 20230 goals. Driver distraction is the primary cause of increased severity traffic accidents in KSA. In this study, three different machine learning-based severity prediction models were developed and implemented for accident data from the Qassim Province, KSA. Traffic accident data for January 2017 to December 2019 assessment period were obtained from the Ministry of Transport and Logistics Services. Three classifiers, two of which are ensemble machine learning methods, namely random forest, XGBoost, and logistic regression, were used for crash injury severity classification. A resampling technique was used to deal with the problem of bias due to data imbalance issue. SHapley Additive exPlanations (SHAP) analysis interpreted and ranked the factors contributing to crash injury. Two forms of modeling were adopted: multi and binary classification. Among the three models, XGBoost achieved the highest classification accuracy (71%), precision (70%), recall (71%), F1-scores (70%), and area curve (AUC) (0.87) of receiver operating characteristic (ROC) curve when used for multi-category classifications. While adopting the target as a binary classification, XGBoost again outperformed the other classifiers with an accuracy of 94% and an AUC of 0.98. The SHAP results from both global and local interpretations illustrated that the accidents classified under property damage only were primarily categorized by their consequences and the number of vehicles involved. The type of road and lighting conditions were among the other influential factors affecting injury s severity outcome. The death class was classified with respect to temporal parameters, including month and day of the week, as well as road type. Assessing the factors associated with the severe injuries caused by road traffic accidents will assist policymakers in developing safety mitigation strategies in the Qassim Region and other regions of Saudi Arabia.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Rainfall Prediction Rate in Saudi Arabia Using Improved Machine Learning Techniques
    Baljon, Mohammed
    Sharma, Sunil Kumar
    WATER, 2023, 15 (04)
  • [2] Identifying Causes of Traffic Crashes Associated with Driver Behavior Using Supervised Machine Learning Methods: Case of Highway 15 in Saudi Arabia
    Akin, Darcin
    Sisiopiku, Virginia P.
    Alateah, Ali H.
    Almonbhi, Ali O.
    Al-Tholaia, Mohammed M. H.
    Al-Sodani, Khaled A. Alawi
    SUSTAINABILITY, 2022, 14 (24)
  • [3] Injury severity prediction of traffic crashes with ensemble machine learning techniques: a comparative study
    Jamal, Arshad
    Zahid, Muhammad
    Tauhidur Rahman, Muhammad
    Al-Ahmadi, Hassan M.
    Almoshaogeh, Meshal
    Farooq, Danish
    Ahmad, Mahmood
    INTERNATIONAL JOURNAL OF INJURY CONTROL AND SAFETY PROMOTION, 2021, 28 (04) : 408 - 427
  • [4] Severity prediction of motorcycle crashes with machine learning methods
    Wahab, Lukuman
    Jiang, Haobin
    INTERNATIONAL JOURNAL OF CRASHWORTHINESS, 2020, 25 (05) : 485 - 492
  • [5] Using Machine Learning for Prediction of Factors Affecting Crimes in Saudi Arabia
    Alsaqabi, Anadil
    Aldhubayi, Fatimah
    Albahli, Saleh
    BDE 2019: 2019 INTERNATIONAL CONFERENCE ON BIG DATA ENGINEERING, 2019, : 51 - 56
  • [6] Involvement of Road Users from the Productive Age Group in Traffic Crashes in Saudi Arabia: An Investigative Study Using Statistical and Machine Learning Techniques
    Islam, Md. Kamrul
    Gazder, Uneb
    Akter, Rocksana
    Arifuzzaman, Md.
    APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [7] Prediction of acute organophosphate poisoning severity using machine learning techniques
    Hosseini, Sayed Masoud
    Rahimi, Mitra
    Afrash, Mohammad Reza
    Ziaeefar, Pardis
    Yousefzadeh, Parsa
    Pashapour, Sanaz
    Evini, Peyman Erfan Talab
    Mostafazadeh, Babak
    Shadnia, Shahin
    TOXICOLOGY, 2023, 486
  • [8] Optimizing Machine Learning Models with Bayesian Techniques for Prediction of Groundwater Quality Index in Southwest Saudi Arabia
    Alshehri, Fahad
    Shahfahad
    Rahman, Atiqur
    EARTH SYSTEMS AND ENVIRONMENT, 2024, 8 (04) : 1417 - 1436
  • [9] An Improved Flood Susceptibility Assessment in Jeddah, Saudi Arabia, Using Advanced Machine Learning Techniques
    Ghanim, Abdulnoor A. J.
    Shaf, Ahmad
    Ali, Tariq
    Zafar, Maryam
    Al-Areeq, Ahmed M.
    Alyami, Saleh H.
    Irfan, Muhammad
    Rahman, Saifur
    WATER, 2023, 15 (14)
  • [10] Prediction of Road Accidents' Severity on Russian Roads Using Machine Learning Techniques
    Donchenko, D.
    Sadovnikova, N.
    Parygin, D.
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING, ICIE 2019, VOL II, 2020, : 1493 - 1501