Severity Prediction of Highway Crashes in Saudi Arabia Using Machine Learning Techniques

被引:15
|
作者
Aldhari, Ibrahim [1 ]
Almoshaogeh, Meshal [1 ]
Jamal, Arshad [2 ]
Alharbi, Fawaz [1 ]
Alinizzi, Majed [1 ]
Haider, Husnain [1 ]
机构
[1] Qassim Univ, Coll Engn, Dept Civil Engn, Buraydah 51452, Saudi Arabia
[2] Imam Abdulrahman Bin Faisal Univ, Coll Engn, Transportat & Traff Engn Dept, POB 1982, Dammam 31451, Saudi Arabia
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 01期
关键词
traffic safety; severity prediction; machine learning; SHapley Additive exPlanations; SHAP; XGBoost; random forest; regression analysis; INJURY SEVERITY; TRAFFIC ACCIDENTS; IDENTIFICATION;
D O I
10.3390/app13010233
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Kingdom of Among the G20 countries, Saudi Arabia (KSA) is facing alarming traffic safety issues compared to other G-20 countries. Mitigating the burden of traffic accidents has been identified as a primary focus as part of vision 20230 goals. Driver distraction is the primary cause of increased severity traffic accidents in KSA. In this study, three different machine learning-based severity prediction models were developed and implemented for accident data from the Qassim Province, KSA. Traffic accident data for January 2017 to December 2019 assessment period were obtained from the Ministry of Transport and Logistics Services. Three classifiers, two of which are ensemble machine learning methods, namely random forest, XGBoost, and logistic regression, were used for crash injury severity classification. A resampling technique was used to deal with the problem of bias due to data imbalance issue. SHapley Additive exPlanations (SHAP) analysis interpreted and ranked the factors contributing to crash injury. Two forms of modeling were adopted: multi and binary classification. Among the three models, XGBoost achieved the highest classification accuracy (71%), precision (70%), recall (71%), F1-scores (70%), and area curve (AUC) (0.87) of receiver operating characteristic (ROC) curve when used for multi-category classifications. While adopting the target as a binary classification, XGBoost again outperformed the other classifiers with an accuracy of 94% and an AUC of 0.98. The SHAP results from both global and local interpretations illustrated that the accidents classified under property damage only were primarily categorized by their consequences and the number of vehicles involved. The type of road and lighting conditions were among the other influential factors affecting injury s severity outcome. The death class was classified with respect to temporal parameters, including month and day of the week, as well as road type. Assessing the factors associated with the severe injuries caused by road traffic accidents will assist policymakers in developing safety mitigation strategies in the Qassim Region and other regions of Saudi Arabia.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Emotion Prediction using Machine Learning Techniques
    Shamsi, Areeba
    Nasir, Sabika
    Hajiani, Mishaal Amin
    Ejaz, Afshan
    Ali, Syed Asim
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (06): : 166 - 172
  • [22] Diabetes Prediction using Machine Learning Techniques
    Obulesu, O.
    Suresh, K.
    Ramudu, B. Venkata
    HELIX, 2020, 10 (02): : 136 - 142
  • [23] Prediction of hypercholesterolemia using machine learning techniques
    Moradifar, Pooyan
    Amiri, Mohammad Meskarpour
    JOURNAL OF DIABETES AND METABOLIC DISORDERS, 2023, 22 (01) : 255 - 265
  • [24] Using Data-Mining Techniques for the Prediction of the Severity of Road Crashes in Cartagena, Colombia
    Ospina-Mateus, Holman
    Quintana Jimenez, Leonardo Augusto
    Jose Lopez-Valdes, Francisco
    Morales-Londono, Natalie
    Salas-Navarro, Katherinne
    APPLIED COMPUTER SCIENCES IN ENGINEERING (WEA 2019), 2019, 1052 : 309 - 320
  • [25] An Approach for Severity Prediction of Autism Using Machine Learning
    Che, Min
    Wang, Liya
    Huang, Lin
    Jiang, Zhibin
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2019, : 701 - 705
  • [26] Using machine learning to reduce unnecessary rehospitalization of cardiovascular patients in Saudi Arabia
    Alzeer, Abdullah H.
    Althemery, Abdullah
    Alsaawi, Fahad
    Albalawi, Marwan
    Alharbi, Abdulaziz
    Alzahrani, Somayah
    Alabdulaali, Deema
    Alabdullatif, Raghad
    Tash, Adel
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 154
  • [27] Severity modeling of work zone crashes in New Jersey using machine learning models
    Hasan, Ahmed Sajid
    Kabir, Md Asif Bin
    Jalayer, Mohammad
    Das, Subasish
    JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2023, 15 (06) : 604 - 635
  • [28] Comparison of three machine learning models for the prediction of hourly PV output power in Saudi Arabia
    Mas'ud, Abdullahi Abubakar
    AIN SHAMS ENGINEERING JOURNAL, 2022, 13 (04)
  • [29] Prediction of the crack condition of highway pavements using machine learning models
    Inkoom, Sylvester
    Sobanjo, John
    Barbu, Adrian
    Niu, Xufeng
    STRUCTURE AND INFRASTRUCTURE ENGINEERING, 2019, 15 (07) : 940 - 953
  • [30] Cervical Cancer Severity Characterization Using Machine Learning Techniques
    Jadhav, Varsha S.
    Yakkundimath, Rajesh
    Konnurmath, Guruprasad
    INDIAN JOURNAL OF GYNECOLOGIC ONCOLOGY, 2024, 22 (04)