Evaluating expressway traffic crash severity by using logistic regression and explainable & supervised machine learning classifiers

Cited by: 5
Authors
Madushani J.P.S.S. [1]
Sandamal R.M.K. [1]
Meddage D.P.P. [2]
Pasindu H.R. [3]
Gomes P.I.A. [1]
Affiliations
[1] Department of Civil Engineering, Faculty of Engineering, Sri Lanka Institute of Information Technology
[2] School of Engineering and Information Technology, University of New South Wales
[3] Department of Civil Engineering, Faculty of Engineering, University of Moratuwa
Keywords
Explainable machine learning; Expressways; Logistic regression; Machine learning; Traffic crash severity;
DOI
10.1016/j.treng.2023.100190
Abstract
The number of expressway road accidents in Sri Lanka has increased significantly (by 20%) owing to the expansion of the transport network and high traffic volume. Identifying the causes of these crashes is crucial for effective road safety management. However, traditional statistical methods may be insufficient because of their inherent assumptions. This study used explainable machine learning to investigate the factors that affect the severity of traffic crashes on expressways. The study evaluated two groups of traffic crashes: fatal or severe crashes, and other crashes that involved non-severe injuries or only property damage. Five contributing factors were analyzed: road surface condition, road alignment, location, weather condition, and lighting effect. Four machine learning models (Random Forest (RF), Decision Tree (DT), extreme gradient boosting (XGB), and K-Nearest Neighbor (KNN)) were developed and compared with Logistic Regression (LR) using 223 training and 56 testing data instances. The machine learning algorithms provided more accurate predictions than the LR model. To explain the machine learning models, Shapley Additive Explanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME) were used. These methods indicated that all five features decreased the likelihood of fatal accidents. SHAP and LIME explanations confirmed the known interactions between factors influencing crash severity under expressway operational conditions. These explanations increase the trust of end-users and domain experts in machine learning models. Furthermore, the study concluded that explainable machine learning methods are more effective than traditional regression analysis in evaluating safety performance. Additionally, the results of the study can be used to improve road safety by providing accurate explanations of black-box models for decision-making processes. © 2023
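The idea behind the Shapley-based attributions mentioned in the abstract can be sketched as follows. This is a minimal illustration and not the study's code or the SHAP library: the toy model `predict_severe`, its weights, the binary feature encoding, and the single all-zero `baseline` are all hypothetical assumptions made here, and a single reference point stands in for the background data that SHAP would normally average over.

```python
from itertools import combinations
from math import exp, factorial

# Hypothetical binary encoding of the five factors named in the abstract
# (1 = adverse condition present, 0 = reference condition).
FEATURES = ["road_surface", "road_alignment", "location", "weather", "lighting"]

# Illustrative weights only; they are NOT estimated from the study's data.
WEIGHTS = {"road_surface": 0.8, "road_alignment": 0.5, "location": 0.3,
           "weather": 0.6, "lighting": 0.4}
INTERCEPT = -1.5


def predict_severe(x):
    """Toy model: probability that a crash is fatal or severe."""
    z = INTERCEPT + sum(WEIGHTS[f] * x[f] for f in FEATURES)
    z += 0.7 * x["weather"] * x["road_surface"]  # assumed interaction term
    return 1.0 / (1.0 + exp(-z))


def shapley_values(model, x, baseline):
    """Exact Shapley values of model(x) relative to model(baseline).

    Features outside a coalition are set to their baseline value.
    Enumerates all coalitions, so it is only practical for a few features.
    """
    features = list(x)
    n = len(features)
    phi = {}
    for f in features:
        others = [g for g in features if g != f]
        total = 0.0
        for k in range(n):  # coalition sizes 0 .. n-1
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                without_f = {g: x[g] if g in subset else baseline[g]
                             for g in features}
                with_f = dict(without_f)
                with_f[f] = x[f]
                total += weight * (model(with_f) - model(without_f))
        phi[f] = total
    return phi


# One hypothetical crash record and an all-reference baseline.
x = {"road_surface": 1, "road_alignment": 1, "location": 0,
     "weather": 1, "lighting": 0}
baseline = {f: 0 for f in FEATURES}

phi = shapley_values(predict_severe, x, baseline)
for name, value in phi.items():
    print(f"{name:15s} {value:+.4f}")
```

The attributions sum to the difference between the prediction for `x` and the prediction for `baseline`, which is the additivity property that gives SHAP its name. Features that take the same value in `x` and `baseline` receive an attribution of zero.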