Comprehensive evaluation of multiple machine learning classifiers for predicting freeway incident duration

被引:5
|
作者
Hamad, Khaled [1 ,2 ]
Obaid, Lubna [1 ,2 ]
Nassif, Ali Bou [3 ]
Abu Dabous, Saleh [1 ,2 ]
Al-Ruzouq, Rami [1 ,2 ]
Zeiada, Waleed [1 ,2 ]
机构
[1] Univ Sharjah, Dept Civil & Environm Engn, Sharjah, U Arab Emirates
[2] Univ Sharjah, Res Inst Sci & Engn, Sustainable Civil Infrastruct Syst Res Grp, POB 27272, Sharjah, U Arab Emirates
[3] Univ Sharjah, Comp Engn Dept, Sharjah City, U Arab Emirates
关键词
Incident duration prediction; Feature selection; Machine learning classifiers; Classifiers comparative analysis; Incident classification; CLEARANCE TIME; INFLUENTIAL FACTORS; NEURAL-NETWORK; RESPONSE-TIME; DECISION TREE; TEXT ANALYSIS; M5P TREE; MODEL; CLASSIFICATION; FORECAST;
D O I
10.1007/s41062-023-01138-1
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This study compares the accuracy and complexity of eleven machine learning classifiers for the problem of incident duration prediction. The proposed framework integrates feature selection and modeling techniques to evaluate the effect of multiple influencing factors and choose the best model for predicting incident durations. Models were developed and tested using an incident dataset collected from the Houston TranStar incidents archive, including more than 110,000 records. Features were selected based on integrating information gain, correlation-based, and relief-based evaluators' results. The developed and fine-tuned classifiers were compared in terms of multiple accuracy measures (precision, recall, F-1 score, and AUC) and complexity measures (memory storage, training time, and testing times). Overall, results showed that among the developed models, the support vector machines (SVM), K-Nearest Neighborhoods, and Gaussian processes classification outperformed other classifiers with a prediction accuracy of 97%. The Decision Tree classifier recorded the lowest performance with a prediction accuracy of 82%. Considering a trade-off between the model's accuracy and complexity, the classifier with higher accuracy associated with low training time complexity was the K-Nearest Neighborhoods achieving an accuracy of 97%, 0.024 s of training time, 0.042 s of testing time, and a memory storage of 0.04 megabytes. Nevertheless, the SVM achieved the same accuracy of 97% yet consumed much lower memory storage of 0.004 megabytes and a testing time of 0.01 s. Although the K-NN recorded the lowest training time, the SVM can be considered the best model for the ID-prediction classification problem.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Predicting the duration of motorway incidents using machine learning
    Robert Corbally
    Linhao Yang
    Abdollah Malekjafarian
    European Transport Research Review, 16
  • [22] Predicting the duration of motorway incidents using machine learning
    Corbally, Robert
    Yang, Linhao
    Malekjafarian, Abdollah
    EUROPEAN TRANSPORT RESEARCH REVIEW, 2024, 16 (01)
  • [23] Comparative evaluation of machine learning classifiers with Obesity dataset
    Ramya, A.
    Rohini, K.
    2021 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES (ICCS 2021), 2021, : 38 - 41
  • [24] A Comprehensive Empirical Study of Bias Mitigation Methods for Machine Learning Classifiers
    Chen, Zhenpeng
    Zhang, Jie M.
    Sarro, Federica
    Harman, Mark
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (04)
  • [25] Predicting Traffic Incident Severity Level Using Machine Learning
    Elawady, Ahmed
    Khetrish, Abdulrauf
    Hamad, Khaled
    2021 14TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2021, : 432 - 437
  • [26] Predicting Radiological Panel Opinions Using a Panel of Machine Learning Classifiers
    Zinovev, Dmitriy
    Raicu, Daniela
    Furst, Jacob
    Armato, Samuel G., III
    ALGORITHMS, 2009, 2 (04) : 1473 - 1502
  • [27] EnsembleMatrix: Interactive Visualization to Support Machine Learning with Multiple Classifiers
    Talbot, Justin
    Lee, Bongshin
    Kapoor, Ashish
    Tan, Desney S.
    CHI2009: PROCEEDINGS OF THE 27TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1-4, 2009, : 1283 - 1292
  • [28] Detection of Melanoma with Multiple Machine Learning Classifiers in Dermoscopy Images
    Yildiz, Ugur Emre
    Kilic, Volkan
    2019 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2019, : 145 - 148
  • [29] Machine learning for predicting protein properties: A comprehensive review
    Wang, Yizhen
    Zhang, Yanyun
    Zhan, Xuhui
    He, Yuhao
    Yang, Yongfu
    Cheng, Li
    Alghazzawi, Daniyal
    NEUROCOMPUTING, 2024, 597
  • [30] Performance Evaluation Indicator (PEI): A new paradigm to evaluate the competence of machine learning classifiers in predicting rockmass conditions
    Zhu, Mengqi
    Gutierrez, Marte
    Zhu, Hehua
    Ju, J. Woody
    Sarna, Sharmin
    ADVANCED ENGINEERING INFORMATICS, 2021, 47