Comprehensive evaluation of multiple machine learning classifiers for predicting freeway incident duration

被引:5
|
作者
Hamad, Khaled [1 ,2 ]
Obaid, Lubna [1 ,2 ]
Nassif, Ali Bou [3 ]
Abu Dabous, Saleh [1 ,2 ]
Al-Ruzouq, Rami [1 ,2 ]
Zeiada, Waleed [1 ,2 ]
机构
[1] Univ Sharjah, Dept Civil & Environm Engn, Sharjah, U Arab Emirates
[2] Univ Sharjah, Res Inst Sci & Engn, Sustainable Civil Infrastruct Syst Res Grp, POB 27272, Sharjah, U Arab Emirates
[3] Univ Sharjah, Comp Engn Dept, Sharjah City, U Arab Emirates
关键词
Incident duration prediction; Feature selection; Machine learning classifiers; Classifiers comparative analysis; Incident classification; CLEARANCE TIME; INFLUENTIAL FACTORS; NEURAL-NETWORK; RESPONSE-TIME; DECISION TREE; TEXT ANALYSIS; M5P TREE; MODEL; CLASSIFICATION; FORECAST;
D O I
10.1007/s41062-023-01138-1
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This study compares the accuracy and complexity of eleven machine learning classifiers for the problem of incident duration prediction. The proposed framework integrates feature selection and modeling techniques to evaluate the effect of multiple influencing factors and choose the best model for predicting incident durations. Models were developed and tested using an incident dataset collected from the Houston TranStar incidents archive, including more than 110,000 records. Features were selected based on integrating information gain, correlation-based, and relief-based evaluators' results. The developed and fine-tuned classifiers were compared in terms of multiple accuracy measures (precision, recall, F-1 score, and AUC) and complexity measures (memory storage, training time, and testing times). Overall, results showed that among the developed models, the support vector machines (SVM), K-Nearest Neighborhoods, and Gaussian processes classification outperformed other classifiers with a prediction accuracy of 97%. The Decision Tree classifier recorded the lowest performance with a prediction accuracy of 82%. Considering a trade-off between the model's accuracy and complexity, the classifier with higher accuracy associated with low training time complexity was the K-Nearest Neighborhoods achieving an accuracy of 97%, 0.024 s of training time, 0.042 s of testing time, and a memory storage of 0.04 megabytes. Nevertheless, the SVM achieved the same accuracy of 97% yet consumed much lower memory storage of 0.004 megabytes and a testing time of 0.01 s. Although the K-NN recorded the lowest training time, the SVM can be considered the best model for the ID-prediction classification problem.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Machine Learning Classifiers for Predicting Transit Fraud Emergent Research Forum (ERF)
    Claiborne, Jay
    Gupta, Ashish
    AMCIS 2018 PROCEEDINGS, 2018,
  • [42] Predicting the Risk of Diabetes and Heart Disease with Machine Learning Classifiers: The Mediation Analysis
    Verma, Ajay
    Jain, Manisha
    MEASUREMENT-INTERDISCIPLINARY RESEARCH AND PERSPECTIVES, 2024,
  • [43] Factorial design-machine learning approach for predicting incident durations
    Hamad, Khaled
    Obaid, Lubna
    Haridy, Salah
    Zeiada, Waleed
    Al-Khateeb, Ghazi
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2023, 38 (05) : 660 - 680
  • [44] Comprehensive Evaluation of Bankruptcy Prediction in Taiwanese Firms Using Multiple Machine Learning Models
    Pham, Hung, V
    Chu, Tuan
    Le, Tuan M.
    Tran, Hieu M.
    Tran, Huong T. K.
    Yen, Khanh N.
    Dao, Son V. T.
    INTERNATIONAL JOURNAL OF TECHNOLOGY, 2025, 16 (01) : 289 - 309
  • [45] EVALUATION OF MACHINE LEARNING APPROACHES IN PREDICTING THE INITIAL TREATMENT STRATEGY IN PATIENTS WITH MULTIPLE SCLEROSIS
    Li, J.
    Lin, Y.
    Huang, Y.
    Aparasu, R. R.
    VALUE IN HEALTH, 2024, 27 (06) : S268 - S269
  • [46] Performance Assessment of Multiple Machine Learning Classifiers for Detecting the Phishing URLs
    Rahman, Sheikh Shah Mohammad Motiur
    Rafiq, Fatama Binta
    Toma, Tapushe Rabaya
    Hossain, Syeda Sumbul
    Biplob, Khalid Been Badruzzaman
    DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 285 - 296
  • [47] Improved Arabic Characters Recognition by Combining Multiple Machine Learning Classifiers
    Alabbas, Maytham
    Khudeyer, Raidah S.
    Jaf, Sardar
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 262 - 265
  • [48] Classifying and Forecasting Traffic Incident Duration Using Various Machine Learning Techniques
    Rahmat-Ullah, Zakiya
    Alsmadi, Sara
    Hamad, Khaled
    2021 14TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2021, : 388 - 393
  • [49] Comparative study of statistical and machine learning methods for streetcar incident duration analysis
    Zhu, Siying
    INTERNATIONAL JOURNAL OF CRASHWORTHINESS, 2024, 29 (01) : 16 - 21
  • [50] Predicting hotel booking cancellations: a comprehensive machine learning approach
    Ampountolas, Apostolos
    JOURNAL OF REVENUE AND PRICING MANAGEMENT, 2025,