Comprehensive evaluation of multiple machine learning classifiers for predicting freeway incident duration

Cited by: 5
Authors
Hamad, Khaled [1, 2]
Obaid, Lubna [1, 2]
Nassif, Ali Bou [3]
Abu Dabous, Saleh [1, 2]
Al-Ruzouq, Rami [1, 2]
Zeiada, Waleed [1, 2]
Affiliations
[1] Univ Sharjah, Dept Civil & Environm Engn, Sharjah, U Arab Emirates
[2] Univ Sharjah, Res Inst Sci & Engn, Sustainable Civil Infrastruct Syst Res Grp, POB 27272, Sharjah, U Arab Emirates
[3] Univ Sharjah, Comp Engn Dept, Sharjah City, U Arab Emirates
Keywords
Incident duration prediction; Feature selection; Machine learning classifiers; Classifiers comparative analysis; Incident classification; CLEARANCE TIME; INFLUENTIAL FACTORS; NEURAL-NETWORK; RESPONSE-TIME; DECISION TREE; TEXT ANALYSIS; M5P TREE; MODEL; CLASSIFICATION; FORECAST;
DOI
10.1007/s41062-023-01138-1
Chinese Library Classification number
TU [Building Science];
Discipline classification code
0813;
Abstract
This study compares the accuracy and complexity of eleven machine learning classifiers for incident duration (ID) prediction. The proposed framework integrates feature selection and modeling techniques to evaluate the effect of multiple influencing factors and to select the best model for predicting incident durations. Models were developed and tested on a dataset of more than 110,000 records collected from the Houston TranStar incident archive. Features were selected by integrating the results of information gain, correlation-based, and relief-based evaluators. The developed and fine-tuned classifiers were compared on multiple accuracy measures (precision, recall, F1 score, and AUC) and complexity measures (memory storage, training time, and testing time). Among the developed models, support vector machines (SVM), K-Nearest Neighbors (K-NN), and Gaussian process classification outperformed the other classifiers, with a prediction accuracy of 97%. The Decision Tree classifier recorded the lowest performance, with a prediction accuracy of 82%. Considering the trade-off between accuracy and complexity, the classifier that combined high accuracy with low training time was K-NN, which achieved an accuracy of 97% with 0.024 s of training time, 0.042 s of testing time, and 0.04 megabytes of memory storage. The SVM, however, achieved the same 97% accuracy while using much less memory (0.004 megabytes) and a shorter testing time (0.01 s). Although K-NN recorded the lowest training time, the SVM can be considered the best model for the ID-prediction classification problem.
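The kind of accuracy-versus-complexity comparison described in the abstract can be illustrated with a minimal, standard-library Python sketch. It is not the authors' implementation: the tiny `KNNClassifier`, the helper names `precision_recall_f1` and `evaluate`, and the six-point toy dataset (two made-up numeric features, labels "short" and "long") are all assumptions introduced here for illustration. The serialized size of the fitted state stands in as a rough proxy for memory storage, and AUC is omitted for brevity.

```python
import math
import pickle
import time
from collections import Counter


class KNNClassifier:
    """Tiny k-nearest-neighbors classifier (illustrative, not the paper's model)."""

    def __init__(self, k=3):
        self.k = k

    def fit(self, X, y):
        # k-NN is a lazy learner: "training" only stores the data,
        # which is why its training time tends to be very low.
        self.X, self.y = list(X), list(y)
        return self

    def predict(self, X):
        preds = []
        for x in X:
            # Rank stored points by Euclidean distance and keep the k closest.
            nearest = sorted(zip(self.X, self.y),
                             key=lambda pair: math.dist(pair[0], x))[: self.k]
            votes = Counter(label for _, label in nearest)
            preds.append(votes.most_common(1)[0][0])
        return preds


def precision_recall_f1(y_true, y_pred, positive):
    """Precision, recall, and F1 for one class treated as positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def evaluate(model, X_train, y_train, X_test, y_test, positive):
    """Collect accuracy measures alongside simple complexity measures."""
    start = time.perf_counter()
    model.fit(X_train, y_train)
    train_s = time.perf_counter() - start

    start = time.perf_counter()
    y_pred = model.predict(X_test)
    test_s = time.perf_counter() - start

    precision, recall, f1 = precision_recall_f1(y_test, y_pred, positive)
    return {
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "train_s": train_s,
        "test_s": test_s,
        # Serialized size of the fitted state, a rough memory-storage proxy.
        "model_bytes": len(pickle.dumps(vars(model))),
    }


# Made-up toy data (NOT from the Houston TranStar archive).
X_train = [(1, 1), (1, 2), (2, 1), (4, 3), (4, 4), (3, 4)]
y_train = ["short", "short", "short", "long", "long", "long"]
X_test = [(2, 2), (4, 5), (1, 0), (3, 3)]
y_test = ["short", "long", "short", "short"]

result = evaluate(KNNClassifier(k=3), X_train, y_train, X_test, y_test,
                  positive="long")
print(result)
```

Running `evaluate` once per candidate classifier and tabulating the returned dictionaries gives a side-by-side view of the same trade-off the abstract reports, where two models can tie on accuracy but differ in training time, testing time, and storage.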
Pages: 24