Comprehensive evaluation of multiple machine learning classifiers for predicting freeway incident duration

被引：5

作者：

Hamad, Khaled ^{[1
,2
]}

Obaid, Lubna ^{[1
,2
]}

Nassif, Ali Bou ^{[3
]}

Abu Dabous, Saleh ^{[1
,2
]}

Al-Ruzouq, Rami ^{[1
,2
]}

Zeiada, Waleed ^{[1
,2
]}

机构：

[1] Univ Sharjah, Dept Civil & Environm Engn, Sharjah, U Arab Emirates

[2] Univ Sharjah, Res Inst Sci & Engn, Sustainable Civil Infrastruct Syst Res Grp, POB 27272, Sharjah, U Arab Emirates

[3] Univ Sharjah, Comp Engn Dept, Sharjah City, U Arab Emirates

来源：

INNOVATIVE INFRASTRUCTURE SOLUTIONS | 2023年 / 8卷 / 06期

关键词：

Incident duration prediction; Feature selection; Machine learning classifiers; Classifiers comparative analysis; Incident classification; CLEARANCE TIME; INFLUENTIAL FACTORS; NEURAL-NETWORK; RESPONSE-TIME; DECISION TREE; TEXT ANALYSIS; M5P TREE; MODEL; CLASSIFICATION; FORECAST;

D O I：

10.1007/s41062-023-01138-1

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

This study compares the accuracy and complexity of eleven machine learning classifiers for the problem of incident duration prediction. The proposed framework integrates feature selection and modeling techniques to evaluate the effect of multiple influencing factors and choose the best model for predicting incident durations. Models were developed and tested using an incident dataset collected from the Houston TranStar incidents archive, including more than 110,000 records. Features were selected based on integrating information gain, correlation-based, and relief-based evaluators' results. The developed and fine-tuned classifiers were compared in terms of multiple accuracy measures (precision, recall, F-1 score, and AUC) and complexity measures (memory storage, training time, and testing times). Overall, results showed that among the developed models, the support vector machines (SVM), K-Nearest Neighborhoods, and Gaussian processes classification outperformed other classifiers with a prediction accuracy of 97%. The Decision Tree classifier recorded the lowest performance with a prediction accuracy of 82%. Considering a trade-off between the model's accuracy and complexity, the classifier with higher accuracy associated with low training time complexity was the K-Nearest Neighborhoods achieving an accuracy of 97%, 0.024 s of training time, 0.042 s of testing time, and a memory storage of 0.04 megabytes. Nevertheless, the SVM achieved the same accuracy of 97% yet consumed much lower memory storage of 0.004 megabytes and a testing time of 0.01 s. Although the K-NN recorded the lowest training time, the SVM can be considered the best model for the ID-prediction classification problem.

引用

页数：24

共 50 条

[41] Machine Learning Classifiers for Predicting Transit Fraud Emergent Research Forum (ERF)
Claiborne, Jay
Gupta, Ashish
AMCIS 2018 PROCEEDINGS, 2018,
[42] Predicting the Risk of Diabetes and Heart Disease with Machine Learning Classifiers: The Mediation Analysis
Verma, Ajay
Jain, Manisha
MEASUREMENT-INTERDISCIPLINARY RESEARCH AND PERSPECTIVES, 2024,
[43] Factorial design-machine learning approach for predicting incident durations
Hamad, Khaled
Obaid, Lubna
Haridy, Salah
Zeiada, Waleed
Al-Khateeb, Ghazi
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2023, 38 (05) : 660 - 680
[44] Comprehensive Evaluation of Bankruptcy Prediction in Taiwanese Firms Using Multiple Machine Learning Models
Pham, Hung, V
Chu, Tuan
Le, Tuan M.
Tran, Hieu M.
Tran, Huong T. K.
Yen, Khanh N.
Dao, Son V. T.
INTERNATIONAL JOURNAL OF TECHNOLOGY, 2025, 16 (01) : 289 - 309
[45] EVALUATION OF MACHINE LEARNING APPROACHES IN PREDICTING THE INITIAL TREATMENT STRATEGY IN PATIENTS WITH MULTIPLE SCLEROSIS
Li, J.
Lin, Y.
Huang, Y.
Aparasu, R. R.
VALUE IN HEALTH, 2024, 27 (06) : S268 - S269
[46] Performance Assessment of Multiple Machine Learning Classifiers for Detecting the Phishing URLs
Rahman, Sheikh Shah Mohammad Motiur
Rafiq, Fatama Binta
Toma, Tapushe Rabaya
Hossain, Syeda Sumbul
Biplob, Khalid Been Badruzzaman
DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 285 - 296
[47] Improved Arabic Characters Recognition by Combining Multiple Machine Learning Classifiers
Alabbas, Maytham
Khudeyer, Raidah S.
Jaf, Sardar
PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 262 - 265
[48] Classifying and Forecasting Traffic Incident Duration Using Various Machine Learning Techniques
Rahmat-Ullah, Zakiya
Alsmadi, Sara
Hamad, Khaled
2021 14TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2021, : 388 - 393
[49] Comparative study of statistical and machine learning methods for streetcar incident duration analysis
Zhu, Siying
INTERNATIONAL JOURNAL OF CRASHWORTHINESS, 2024, 29 (01) : 16 - 21
[50] Predicting hotel booking cancellations: a comprehensive machine learning approach
Ampountolas, Apostolos
JOURNAL OF REVENUE AND PRICING MANAGEMENT, 2025,

← 1 2 3 4 5 →