Comprehensive evaluation of multiple machine learning classifiers for predicting freeway incident duration

被引：5

作者：

Hamad, Khaled ^{[1
,2
]}

Obaid, Lubna ^{[1
,2
]}

Nassif, Ali Bou ^{[3
]}

Abu Dabous, Saleh ^{[1
,2
]}

Al-Ruzouq, Rami ^{[1
,2
]}

Zeiada, Waleed ^{[1
,2
]}

机构：

[1] Univ Sharjah, Dept Civil & Environm Engn, Sharjah, U Arab Emirates

[2] Univ Sharjah, Res Inst Sci & Engn, Sustainable Civil Infrastruct Syst Res Grp, POB 27272, Sharjah, U Arab Emirates

[3] Univ Sharjah, Comp Engn Dept, Sharjah City, U Arab Emirates

来源：

INNOVATIVE INFRASTRUCTURE SOLUTIONS | 2023年 / 8卷 / 06期

关键词：

Incident duration prediction; Feature selection; Machine learning classifiers; Classifiers comparative analysis; Incident classification; CLEARANCE TIME; INFLUENTIAL FACTORS; NEURAL-NETWORK; RESPONSE-TIME; DECISION TREE; TEXT ANALYSIS; M5P TREE; MODEL; CLASSIFICATION; FORECAST;

D O I：

10.1007/s41062-023-01138-1

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

This study compares the accuracy and complexity of eleven machine learning classifiers for the problem of incident duration prediction. The proposed framework integrates feature selection and modeling techniques to evaluate the effect of multiple influencing factors and choose the best model for predicting incident durations. Models were developed and tested using an incident dataset collected from the Houston TranStar incidents archive, including more than 110,000 records. Features were selected based on integrating information gain, correlation-based, and relief-based evaluators' results. The developed and fine-tuned classifiers were compared in terms of multiple accuracy measures (precision, recall, F-1 score, and AUC) and complexity measures (memory storage, training time, and testing times). Overall, results showed that among the developed models, the support vector machines (SVM), K-Nearest Neighborhoods, and Gaussian processes classification outperformed other classifiers with a prediction accuracy of 97%. The Decision Tree classifier recorded the lowest performance with a prediction accuracy of 82%. Considering a trade-off between the model's accuracy and complexity, the classifier with higher accuracy associated with low training time complexity was the K-Nearest Neighborhoods achieving an accuracy of 97%, 0.024 s of training time, 0.042 s of testing time, and a memory storage of 0.04 megabytes. Nevertheless, the SVM achieved the same accuracy of 97% yet consumed much lower memory storage of 0.004 megabytes and a testing time of 0.01 s. Although the K-NN recorded the lowest training time, the SVM can be considered the best model for the ID-prediction classification problem.

引用

页数：24

共 50 条

[21] Predicting the duration of motorway incidents using machine learning
Robert Corbally
Linhao Yang
Abdollah Malekjafarian
European Transport Research Review, 16
[22] Predicting the duration of motorway incidents using machine learning
Corbally, Robert
Yang, Linhao
Malekjafarian, Abdollah
EUROPEAN TRANSPORT RESEARCH REVIEW, 2024, 16 (01)
[23] Comparative evaluation of machine learning classifiers with Obesity dataset
Ramya, A.
Rohini, K.
2021 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES (ICCS 2021), 2021, : 38 - 41
[24] A Comprehensive Empirical Study of Bias Mitigation Methods for Machine Learning Classifiers
Chen, Zhenpeng
Zhang, Jie M.
Sarro, Federica
Harman, Mark
ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (04)
[25] Predicting Traffic Incident Severity Level Using Machine Learning
Elawady, Ahmed
Khetrish, Abdulrauf
Hamad, Khaled
2021 14TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2021, : 432 - 437
[26] Predicting Radiological Panel Opinions Using a Panel of Machine Learning Classifiers
Zinovev, Dmitriy
Raicu, Daniela
Furst, Jacob
Armato, Samuel G., III
ALGORITHMS, 2009, 2 (04) : 1473 - 1502
[27] EnsembleMatrix: Interactive Visualization to Support Machine Learning with Multiple Classifiers
Talbot, Justin
Lee, Bongshin
Kapoor, Ashish
Tan, Desney S.
CHI2009: PROCEEDINGS OF THE 27TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1-4, 2009, : 1283 - 1292
[28] Detection of Melanoma with Multiple Machine Learning Classifiers in Dermoscopy Images
Yildiz, Ugur Emre
Kilic, Volkan
2019 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2019, : 145 - 148
[29] Machine learning for predicting protein properties: A comprehensive review
Wang, Yizhen
Zhang, Yanyun
Zhan, Xuhui
He, Yuhao
Yang, Yongfu
Cheng, Li
Alghazzawi, Daniyal
NEUROCOMPUTING, 2024, 597
[30] Performance Evaluation Indicator (PEI): A new paradigm to evaluate the competence of machine learning classifiers in predicting rockmass conditions
Zhu, Mengqi
Gutierrez, Marte
Zhu, Hehua
Ju, J. Woody
Sarna, Sharmin
ADVANCED ENGINEERING INFORMATICS, 2021, 47

← 1 2 3 4 5 →