Investigating the Role of Dimension Reduction in Counteracting Machine Learning Performance Bias in TSMO Strategies

被引：0

作者：

Attallah, Mustafa ^{[1
]}

Kianfar, Jalil ^{[1
]}

机构：

[1] St Louis Univ, Dept Civil Comp & Elect Engn, St Louis, MO 63103 USA

来源：

INTERNATIONAL CONFERENCE ON TRANSPORTATION AND DEVELOPMENT 2024: TRANSPORTATION SAFETY AND EMERGING TECHNOLOGIES, ICTD 2024 | 2024年

关键词：

TEXT ANALYSIS; INCIDENT; ERROR;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Machine learning models are increasingly being utilized to develop predictive models for Transportation Systems Management and Operations (TSMO) applications. These models are often assessed based on a global performance metric that evaluates the model's performance when the entire testing dataset is presented to the model. A TSMO application is expected to perform reliably and consistently in various situations and roadway conditions. The reliability and consistency of the model predictions for various scenarios are critical to the success of transportation agencies' efforts to address mobility and safety issues. Performance bias might be influencing the model when a model's performance is inconsistent for different scenarios. This paper investigates the performance bias that the traffic management center may face when applying machine learning methods to predict incident clearance time. Additionally, dimension-reduction techniques are employed as mitigation techniques in the model development process. This paper investigates the impact of two common dimension-reduction methods, important feature selection and principal component analysis, on performance bias. In a case study, this paper investigates the performance bias of RF, BRNN, SVR, KNN, XGB, GP, and NNET in incident clearance time prediction. Incident data from three interstate corridors in Missouri, USA, were utilized to develop and evaluate the models. Repeated k-fold cross-validation was used to prepare 20 training and testing sets to demonstrate and assess the learners' performance variations due to data splits. The results indicated that the seven learners suffered from performance bias. The analysis of the impact of dimension-reduction models revealed that the important feature selection method did not significantly mitigate the performance bias. On the other hand, the principal component analysis method significantly mitigated this bias for all learners, with poor-performing learners gaining the most improvements. In addition to contributing to reducing the performance bias, the principal component analysis significantly reduced the learners' global (i.e., overall) error metrics.

引用

页码：821 / 832

页数：12

共 50 条

[1] Dimension Reduction With Extreme Learning Machine
Kasun, Liyanaarachchi Lekamalage Chamara
Yang, Yan
Huang, Guang-Bin
Zhang, Zhengyou
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (08) : 3906 - 3918
[2] Adversarial learning with optimism for bias reduction in machine learning
Yu-Chen Cheng
Po-An Chen
Feng-Chi Chen
Ya-Wen Cheng
AI and Ethics, 2024, 4 (4): : 1389 - 1402
[3] Investigating anatomical bias in clinical machine learning algorithms
Pedersen, Jannik Skyttegaard
Laursen, Martin Sundahl
Vinholt, Pernille Just
Alnor, Anne Bryde
Savarimuthu, Thiusius Rajeeth
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1398 - 1410
[4] NORMALIZATION AND DIMENSION REDUCTION FOR MACHINE LEARNING IN ADVANCED MANUFACTURING
Huang, Jida
Kwok, Tsz-Ho
PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 2, 2022,
[5] Comparing Swarm Intelligence Algorithms for Dimension Reduction in Machine Learning
Kicska, Gabriella
Kiss, Attila
BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (03)
[6] Entropy Dimension Reduction Method for Randomized Machine Learning Problems
Popkov, Yu. S.
Dubnov, Yu. A.
Popkov, A. Yu.
AUTOMATION AND REMOTE CONTROL, 2018, 79 (11) : 2038 - 2051
[7] Entropy Dimension Reduction Method for Randomized Machine Learning Problems
Yu. S. Popkov
Yu. A. Dubnov
A. Yu. Popkov
Automation and Remote Control, 2018, 79 : 2038 - 2051
[8] Investigating Intrinsic Bias in Publicly Available Critical Care Datasets for Machine Learning
Langnas, Erica
Fong, Nicholas
Law, Tyler
Chyan, Arthur
Lipnick, Michael
Pirracchio, Romain
ANESTHESIA AND ANALGESIA, 2023, 136 : 266 - 267
[9] Graph Embedding-Based Dimension Reduction With Extreme Learning Machine
Yang, Le
Song, Shiji
Li, Shuang
Chen, Yiming
Huang, Gao
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (07): : 4262 - 4273
[10] Investigating the performance of Hadoop and Spark platforms on machine learning algorithms
Ali Mostafaeipour
Amir Jahangard Rafsanjani
Mohammad Ahmadi
Joshuva Arockia Dhanraj
The Journal of Supercomputing, 2021, 77 : 1273 - 1300

← 1 2 3 4 5 →