Investigating the Role of Dimension Reduction in Counteracting Machine Learning Performance Bias in TSMO Strategies

Cited by: 0
Authors
Attallah, Mustafa [1]
Kianfar, Jalil [1]
Affiliations
[1] St Louis Univ, Dept Civil Comp & Elect Engn, St Louis, MO 63103 USA
Keywords
TEXT ANALYSIS; INCIDENT; ERROR
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Machine learning is increasingly used to build predictive models for Transportation Systems Management and Operations (TSMO) applications. These models are often assessed with a global performance metric computed over the entire testing dataset. A TSMO application, however, is expected to perform reliably and consistently across situations and roadway conditions, and that reliability and consistency are critical to the success of transportation agencies' efforts to address mobility and safety issues. When a model's performance is inconsistent across scenarios, the model may suffer from performance bias. This paper investigates the performance bias that a traffic management center may face when applying machine learning methods to predict incident clearance time. It also evaluates dimension reduction as a mitigation technique during model development, examining the impact of two common methods, important feature selection and principal component analysis, on performance bias. In a case study, the performance bias of seven learners (RF, BRNN, SVR, KNN, XGB, GP, and NNET) is assessed for incident clearance time prediction. Incident data from three interstate corridors in Missouri, USA, were used to develop and evaluate the models. Repeated k-fold cross-validation was used to prepare 20 training and testing sets to demonstrate and assess variation in the learners' performance due to data splits. The results indicated that all seven learners suffered from performance bias. Important feature selection did not significantly mitigate the performance bias. Principal component analysis, on the other hand, significantly mitigated this bias for all learners, with poor-performing learners gaining the most. In addition to reducing performance bias, principal component analysis significantly reduced the learners' global (i.e., overall) error metrics.
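The distinction the abstract draws between a global error metric and scenario-level consistency can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration rather than the authors' method: the function names (`repeated_kfold`, `scenario_errors`, `bias_gap`) are hypothetical, mean absolute error stands in for whatever metric the paper uses, 5 folds with 4 repeats is just one way to obtain 20 training and testing sets (the abstract does not state k), and the clearance times are synthetic.

```python
import random
from collections import defaultdict


def repeated_kfold(n_samples, k=5, repeats=4, seed=0):
    """Yield (train_idx, test_idx) pairs; k * repeats splits in total."""
    rng = random.Random(seed)
    for _ in range(repeats):
        idx = list(range(n_samples))
        rng.shuffle(idx)
        folds = [idx[i::k] for i in range(k)]  # k roughly equal folds
        for i in range(k):
            test = folds[i]
            train = [j for f_i, f in enumerate(folds) if f_i != i for j in f]
            yield train, test


def mae(y_true, y_pred):
    """Mean absolute error (assumed metric, not named in the abstract)."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def scenario_errors(y_true, y_pred, scenarios):
    """Return the global MAE and a dict of MAE per scenario label."""
    groups = defaultdict(lambda: ([], []))
    for t, p, s in zip(y_true, y_pred, scenarios):
        groups[s][0].append(t)
        groups[s][1].append(p)
    per_scenario = {s: mae(ts, ps) for s, (ts, ps) in groups.items()}
    return mae(y_true, y_pred), per_scenario


def bias_gap(per_scenario):
    """Spread between the worst and best scenario errors (hypothetical measure)."""
    return max(per_scenario.values()) - min(per_scenario.values())


if __name__ == "__main__":
    # Synthetic clearance times in minutes; not data from the paper.
    scenarios = ["minor"] * 80 + ["major"] * 20
    y = [30.0] * 80 + [120.0] * 20

    for train, test in repeated_kfold(len(y), k=5, repeats=4, seed=0):
        # Naive baseline learner: always predict the training mean.
        mean_pred = sum(y[i] for i in train) / len(train)
        y_test = [y[i] for i in test]
        s_test = [scenarios[i] for i in test]
        global_mae, per = scenario_errors(y_test, [mean_pred] * len(test), s_test)
        print(f"global MAE={global_mae:.1f}  gap={bias_gap(per):.1f}  {per}")
```

In this toy setup a single global MAE can look moderate while the error for the rarer scenario is several times larger. Comparing the gap before and after a dimension-reduction step would be one possible way to quantify any mitigation.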
Pages: 821-832
Page count: 12