Investigating the Role of Dimension Reduction in Counteracting Machine Learning Performance Bias in TSMO Strategies

被引:0
|
作者
Attallah, Mustafa [1 ]
Kianfar, Jalil [1 ]
机构
[1] St Louis Univ, Dept Civil Comp & Elect Engn, St Louis, MO 63103 USA
关键词
TEXT ANALYSIS; INCIDENT; ERROR;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Machine learning models are increasingly being utilized to develop predictive models for Transportation Systems Management and Operations (TSMO) applications. These models are often assessed based on a global performance metric that evaluates the model's performance when the entire testing dataset is presented to the model. A TSMO application is expected to perform reliably and consistently in various situations and roadway conditions. The reliability and consistency of the model predictions for various scenarios are critical to the success of transportation agencies' efforts to address mobility and safety issues. Performance bias might be influencing the model when a model's performance is inconsistent for different scenarios. This paper investigates the performance bias that the traffic management center may face when applying machine learning methods to predict incident clearance time. Additionally, dimension-reduction techniques are employed as mitigation techniques in the model development process. This paper investigates the impact of two common dimension-reduction methods, important feature selection and principal component analysis, on performance bias. In a case study, this paper investigates the performance bias of RF, BRNN, SVR, KNN, XGB, GP, and NNET in incident clearance time prediction. Incident data from three interstate corridors in Missouri, USA, were utilized to develop and evaluate the models. Repeated k-fold cross-validation was used to prepare 20 training and testing sets to demonstrate and assess the learners' performance variations due to data splits. The results indicated that the seven learners suffered from performance bias. The analysis of the impact of dimension-reduction models revealed that the important feature selection method did not significantly mitigate the performance bias. On the other hand, the principal component analysis method significantly mitigated this bias for all learners, with poor-performing learners gaining the most improvements. In addition to contributing to reducing the performance bias, the principal component analysis significantly reduced the learners' global (i.e., overall) error metrics.
引用
收藏
页码:821 / 832
页数:12
相关论文
共 50 条
  • [41] Investigating on Combining System Dynamics and Machine Learning for Predicting Safety Performance in Construction Projects
    Nishat, Mirza Muntasir
    Borkenhagen, Ingrid Renolen
    Olsen, Jenni Sveen
    Rauzy, Antoine
    12TH NORDIC CONFERENCE ON CONSTRUCTION ECONOMICS AND ORGANISATION, 2024, 2024, 1389
  • [42] Machine Learning for Advanced Emission Monitoring and Reduction Strategies in Fossil Fuel Power Plants
    Zuo, Zitu
    Niu, Yongjie
    Li, Jiale
    Fu, Hongpeng
    Zhou, Mengjie
    APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [43] Dimension Reduction Techniques for Machine Learning-Based AC Microgrid Fault Diagnosis: A Systematic Review
    Zaben, Muiz M.
    Abido, Mohammad A.
    Worku, Muhammed Y.
    Hassan, Mohamed A.
    IEEE ACCESS, 2024, 12 : 160586 - 160612
  • [44] Catenary components state detection method based on the dimension reduction-kernel extreme learning machine
    Wu, Changdong
    INFRARED PHYSICS & TECHNOLOGY, 2024, 136
  • [45] The roles of differencing and dimension reduction in machine learning forecasting of employment level using the FRED big data
    Choi, Ji-Eun
    Shin, Dong Wan
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2019, 26 (05) : 497 - 506
  • [46] Strategies to optimise machine learning classification performance when using biomechanical features
    Liew, Bernard X.W.
    Pfisterer, Florian
    Rügamer, David
    Zhai, Xiaojun
    Journal of Biomechanics, 2024, 165
  • [47] Strategies to optimise machine learning classification performance when using biomechanical features
    Liew, Bernard X. W.
    Pfisterer, Florian
    Ruegamer, David
    Zhai, Xiaojun
    JOURNAL OF BIOMECHANICS, 2024, 165
  • [48] Performance Guarantees on Machine-Learning-based Overtaking Strategies for Autonomous Vehicles
    Nemeth, Balazs
    Hegedus, Tamas
    Gaspar, Peter
    2020 EUROPEAN CONTROL CONFERENCE (ECC 2020), 2020, : 136 - 141
  • [49] Representations and strategies for transferable machine learning improve model performance in chemical discovery
    Harper, Daniel R.
    Nandy, Aditya
    Arunachalam, Naveen
    Duan, Chenru
    Janet, Jon Paul
    Kulik, Heather J.
    JOURNAL OF CHEMICAL PHYSICS, 2022, 156 (07):
  • [50] Impact of Bit Allocation Strategies on Machine Learning Performance in Rate Limited Systems
    Gharouni, Afsaneh
    Rost, Peter
    Maeder, Andreas
    Schotten, Hans D.
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (06) : 1168 - 1172