Coverage for Identifying Critical Metadata in Machine Learning Operating Envelopes

被引:0
|
作者
Lanus, Erin [1 ]
Lee, Brian [1 ]
Pol, Luis [1 ]
Sobien, Daniel [1 ]
Kauffman, Justin [1 ]
Freeman, Laura J. [1 ]
机构
[1] Virginia Tech, Virginia Tech Natl Secur Inst, Arlington, VA 22203 USA
关键词
combinatorial testing; design of experiments; machine learning; operating envelopes;
D O I
10.1109/ICSTW60967.2024.00050
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Specifying the conditions under which a machine learning (ML) model was trained is crucial to defining the operating envelope which in turn is important for understanding where the model has known and unknown performance. Metrics such as combinatorial coverage applied over metadata features provide a mechanism for defining the envelope for computer vision algorithms, but not all metadata features impact performance. In this work, we propose Systematic Inclusion & Exclusion, an experimental framework that draws on practices from combinatorial interaction testing and design of experiments to identify the critical metadata features that define the dimensions of the operating envelope. A data splitting algorithm to construct training and test sets for a collection of models is developed to implement the framework. The framework is demonstrated on an open-source dataset and learning algorithm, and future directions and improvements are suggested.
引用
收藏
页码:217 / 226
页数:10
相关论文
共 50 条
  • [21] Machine learning for automation usage prediction: identifying critical factors in driver decision-making
    Orellana, Carlos Bustamante
    Rodriguez, Lucero Rodriguez
    Huang, Lixiao
    Cooke, Nancy
    Kang, Yun
    APPLIED INTELLIGENCE, 2025, 55 (01)
  • [22] Coverage Prediction for Target Coverage in WSN Using Machine Learning Approaches
    Chaturvedi, Pooja
    Daniel, A. K.
    Narayan, Vipul
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 137 (02) : 931 - 950
  • [23] Measuring and Visualizing Dataset Coverage for Machine Learning
    Kuhn, D. Richard
    Raunak, M. S.
    Kacker, Raghu N.
    COMPUTER, 2025, 58 (04) : 18 - 26
  • [24] Applying Machine Learning to Design for Reliability Coverage
    Chang, Norman
    Chuang, Wentze
    Tsavatapalli, Ganesh Kumar
    Geada, Joao
    Zhuang, Hao
    Ramachandran, Sankar
    Rajan, Rahul
    Li, Ying-Shiun
    Jia, Yaowei
    Kaipanatu, Mathew
    Mantena, Suresh Kumar
    Shih, Ming-Chih
    Yang, Anita
    Jang, Roger
    2019 IEEE INTERNATIONAL RELIABILITY PHYSICS SYMPOSIUM (IRPS), 2019,
  • [25] Coverage bias in small molecule machine learning
    Kretschmer, Fleming
    Seipp, Jan
    Ludwig, Marcus
    Klau, Gunnar W.
    Boecker, Sebastian
    NATURE COMMUNICATIONS, 2025, 16 (01)
  • [26] Extracting metadata of scientific references in patents based on combination of representation learning and machine learning
    Zhang J.
    Hu Y.
    2018, John Wiley and Sons Inc (55) : 948 - 950
  • [27] Detection of Cognitive Impairment From eSAGE Metadata Using Machine Learning
    Kawakami, Ryoma
    Wright, Kathy D.
    Scharre, Douglas W.
    Ning, Xia
    ALZHEIMER DISEASE & ASSOCIATED DISORDERS, 2024, 38 (01): : 22 - 27
  • [28] Machine Learning in Finance: A Metadata-Based Systematic Review of the Literature
    Warin, Thierry
    Stojkov, Aleksandar
    JOURNAL OF RISK AND FINANCIAL MANAGEMENT, 2021, 14 (07)
  • [29] A Machine Learning Method for Identifying Critical Interactions Between Gene Pairs in Alzheimer's Disease Prediction
    Chen, Hao
    He, Yong
    Ji, Jiadong
    Shi, Yufeng
    FRONTIERS IN NEUROLOGY, 2019, 10
  • [30] Identifying the critical features influencing warehouse rental prices and their nonlinear associations: A spatial machine learning approach
    He, Nannan
    Liu, Sijing
    Cao, Jason
    Li, Guoqi
    Jian, Ming
    TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2025, 197