Coverage for Identifying Critical Metadata in Machine Learning Operating Envelopes

Cited by: 0
Authors
Lanus, Erin [1]
Lee, Brian [1]
Pol, Luis [1]
Sobien, Daniel [1]
Kauffman, Justin [1]
Freeman, Laura J. [1]
Affiliations
[1] Virginia Tech, Virginia Tech Natl Secur Inst, Arlington, VA 22203 USA
Keywords
combinatorial testing; design of experiments; machine learning; operating envelopes
DOI
10.1109/ICSTW60967.2024.00050
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Specifying the conditions under which a machine learning (ML) model was trained is crucial to defining its operating envelope, which in turn is important for understanding where the model has known and unknown performance. Metrics such as combinatorial coverage applied over metadata features provide a mechanism for defining the envelope for computer vision algorithms, but not all metadata features impact performance. In this work, we propose Systematic Inclusion & Exclusion, an experimental framework that draws on practices from combinatorial interaction testing and design of experiments to identify the critical metadata features that define the dimensions of the operating envelope. A data splitting algorithm that constructs training and test sets for a collection of models is developed to implement the framework. The framework is demonstrated on an open-source dataset and learning algorithm, and future directions and improvements are suggested.
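As a rough illustration of the combinatorial coverage metric the abstract mentions, the sketch below computes the fraction of t-way metadata value combinations that appear in a training set. It is not the authors' implementation or their data splitting algorithm; the function name `combinatorial_coverage`, the feature names, and the sample records are all hypothetical and chosen only for this example.

```python
from itertools import combinations, product


def combinatorial_coverage(rows, domains, t=2):
    """Return the fraction of possible t-way value combinations seen in rows.

    rows:    list of dicts mapping metadata feature name -> observed value
    domains: dict mapping feature name -> list of all values it can take
    t:       interaction strength (2 = pairs of features)
    """
    features = sorted(domains)
    covered = 0
    total = 0
    for combo in combinations(features, t):
        # Every value combination that could occur for this feature subset.
        possible = set(product(*(domains[f] for f in combo)))
        # Value combinations that actually occur in the data.
        seen = {tuple(row[f] for f in combo) for row in rows}
        covered += len(seen & possible)
        total += len(possible)
    return covered / total if total else 0.0


# Hypothetical metadata features for an image dataset (assumed, not from the paper).
domains = {
    "time_of_day": ["day", "night"],
    "weather": ["clear", "rain"],
    "sensor": ["rgb", "ir"],
}
train_metadata = [
    {"time_of_day": "day", "weather": "clear", "sensor": "rgb"},
    {"time_of_day": "day", "weather": "rain", "sensor": "ir"},
    {"time_of_day": "night", "weather": "clear", "sensor": "ir"},
]

coverage_2way = combinatorial_coverage(train_metadata, domains, t=2)
coverage_3way = combinatorial_coverage(train_metadata, domains, t=3)
print(f"2-way coverage: {coverage_2way:.3f}")
print(f"3-way coverage: {coverage_3way:.3f}")
```

Under this reading, combinations missing from the training metadata mark regions where model performance is untested. Deciding which features actually matter for performance is the separate question the proposed framework addresses.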
Pages: 217-226
Page count: 10