Coverage for Identifying Critical Metadata in Machine Learning Operating Envelopes

被引:0
|
作者
Lanus, Erin [1 ]
Lee, Brian [1 ]
Pol, Luis [1 ]
Sobien, Daniel [1 ]
Kauffman, Justin [1 ]
Freeman, Laura J. [1 ]
机构
[1] Virginia Tech, Virginia Tech Natl Secur Inst, Arlington, VA 22203 USA
关键词
combinatorial testing; design of experiments; machine learning; operating envelopes;
D O I
10.1109/ICSTW60967.2024.00050
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Specifying the conditions under which a machine learning (ML) model was trained is crucial to defining the operating envelope which in turn is important for understanding where the model has known and unknown performance. Metrics such as combinatorial coverage applied over metadata features provide a mechanism for defining the envelope for computer vision algorithms, but not all metadata features impact performance. In this work, we propose Systematic Inclusion & Exclusion, an experimental framework that draws on practices from combinatorial interaction testing and design of experiments to identify the critical metadata features that define the dimensions of the operating envelope. A data splitting algorithm to construct training and test sets for a collection of models is developed to implement the framework. The framework is demonstrated on an open-source dataset and learning algorithm, and future directions and improvements are suggested.
引用
收藏
页码:217 / 226
页数:10
相关论文
共 50 条
  • [41] Identifying drug interactions using machine learning
    Demirsoy, Idris
    Karaibrahimoglu, Adnan
    ADVANCES IN CLINICAL AND EXPERIMENTAL MEDICINE, 2023, 32 (08): : 829 - 838
  • [42] Identifying Pathogens of Foodborne Diseases with Machine Learning
    Wang H.
    Cui W.
    Zhou Y.
    Du Y.
    Data Analysis and Knowledge Discovery, 2021, 5 (09) : 54 - 62
  • [43] Identifying epilepsy psychiatric comorbidities with machine learning
    Glauser, Tracy
    Santel, Daniel
    DelBello, Melissa
    Faist, Robert
    Toon, Tonia
    Clark, Peggy
    McCourt, Rachel
    Wissel, Benjamin
    Pestian, John
    ACTA NEUROLOGICA SCANDINAVICA, 2020, 141 (05): : 388 - 396
  • [45] MACHINE LEARNING (AI) FOR IDENTIFYING SMART CITIES
    Hammoumi, L.
    Rhinane, H.
    8TH INTERNATIONAL CONFERENCE ON GEOINFORMATION ADVANCES, GEOADVANCES 2024, VOL. 48-4, 2024, : 221 - 228
  • [46] Identifying Novel Oncogenes: A Machine Learning Approach
    Kumar, Ambuj
    Rajendran, Vidya
    Sethumadhavan, Rao
    Purohit, Rituraj
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2013, 5 (04) : 241 - 246
  • [47] Identifying Vulnerable Households Using Machine Learning
    Gao, Chen
    Fei, Chengcheng J.
    McCarl, Bruce A.
    Leatham, David J.
    SUSTAINABILITY, 2020, 12 (15)
  • [48] Machine Learning for Identifying Group Trajectory Outliers
    Belhadi, Asma
    Djenouri, Youcef
    Djenouri, Djamel
    Michalak, Tomasz
    Lin, Jerry Chun-Wei
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2021, 12 (02)
  • [49] Identifying plastics with photoluminescence spectroscopy and machine learning
    Benjamin Lotter
    Srumika Konde
    Johnny Nguyen
    Michael Grau
    Martin Koch
    Peter Lenz
    Scientific Reports, 12
  • [50] Identifying critical operating parameters and mechanism for a manganese sulphide precipitation process
    Lewis, Alison
    Nathoo, Jeeten
    Glueck, Thomas
    BIWIC 2006, 2006, : 1 - +