Coverage for Identifying Critical Metadata in Machine Learning Operating Envelopes

被引:0
|
作者
Lanus, Erin [1 ]
Lee, Brian [1 ]
Pol, Luis [1 ]
Sobien, Daniel [1 ]
Kauffman, Justin [1 ]
Freeman, Laura J. [1 ]
机构
[1] Virginia Tech, Virginia Tech Natl Secur Inst, Arlington, VA 22203 USA
关键词
combinatorial testing; design of experiments; machine learning; operating envelopes;
D O I
10.1109/ICSTW60967.2024.00050
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Specifying the conditions under which a machine learning (ML) model was trained is crucial to defining the operating envelope which in turn is important for understanding where the model has known and unknown performance. Metrics such as combinatorial coverage applied over metadata features provide a mechanism for defining the envelope for computer vision algorithms, but not all metadata features impact performance. In this work, we propose Systematic Inclusion & Exclusion, an experimental framework that draws on practices from combinatorial interaction testing and design of experiments to identify the critical metadata features that define the dimensions of the operating envelope. A data splitting algorithm to construct training and test sets for a collection of models is developed to implement the framework. The framework is demonstrated on an open-source dataset and learning algorithm, and future directions and improvements are suggested.
引用
收藏
页码:217 / 226
页数:10
相关论文
共 50 条
  • [31] Machine Learning Security Defense Algorithms Based on Metadata Correlation Features
    Jia, Ruchun
    Zhang, Jianwei
    Lin, Yi
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (02): : 2391 - 2418
  • [32] Identifying plastics with photoluminescence spectroscopy and machine learning
    Lotter, Benjamin
    Konde, Srumika
    Nguyen, Johnny
    Grau, Michael
    Koch, Martin
    Lenz, Peter
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [33] Identifying novel oncogenes: A machine learning approach
    Ambuj Kumar
    Vidya Rajendran
    Rao Sethumadhavan
    Rituraj Purohit
    Interdisciplinary Sciences: Computational Life Sciences, 2013, 5 : 241 - 246
  • [34] Identifying and Correcting Label Bias in Machine Learning
    Jiang, Heinrich
    Nachum, Ofir
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 702 - 711
  • [35] Identifying lightning structures via machine learning
    Wang, Lingxiao
    Hare, Brian M.
    Zhou, Kai
    Stoecker, Horst
    Scholten, Olaf
    CHAOS SOLITONS & FRACTALS, 2023, 170
  • [36] Machine Learning in the Tasks of Identifying Unwanted Content
    Gorodnichev, M. G.
    Vanushina, A. V.
    Moseva, M. S.
    Trubnikova, N., V
    2019 WAVE ELECTRONICS AND ITS APPLICATION IN INFORMATION AND TELECOMMUNICATION SYSTEMS (WECONF), 2019,
  • [37] Identifying risk of stillbirth using machine learning
    Cersonsky, Tess E. K.
    Ayala, Nina K.
    Pinar, Halit
    Dudley, Donald J.
    Saade, George R.
    Silver, Robert M.
    Lewkowitz, Adam K.
    AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 2023, 229 (03) : 327.e1 - 327.e16
  • [38] Machine learning for identifying troubled life insurer
    Zhang, T
    Li, J
    Li, Z
    MLMTA'03: INTERNATIONAL CONFERENCE ON MACHINE LEARNING; MODELS, TECHNOLOGIES AND APPLICATIONS, 2003, : 216 - 219
  • [39] Machine Learning Algorithms for Identifying Fake Currencies
    Chandrappa S.
    Chandra Shekar P.
    Chaya P.
    Dharmanna L.
    Guruprasad M.S.
    SN Computer Science, 4 (4)
  • [40] Identifying Zeolite Frameworks with a Machine Learning Approach
    Yang, Shujiang
    Lach-hab, Mohammed
    Vaisman, Iosif I.
    Blaisten-Barojas, Estela
    JOURNAL OF PHYSICAL CHEMISTRY C, 2009, 113 (52): : 21721 - 21725