ATOM: Automated Black-Box Testing of Multi-Label Image Classification Systems

被引:0
|
作者
Hu, Shengyou [1 ,2 ]
Wu, Huayao [1 ,2 ]
Wang, Peng [1 ,2 ]
Chang, Jing [3 ]
Tu, Yongjun [3 ]
Jiang, Xiu [3 ]
Niu, Xintao [1 ,2 ]
Nie, Changhai [1 ,2 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Nanjing Univ, Dept Comp Sci & Technol, Nanjing, Peoples R China
[3] Guangdong OPPO Mobile Telecommun Corp Ltd, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label Image Classification Testing; Black-box Testing; Metamorphic Testing;
D O I
10.1109/ASE56229.2023.00156
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label Image Classification Systems (MICSs) developed based on Deep Neural Networks (DNNs) are extensively used in people's daily life. Currently, although there are a variety of approaches to test DNN-based systems, they typically rely on the internals of DNNs to design test cases, and do not take the core specification of MICS (i.e., correctly recognizing multiple objects in a given image) into account. In this paper, we propose ATOM, an automated and systematic black-box testing framework for testing MICS. Specifically, ATOM exploits the label combination as the testing adequacy criteria, hoping to systematically examine the impact of correlations between a fixed number of labels on the classification ability of MICS. Then, ATOM leverages image search engine and natural language processing to find test images that are not only common to the real-world, but also relevant to target label combinations. Finally, ATOM combines metamorphic testing and label information to realize test oracle identification, based on which the ability of MICS in classifying different label combinations is evaluated. To evaluate the effectiveness of ATOM, we have performed experiments on two popular datasets of MICS, VOC and COCO (each with five state-of-the-art DNN models), and one real-world photo tagging application from our industrial partner. The experimental results reveal that the performance of current DNN-based MICSs remains less satisfactory even in recognizing correlations between only two labels, as ATOM triggers a total number of 6,049 such label combination related errors for all MICSs studied. In particular, ATOM reports 587 error-revealing images for the industrial MICS, in which 92% of them are confirmed by the developers.
引用
收藏
页码:230 / 242
页数:13
相关论文
共 50 条
  • [41] Untargeted Attack on Targeted-label for Multi-label Image Classification
    Lin, Yangfei
    Qiao, Peng
    Dou, Yong
    TWELFTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2020), 2021, 11720
  • [42] Explainable Noisy Label Flipping for Multi-Label Fashion Image Classification
    Ferreira, Beatriz Quintino
    Costeira, Joao P.
    Gomes, Joao P.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3911 - 3915
  • [43] State of the Art: Automated Black-Box Web Application Vulnerability Testing
    Bau, Jason
    Bursztein, Elie
    Gupta, Divij
    Mitchell, John
    2010 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, 2010, : 332 - 345
  • [44] MULTI-LABEL CLASSIFICATION WITH SINGLE POSITIVE LABEL FOR REMOTE SENSING IMAGE
    Fujii, Keigo
    Iwasaki, Akira
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5870 - 5873
  • [45] Label Enhancement Manifold Learning Algorithm for Multi-label Image Classification
    Tan, Chao
    Ji, Genlin
    2020 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2020), 2020, : 96 - 102
  • [46] Active learning with label correlation exploration for multi-label image classification
    Wu, Jian
    Ye, Chen
    Sheng, Victor S.
    Zhang, Jing
    Zhao, Pengpeng
    Cui, Zhiming
    IET COMPUTER VISION, 2017, 11 (07) : 577 - 584
  • [47] Automated black-box testing of nominal and error scenarios in RESTful APIs
    Corradini, Davide
    Zampieri, Amedeo
    Pasqua, Michele
    Viglianisi, Emanuele
    Dallago, Michael
    Ceccato, Mariano
    SOFTWARE TESTING VERIFICATION & RELIABILITY, 2022, 32 (05):
  • [48] RestTestGen: An Extensible Framework for Automated Black-box Testing of RESTful APIs
    Corradini, Davide
    Zampieri, Amedeo
    Pasqua, Michele
    Ceccato, Mariano
    2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 504 - 508
  • [49] Black-Box Optimization for Automated Discovery
    Terayama, Kei
    Sumita, Masato
    Tamura, Ryo
    Tsuda, Koji
    ACCOUNTS OF CHEMICAL RESEARCH, 2021, 54 (06) : 1334 - 1346
  • [50] Automated Black-box Testing of Mass Assignment Vulnerabilities in RESTful APIs
    Corradini, Davide
    Pasqua, Michele
    Ceccato, Mariano
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 2553 - 2564