ATOM: Automated Black-Box Testing of Multi-Label Image Classification Systems

Authors
Hu, Shengyou [1, 2]
Wu, Huayao [1, 2]
Wang, Peng [1, 2]
Chang, Jing [3]
Tu, Yongjun [3]
Jiang, Xiu [3]
Niu, Xintao [1, 2]
Nie, Changhai [1, 2]
Affiliations
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Nanjing Univ, Dept Comp Sci & Technol, Nanjing, Peoples R China
[3] Guangdong OPPO Mobile Telecommun Corp Ltd, Guangzhou, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multi-label Image Classification Testing; Black-box Testing; Metamorphic Testing;
DOI
10.1109/ASE56229.2023.00156
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Multi-label Image Classification Systems (MICSs) built on Deep Neural Networks (DNNs) are widely used in daily life. Although a variety of approaches exist for testing DNN-based systems, they typically rely on the internals of the DNN to design test cases and do not take the core specification of an MICS (i.e., correctly recognizing multiple objects in a given image) into account. In this paper, we propose ATOM, an automated and systematic black-box testing framework for MICSs. Specifically, ATOM uses label combinations as its test adequacy criterion, in order to systematically examine how correlations among a fixed number of labels affect the classification ability of an MICS. ATOM then leverages an image search engine and natural language processing to find test images that are both common in the real world and relevant to the target label combinations. Finally, ATOM combines metamorphic testing with label information to identify test oracles, based on which the ability of an MICS to classify different label combinations is evaluated. To evaluate the effectiveness of ATOM, we performed experiments on two popular MICS datasets, VOC and COCO (each with five state-of-the-art DNN models), and on one real-world photo tagging application from our industrial partner. The experimental results show that the performance of current DNN-based MICSs remains unsatisfactory even when recognizing correlations between only two labels: ATOM triggers a total of 6,049 such label-combination-related errors across all MICSs studied. In particular, ATOM reports 587 error-revealing images for the industrial MICS, 92% of which were confirmed by the developers.
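The label-combination adequacy criterion mentioned in the abstract can be illustrated with a minimal sketch. This is not ATOM's implementation: the function names (`label_combinations`, `combination_coverage`, `missing_labels`), the representation of images as sets of labels, and the simple "every target label must be predicted" check are all assumptions made for illustration only.

```python
from itertools import combinations


def label_combinations(labels, k=2):
    """Enumerate all k-way label combinations as sorted tuples (hypothetical helper)."""
    return [tuple(c) for c in combinations(sorted(labels), k)]


def combination_coverage(labels, test_images, k=2):
    """Fraction of k-way label combinations that co-occur in at least one test image.

    test_images is assumed to be an iterable of label sets, one per image.
    """
    targets = set(label_combinations(labels, k))
    if not targets:
        return 1.0
    label_set = set(labels)
    covered = set()
    for image_labels in test_images:
        # Keep only labels from the label space, sorted so tuples match the targets.
        present = sorted(set(image_labels) & label_set)
        covered.update(combinations(present, k))
    return len(covered & targets) / len(targets)


def missing_labels(target_combo, predicted):
    """Labels of the target combination that the classifier failed to predict.

    A non-empty result is treated here as a candidate error (an assumed,
    simplified check, not the oracle described in the paper).
    """
    return sorted(set(target_combo) - set(predicted))


labels = ["cat", "dog", "person"]
pairs = label_combinations(labels)
coverage = combination_coverage(labels, [{"cat", "dog"}, {"person"}])
missed = missing_labels(("cat", "dog"), {"dog", "sofa"})
```

With three labels there are three two-way combinations, so a test set containing only a cat-and-dog image covers one third of them; the sketch makes it easy to see which combinations still need test images.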
Pages: 230-242
Page count: 13