ATOM: Automated Black-Box Testing of Multi-Label Image Classification Systems

被引:0
|
作者
Hu, Shengyou [1 ,2 ]
Wu, Huayao [1 ,2 ]
Wang, Peng [1 ,2 ]
Chang, Jing [3 ]
Tu, Yongjun [3 ]
Jiang, Xiu [3 ]
Niu, Xintao [1 ,2 ]
Nie, Changhai [1 ,2 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Nanjing Univ, Dept Comp Sci & Technol, Nanjing, Peoples R China
[3] Guangdong OPPO Mobile Telecommun Corp Ltd, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label Image Classification Testing; Black-box Testing; Metamorphic Testing;
D O I
10.1109/ASE56229.2023.00156
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label Image Classification Systems (MICSs) developed based on Deep Neural Networks (DNNs) are extensively used in people's daily life. Currently, although there are a variety of approaches to test DNN-based systems, they typically rely on the internals of DNNs to design test cases, and do not take the core specification of MICS (i.e., correctly recognizing multiple objects in a given image) into account. In this paper, we propose ATOM, an automated and systematic black-box testing framework for testing MICS. Specifically, ATOM exploits the label combination as the testing adequacy criteria, hoping to systematically examine the impact of correlations between a fixed number of labels on the classification ability of MICS. Then, ATOM leverages image search engine and natural language processing to find test images that are not only common to the real-world, but also relevant to target label combinations. Finally, ATOM combines metamorphic testing and label information to realize test oracle identification, based on which the ability of MICS in classifying different label combinations is evaluated. To evaluate the effectiveness of ATOM, we have performed experiments on two popular datasets of MICS, VOC and COCO (each with five state-of-the-art DNN models), and one real-world photo tagging application from our industrial partner. The experimental results reveal that the performance of current DNN-based MICSs remains less satisfactory even in recognizing correlations between only two labels, as ATOM triggers a total number of 6,049 such label combination related errors for all MICSs studied. In particular, ATOM reports 587 error-revealing images for the industrial MICS, in which 92% of them are confirmed by the developers.
引用
收藏
页码:230 / 242
页数:13
相关论文
共 50 条
  • [21] LABEL RELATION INFERENCE FOR MULTI-LABEL AERIAL IMAGE CLASSIFICATION
    Hua, Yuansheng
    Mou, Lichao
    Zhu, Xiao Xiang
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 5244 - 5247
  • [22] Multi-label Image Classification with A Probabilistic Label Enhancement Model
    Li, Xin
    Zhao, Feipeng
    Guo, Yuhong
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 430 - 439
  • [23] Multi-Label Active Learning with Label Correlation for Image Classification
    Ye, Chen
    Wu, Jian
    Sheng, Victor S.
    Zhao, Pengpeng
    Cui, Zhiming
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3437 - 3441
  • [24] Multi-label Iterated Learning for Image Classification with Label Ambiguity
    Rajeswar, Sai
    Rodriguez, Pau
    Singhal, Soumye
    Vazquez, David
    Courville, Aaron
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4773 - 4783
  • [25] Black-Box Testing and Auditing of Bias in ADM Systems
    Krafft, Tobias D.
    Hauer, Marc P.
    Zweig, Katharina
    MINDS AND MACHINES, 2024, 34 (02)
  • [26] Interpreting Undesirable Pixels for Image Classification on Black-Box Models
    Kang, Sin-Han
    Jung, Hong-Gyu
    Lee, Seong-Whan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4250 - 4254
  • [27] A Capsule Network for Hierarchical Multi-label Image Classification
    Noor, Khondaker Tasrif
    Robles-Kelly, Antonio
    Kusy, Brano
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2022, 2022, 13813 : 163 - 172
  • [28] Discovering Unknown Labels for Multi-Label Image Classification
    Huang, Jun
    Yan, Yu
    Zheng, Xiao
    Qu, Xiwen
    Hong, Xudong
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 797 - 806
  • [29] Multi-label Classification with Clustering for Image and Text Categorization
    Nasierding, Gulisong
    Sajjanhar, Atul
    2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 869 - 874
  • [30] Multi-Label Image Classification by Feature Attention Network
    Yan, Zheng
    Liu, Weiwei
    Wen, Shiping
    Yang, Yin
    IEEE ACCESS, 2019, 7 : 98005 - 98013