Aries: Efficient Testing of Deep Neural Networks via Labeling-Free Accuracy Estimation

被引:7
|
作者
Hu, Qiang [1 ]
Guo, Yuejun [2 ]
Xie, Xiaofei [3 ]
Cordy, Maxime [1 ]
Papadakis, Mike [1 ]
Ma, Lei [4 ,5 ]
Le Traon, Yves [1 ]
机构
[1] Univ Luxembourg, Luxembourg, Luxembourg
[2] Luxembourg Inst Sci & Technol, Luxembourg, Luxembourg
[3] Singapore Management Univ, Singapore, Singapore
[4] Univ Alberta, Edmonton, AB, Canada
[5] Univ Tokyo, Tokyo, Japan
基金
加拿大自然科学与工程研究理事会;
关键词
deep learning testing; performance estimation; distribution shift;
D O I
10.1109/ICSE48619.2023.00152
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep learning (DL) plays a more and more important role in our daily life due to its competitive performance in industrial application domains. As the core of DL-enabled systems, deep neural networks (DNNs) need to be carefully evaluated to ensure the produced models match the expected requirements. In practice, the de facto standard to assess the quality of DNNs in the industry is to check their performance (accuracy) on a collected set of labeled test data. However, preparing such labeled data is often not easy partly because of the huge labeling effort, i.e., data labeling is labor-intensive, especially with the massive new incoming unlabeled data every day. Recent studies show that test selection for DNN is a promising direction that tackles this issue by selecting minimal representative data to label and using these data to assess the model. However, it still requires human effort and cannot be automatic. In this paper, we propose a novel technique, named Aries, that can estimate the performance of DNNs on new unlabeled data using only the information obtained from the original test data. The key insight behind our technique is that the model should have similar prediction accuracy on the data which have similar distances to the decision boundary. We performed a large-scale evaluation of our technique on two famous datasets, CIFAR-10 and Tiny-ImageNet, four widely studied DNN models including ResNet101 and DenseNet-121, and 13 types of data transformation methods. Results show that the estimated accuracy by Aries is only 0.03% - 2.60% off the true accuracy. Besides, Aries also outperforms the state-of-the-art labeling-free methods in 50 out of 52 cases and selection-labeling-based methods in 96 out of 128 cases.
引用
收藏
页码:1776 / 1787
页数:12
相关论文
共 50 条
  • [1] LaF: Labeling-free Model Selection for Automated Deep Neural Network Reusing
    Hu, Qiang
    Guo, Yuejun
    Xie, Xiaofei
    Cordy, Maxime
    Papadakis, Mike
    Le Traon, Yves
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (01)
  • [2] Practical Accuracy Estimation for Efficient Deep Neural Network Testing
    Chen, Junjie
    Wu, Zhuo
    Wang, Zan
    You, Hanmo
    Zhang, Lingming
    Yan, Ming
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2020, 29 (04)
  • [3] Robust and Efficient Phase Estimation in Legged Robots via Signal Imaging and Deep Neural Networks
    Yazdipaz, Kamyab
    Kohli, Nooshin
    Golestaneh, Seyed Ali
    Shahbazi, Mohammad
    IEEE ACCESS, 2025, 13 : 49018 - 49029
  • [4] Efficient and Accurate Estimation of Lipschitz Constants for Deep Neural Networks
    Fazlyab, Mahyar
    Robey, Alexander
    Hassani, Hamed
    Morari, Manfred
    Pappas, George J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [5] Lipschitz regularity of deep neural networks: analysis and efficient estimation
    Scaman, Kevin
    Virmaux, Aladin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [6] Accuracy Measurement of Deep Neural Network Accelerator via Metamorphic Testing
    Wang, Chaojin
    Shen, Jian
    Fang, Chunrong
    Guan, Xiangsheng
    Wu, Kaitao
    Wang, Jiang
    2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING (AITEST), 2020, : 55 - 61
  • [7] Floating Point Accuracy Testing in Deep Neural Network Computations via Hypothesis Testing
    Wang, Chaojin
    Shen, Jian
    2020 27TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2020), 2020, : 257 - 265
  • [8] DeepPose: Human Pose Estimation via Deep Neural Networks
    Toshev, Alexander
    Szegedy, Christian
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1653 - 1660
  • [9] 3D Mesh Labeling via Deep Convolutional Neural Networks
    Guo, Kan
    Zou, Dongqing
    Chen, Xiaowu
    ACM TRANSACTIONS ON GRAPHICS, 2015, 35 (01):
  • [10] Deep Convolutional Neural Networks for Efficient Pose Estimation in Gesture Videos
    Pfister, Tomas
    Simonyan, Karen
    Charles, James
    Zisserman, Andrew
    COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 538 - 552