Aries: Efficient Testing of Deep Neural Networks via Labeling-Free Accuracy Estimation

被引:7
|
作者
Hu, Qiang [1 ]
Guo, Yuejun [2 ]
Xie, Xiaofei [3 ]
Cordy, Maxime [1 ]
Papadakis, Mike [1 ]
Ma, Lei [4 ,5 ]
Le Traon, Yves [1 ]
机构
[1] Univ Luxembourg, Luxembourg, Luxembourg
[2] Luxembourg Inst Sci & Technol, Luxembourg, Luxembourg
[3] Singapore Management Univ, Singapore, Singapore
[4] Univ Alberta, Edmonton, AB, Canada
[5] Univ Tokyo, Tokyo, Japan
来源
2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE | 2023年
基金
加拿大自然科学与工程研究理事会;
关键词
deep learning testing; performance estimation; distribution shift;
D O I
10.1109/ICSE48619.2023.00152
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep learning (DL) plays a more and more important role in our daily life due to its competitive performance in industrial application domains. As the core of DL-enabled systems, deep neural networks (DNNs) need to be carefully evaluated to ensure the produced models match the expected requirements. In practice, the de facto standard to assess the quality of DNNs in the industry is to check their performance (accuracy) on a collected set of labeled test data. However, preparing such labeled data is often not easy partly because of the huge labeling effort, i.e., data labeling is labor-intensive, especially with the massive new incoming unlabeled data every day. Recent studies show that test selection for DNN is a promising direction that tackles this issue by selecting minimal representative data to label and using these data to assess the model. However, it still requires human effort and cannot be automatic. In this paper, we propose a novel technique, named Aries, that can estimate the performance of DNNs on new unlabeled data using only the information obtained from the original test data. The key insight behind our technique is that the model should have similar prediction accuracy on the data which have similar distances to the decision boundary. We performed a large-scale evaluation of our technique on two famous datasets, CIFAR-10 and Tiny-ImageNet, four widely studied DNN models including ResNet101 and DenseNet-121, and 13 types of data transformation methods. Results show that the estimated accuracy by Aries is only 0.03% - 2.60% off the true accuracy. Besides, Aries also outperforms the state-of-the-art labeling-free methods in 50 out of 52 cases and selection-labeling-based methods in 96 out of 128 cases.
引用
收藏
页码:1776 / 1787
页数:12
相关论文
共 50 条
  • [21] DEEP NEURAL NETWORKS FOR ESTIMATION AND INFERENCE
    Farrell, Max H.
    Liang, Tengyuan
    Misra, Sanjog
    ECONOMETRICA, 2021, 89 (01) : 181 - 213
  • [22] Vehicle Pose Estimation in WAMI Imagery via Deep Convolutional Neural Networks
    Yi, Meng
    Wang, Dong
    Yang, Fan
    Xu, Jonathan
    Cai, Yiran
    Blasch, Erik
    Sheaff, Carolyn
    Chen, Genshe
    Ling, Haibin
    PROCEEDINGS OF THE 2016 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON) AND OHIO INNOVATION SUMMIT (OIS), 2016, : 233 - 240
  • [23] Robust DOA Estimation Method for MIMO Radar via Deep Neural Networks
    Cong, Jingyu
    Wang, Xianpeng
    Huang, Mengxing
    Wan, Liangtian
    IEEE SENSORS JOURNAL, 2021, 21 (06) : 7498 - 7507
  • [24] Temperature-based Accuracy Estimation on Instrument Transformers via Neural Networks and Microcontrollers
    Negri, Virginia
    Mingotti, Alessandro
    Tinarelli, Roberto
    Peretto, Lorenzo
    2024 IEEE 14TH INTERNATIONAL WORKSHOP ON APPLIED MEASUREMENTS FOR POWER SYSTEMS, AMPS 2024, 2024,
  • [25] Efficient Uncertainty Estimation in Spiking Neural Networks via MC-dropout
    Sun, Tao
    Yin, Bojian
    Bohte, Sander
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT I, 2023, 14254 : 393 - 406
  • [26] Efficient Low Cost Alternative Testing of Analog Crossbar Arrays for Deep Neural Networks
    Ma, Kwondo
    Saha, Anurup
    Amarnath, Chandramouli
    Chatterjee, Abhijit
    2022 IEEE INTERNATIONAL TEST CONFERENCE (ITC), 2022, : 499 - 503
  • [27] Parameter estimation via neural networks
    Phillips, NG
    Kogut, A
    STATISTICAL CHALLENGES IN ASTRONOMY, 2003, : 471 - 473
  • [28] Improving Accuracy of Contactless Respiratory Rate Estimation by Enhancing Thermal Sequences with Deep Neural Networks
    Kwasniewska, Alicja
    Ruminski, Jacek
    Szankin, Maciej
    APPLIED SCIENCES-BASEL, 2019, 9 (20):
  • [29] Efficient Estimation of Single-index Models with Deep Re QU Neural Networks
    Zhihuang Yang
    Siming Zheng
    Niansheng Tang
    Acta Mathematica Sinica,English Series, 2025, (02) : 640 - 676
  • [30] Boosting Estimation Accuracy of Low-Cost Monopulse Receiver Via Deep Neural Network
    Zhang, Hanxiang
    Pour, Saeed Zolfaghary
    Liu, Powei
    Yan, Hao
    Casamayor, Jonathan
    Plaisir, Mitch
    Arigong, Bayaner
    2024 IEEE WIRELESS AND MICROWAVE TECHNOLOGY CONFERENCE, WAMICON, 2024,