Extracting Explanations, Justification, and Uncertainty from Black-Box Deep Neural Networks

被引:0
|
作者
Ardis, Paul [1 ]
Flenner, Arjuna [2 ]
机构
[1] GE Aerosp Res, 1 Res Circle, Niskayuna, NY 12309 USA
[2] GE Aerosp, 3290 Patterson Ave SE, Grand Rapids, MI 49512 USA
关键词
D O I
10.1117/12.3012765
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Neural Networks (DNNs) do not inherently compute or exhibit empirically-justified task confidence. In mission critical applications, it is important to both understand associated DNN reasoning and its supporting evidence. In this paper, we propose a novel Bayesian approach to extract explanations, justifications, and uncertainty estimates from DNNs. Our approach is efficient both in terms of memory and computation, and can be applied to any black box DNN without any retraining, including applications to anomaly detection and out-of-distribution detection tasks. We validate our approach on the CIFAR-10 dataset, and show that it can significantly improve the interpretability and reliability of DNNs.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Spectral Privacy Detection on Black-box Graph Neural Networks
    Yang, Yining
    Lu, Jialiang
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [32] BET: Black-Box Efficient Testing for Convolutional Neural Networks
    Wang, Jialai
    Qiu, Han
    Rong, Yi
    Ye, Hengkai
    Li, Qi
    Li, Zongpeng
    Zhang, Chao
    PROCEEDINGS OF THE 31ST ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2022, 2022, : 164 - 175
  • [33] Neural networks in antenna engineering - Beyond black-box modeling
    Patnaik, A
    Anagnostou, D
    Christodoulou, CG
    2005 IEEE/ACES INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND APPLIED COMPUTATIONAL ELECTROMAGNETICS, 2005, : 598 - 601
  • [34] A Unique Identification-Oriented Black-Box Watermarking Scheme for Deep Classification Neural Networks
    Mo, Mouke
    Wang, Chuntao
    Bian, Shan
    SYMMETRY-BASEL, 2024, 16 (03):
  • [35] Black-box Adversarial Attack and Defense on Graph Neural Networks
    Li, Haoyang
    Di, Shimin
    Li, Zijian
    Chen, Lei
    Cao, Jiannong
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 1017 - 1030
  • [36] Revisiting Black-box Ownership Verification for Graph Neural Networks
    Zhou, Ruikai
    Yang, Kang
    Wang, Xiuling
    Wang, Wendy Hui
    Xu, Jun
    45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024, 2024, : 2478 - 2496
  • [37] Feature Importance Explanations for Temporal Black-Box Models
    Sood, Akshay
    Craven, Mark
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8351 - 8360
  • [38] Uncertainty propagation method for high-dimensional black-box problems via Bayesian deep neural network
    Jing Fei Liu
    Chao Jiang
    Jing Zheng
    Structural and Multidisciplinary Optimization, 2022, 65
  • [39] Uncertainty propagation method for high-dimensional black-box problems via Bayesian deep neural network
    Liu, Jing Fei
    Jiang, Chao
    Zheng, Jing
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2022, 65 (03)
  • [40] Comparing Explanations from Glass-Box and Black-Box Machine-Learning Models
    Kuk, Michal
    Bobek, Szymon
    Nalepa, Grzegorz J.
    COMPUTATIONAL SCIENCE - ICCS 2022, PT III, 2022, 13352 : 668 - 675