Extracting Explanations, Justification, and Uncertainty from Black-Box Deep Neural Networks

被引：0

作者：

Ardis, Paul ^{[1
]}

Flenner, Arjuna ^{[2
]}

机构：

[1] GE Aerosp Res, 1 Res Circle, Niskayuna, NY 12309 USA

[2] GE Aerosp, 3290 Patterson Ave SE, Grand Rapids, MI 49512 USA

来源：

ASSURANCE AND SECURITY FOR AI-ENABLED SYSTEMS | 2024年 / 13054卷

关键词：

D O I：

10.1117/12.3012765

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep Neural Networks (DNNs) do not inherently compute or exhibit empirically-justified task confidence. In mission critical applications, it is important to both understand associated DNN reasoning and its supporting evidence. In this paper, we propose a novel Bayesian approach to extract explanations, justifications, and uncertainty estimates from DNNs. Our approach is efficient both in terms of memory and computation, and can be applied to any black box DNN without any retraining, including applications to anomaly detection and out-of-distribution detection tasks. We validate our approach on the CIFAR-10 dataset, and show that it can significantly improve the interpretability and reliability of DNNs.

引用

页数：8

共 50 条

[31] Spectral Privacy Detection on Black-box Graph Neural Networks
Yang, Yining
Lu, Jialiang
2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
[32] BET: Black-Box Efficient Testing for Convolutional Neural Networks
Wang, Jialai
Qiu, Han
Rong, Yi
Ye, Hengkai
Li, Qi
Li, Zongpeng
Zhang, Chao
PROCEEDINGS OF THE 31ST ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2022, 2022, : 164 - 175
[33] Neural networks in antenna engineering - Beyond black-box modeling
Patnaik, A
Anagnostou, D
Christodoulou, CG
2005 IEEE/ACES INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND APPLIED COMPUTATIONAL ELECTROMAGNETICS, 2005, : 598 - 601
[34] A Unique Identification-Oriented Black-Box Watermarking Scheme for Deep Classification Neural Networks
Mo, Mouke
Wang, Chuntao
Bian, Shan
SYMMETRY-BASEL, 2024, 16 (03):
[35] Black-box Adversarial Attack and Defense on Graph Neural Networks
Li, Haoyang
Di, Shimin
Li, Zijian
Chen, Lei
Cao, Jiannong
2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 1017 - 1030
[36] Revisiting Black-box Ownership Verification for Graph Neural Networks
Zhou, Ruikai
Yang, Kang
Wang, Xiuling
Wang, Wendy Hui
Xu, Jun
45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024, 2024, : 2478 - 2496
[37] Feature Importance Explanations for Temporal Black-Box Models
Sood, Akshay
Craven, Mark
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8351 - 8360
[38] Uncertainty propagation method for high-dimensional black-box problems via Bayesian deep neural network
Jing Fei Liu
Chao Jiang
Jing Zheng
Structural and Multidisciplinary Optimization, 2022, 65
[39] Uncertainty propagation method for high-dimensional black-box problems via Bayesian deep neural network
Liu, Jing Fei
Jiang, Chao
Zheng, Jing
STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2022, 65 (03)
[40] Comparing Explanations from Glass-Box and Black-Box Machine-Learning Models
Kuk, Michal
Bobek, Szymon
Nalepa, Grzegorz J.
COMPUTATIONAL SCIENCE - ICCS 2022, PT III, 2022, 13352 : 668 - 675

← 1 2 3 4 5 →