Towards Visually Explaining Variational Autoencoders

被引：111

作者：

Liu, Wenqian ^{[1
]}

Li, Runze ^{[2
]}

Zheng, Meng ^{[3
]}

Karanam, Srikrishna ^{[4
]}

Wu, Ziyan ^{[4
]}

Bhanu, Bir ^{[2
]}

Radke, Richard J. ^{[3
]}

Camps, Octavia ^{[1
]}

机构：

[1] Northeastern Univ, Boston, MA 02115 USA

[2] Univ Calif Riverside, Riverside, CA 92521 USA

[3] Rensselaer Polytech Inst, Troy, NY USA

[4] United Imaging Intelligence, Cambridge, MA USA

来源：

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年

关键词：

D O I：

10.1109/CVPR42600.2020.00867

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent advances in convolutional neural network (CNN) model interpretability have led to impressive progress in visualizing and understanding model predictions. In particular, gradient-based visual attention methods have driven much recent effort in using visual attention maps as a means for visual explanations. A key problem, however, is these methods are designed for classification and categorization tasks, and their extension to explaining generative models, e.g., variational autoencoders (VAE) is not trivial. In this work, we take a step towards bridging this crucial gap, proposing the first technique to visually explain VAEs by means of gradient-based attention. We present methods to generate visual attention from the learned latent space, and also demonstrate such attention explanations serve more than just explaining VAE predictions. We show how these attention maps can be used to localize anomalies in images, demonstrating state-of-the-art performance on the MVTec-AD dataset. We also show how they can be infused into model training, helping bootstrap the VAE into learning improved latent space disentanglement, demonstrated on the Dsprites dataset.

引用

页码：8639 / 8648

页数：10

共 50 条

[21] Variational Autoencoders for Assessing Sustainability
Fernando Romero-Canizares, Jose
Vicente-Galindo, Purificacion
DOCTORAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGIES - DSICT, 2022, 846 : 47 - 62
[22] Variational Graph Normalized AutoEncoders
Ahn, Seong Jin
Kim, MyoungHo
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2827 - 2831
[23] Resampled Priors for Variational Autoencoders
Bauer, Matthias
Mnih, Andriy
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 66 - 75
[24] Inference Suboptimality in Variational Autoencoders
Cremer, Chris
Li, Xuechen
Duvenaud, David
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[25] Variational Autoencoders for Collaborative Filtering
Liang, Dawen
Krishnan, Rahul G.
Hoffman, Matthew D.
Jebara, Tony
WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 689 - 698
[26] An Evolutionary Approach to Variational Autoencoders
Hajewski, Jeff
Oliveira, Suely
2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 71 - 77
[27] Disentangling Disentanglement in Variational Autoencoders
Mathieu, Emile
Rainforth, Tom
Siddharth, N.
Teh, Yee Whye
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[28] A Geometric Perspective on Variational Autoencoders
Chadebec, Clement
Allassonniere, Stephanie
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[29] Shedding Light on Variational Autoencoders
Ruiz Vargas, J. C.
Novaes, S. F.
Cobe, R.
Iope, R.
Stanzani, S.
Tomei, T. R.
2018 XLIV LATIN AMERICAN COMPUTER CONFERENCE (CLEI 2018), 2018, : 294 - 298
[30] Rethinking Controllable Variational Autoencoders
Shao, Huajie
Yang, Yifei
Lin, Haohong
Lin, Longzhong
Chen, Yizhuo
Yang, Qinmin
Zhao, Han
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19228 - 19237

← 1 2 3 4 5 →