Towards Visually Explaining Variational Autoencoders

被引:111
|
作者
Liu, Wenqian [1 ]
Li, Runze [2 ]
Zheng, Meng [3 ]
Karanam, Srikrishna [4 ]
Wu, Ziyan [4 ]
Bhanu, Bir [2 ]
Radke, Richard J. [3 ]
Camps, Octavia [1 ]
机构
[1] Northeastern Univ, Boston, MA 02115 USA
[2] Univ Calif Riverside, Riverside, CA 92521 USA
[3] Rensselaer Polytech Inst, Troy, NY USA
[4] United Imaging Intelligence, Cambridge, MA USA
关键词
D O I
10.1109/CVPR42600.2020.00867
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in convolutional neural network (CNN) model interpretability have led to impressive progress in visualizing and understanding model predictions. In particular, gradient-based visual attention methods have driven much recent effort in using visual attention maps as a means for visual explanations. A key problem, however, is these methods are designed for classification and categorization tasks, and their extension to explaining generative models, e.g., variational autoencoders (VAE) is not trivial. In this work, we take a step towards bridging this crucial gap, proposing the first technique to visually explain VAEs by means of gradient-based attention. We present methods to generate visual attention from the learned latent space, and also demonstrate such attention explanations serve more than just explaining VAE predictions. We show how these attention maps can be used to localize anomalies in images, demonstrating state-of-the-art performance on the MVTec-AD dataset. We also show how they can be infused into model training, helping bootstrap the VAE into learning improved latent space disentanglement, demonstrated on the Dsprites dataset.
引用
收藏
页码:8639 / 8648
页数:10
相关论文
共 50 条
  • [21] Variational Autoencoders for Assessing Sustainability
    Fernando Romero-Canizares, Jose
    Vicente-Galindo, Purificacion
    DOCTORAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGIES - DSICT, 2022, 846 : 47 - 62
  • [22] Variational Graph Normalized AutoEncoders
    Ahn, Seong Jin
    Kim, MyoungHo
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2827 - 2831
  • [23] Resampled Priors for Variational Autoencoders
    Bauer, Matthias
    Mnih, Andriy
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 66 - 75
  • [24] Inference Suboptimality in Variational Autoencoders
    Cremer, Chris
    Li, Xuechen
    Duvenaud, David
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [25] Variational Autoencoders for Collaborative Filtering
    Liang, Dawen
    Krishnan, Rahul G.
    Hoffman, Matthew D.
    Jebara, Tony
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 689 - 698
  • [26] An Evolutionary Approach to Variational Autoencoders
    Hajewski, Jeff
    Oliveira, Suely
    2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 71 - 77
  • [27] Disentangling Disentanglement in Variational Autoencoders
    Mathieu, Emile
    Rainforth, Tom
    Siddharth, N.
    Teh, Yee Whye
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [28] A Geometric Perspective on Variational Autoencoders
    Chadebec, Clement
    Allassonniere, Stephanie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [29] Shedding Light on Variational Autoencoders
    Ruiz Vargas, J. C.
    Novaes, S. F.
    Cobe, R.
    Iope, R.
    Stanzani, S.
    Tomei, T. R.
    2018 XLIV LATIN AMERICAN COMPUTER CONFERENCE (CLEI 2018), 2018, : 294 - 298
  • [30] Rethinking Controllable Variational Autoencoders
    Shao, Huajie
    Yang, Yifei
    Lin, Haohong
    Lin, Longzhong
    Chen, Yizhuo
    Yang, Qinmin
    Zhao, Han
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19228 - 19237