Learning Energy-Based Model with Variational Auto-Encoder as Amortized Sampler

Cited by: 0
Authors
Xie, Jianwen [1]
Zheng, Zilong [1]
Li, Ping [1]
Affiliations
[1] Baidu Res, Cognit Comp Lab, 10900 NE 8th St, Bellevue, WA 98004 USA
Source
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021 / Vol. 35
Keywords
FRAME;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Due to the intractable partition function, training energy-based models (EBMs) by maximum likelihood requires Markov chain Monte Carlo (MCMC) sampling to approximate the gradient of the Kullback-Leibler divergence between data and model distributions. However, it is non-trivial to sample from an EBM because of the difficulty of mixing between modes. In this paper, we propose to learn a variational auto-encoder (VAE) to initialize the finite-step MCMC, such as Langevin dynamics that is derived from the energy function, for efficient amortized sampling of the EBM. With these amortized MCMC samples, the EBM can be trained by maximum likelihood, which follows an "analysis by synthesis" scheme; while the VAE learns from these MCMC samples via variational Bayes. We call this joint training algorithm the variational MCMC teaching, in which the VAE chases the EBM toward data distribution. We interpret the learning algorithm as a dynamic alternating projection in the context of information geometry. Our proposed models can generate samples comparable to GANs and EBMs. Additionally, we demonstrate that our model can learn effective probabilistic distribution toward supervised conditional learning tasks.
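The finite-step Langevin sampling with an amortized initializer described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the quadratic energy `U(x) = 0.5 * (x - mu)^2`, the function names, the step size, and the Gaussian `amortized_init` (a hypothetical stand-in for a sample drawn from a trained VAE) are all assumptions chosen only to keep the example self-contained.

```python
import random


def grad_energy(x, mu=2.0):
    """Gradient of the assumed toy energy U(x) = 0.5 * (x - mu)^2."""
    return x - mu


def amortized_init(rng):
    """Hypothetical stand-in for a VAE sample: a rough guess near the mode."""
    return rng.gauss(1.0, 1.0)


def langevin(x0, rng, n_steps=30, step_size=0.5):
    """Run a finite number of Langevin steps starting from x0."""
    x = x0
    for _ in range(n_steps):
        noise = rng.gauss(0.0, 1.0)
        # x <- x - (s^2 / 2) * dU/dx + s * noise
        x = x - 0.5 * step_size ** 2 * grad_energy(x) + step_size * noise
    return x


def run_chains(n_chains, seed=0):
    """Draw n_chains samples, each refined from an amortized initial point."""
    rng = random.Random(seed)
    return [langevin(amortized_init(rng), rng) for _ in range(n_chains)]


if __name__ == "__main__":
    samples = run_chains(2000)
    print("sample mean:", sum(samples) / len(samples))
```

In the scheme the abstract describes, the refined samples would then serve two purposes: as the synthesized examples in the maximum likelihood update of the EBM, and as training data for the VAE. Neither update is shown in this sketch.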
Pages: 10441 - 10451
Number of pages: 11
Related papers
50 in total
  • [21] A trajectory outlier detection method based on variational auto-encoder
    Zhang, Longmei
    Lu, Wei
    Xue, Feng
    Chang, Yanshuo
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (08) : 15075 - 15093
  • [22] Anomaly detection method based on convolutional variational auto-encoder
    Yu X.
    Xu M.
    Wang Y.
    Wang S.
    Hu N.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2021, 42 (05) : 151 - 158
  • [23] Detection Algorithm of the Mimicry Attack based on Variational Auto-Encoder
    Wang, Qunke
    Fang, Lanting
    Zhu, Zhenchao
    Huang, Jie
    51ST ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN-W 2021), 2021, : 114 - 120
  • [24] An unsupervised adversarial domain adaptation based on variational auto-encoder
    Zonoozi, Mahta Hassan Pour
    Seydi, Vahid
    Deypir, Mahmood
    MACHINE LEARNING, 2025, 114 (05)
  • [25] Twin Variational Auto-Encoder for Representation Learning in IoT Intrusion Detection
    Phai Vu Dinh
    Nguyen Quang Uy
    Nguyen, Diep N.
    Dinh Thai Hoang
    Son Pham Bao
    Dutkiewicz, Eryk
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 848 - 853
  • [26] Path Tracking Control Using Imitation Learning with Variational Auto-Encoder
    Lee, Su-Jin
    Chun, Tae Yoon
    Lim, Hyoung Woo
    Lee, Sang-Ho
    2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019, : 501 - 505
  • [27] Unsupervised Text Feature Learning via Deep Variational Auto-encoder
    Liu, Genggeng
    Xie, Lin
    Chen, Chi-Hua
    INFORMATION TECHNOLOGY AND CONTROL, 2020, 49 (03) : 421 - 437
  • [28] Deep variational auto-encoder for text classification
    Xie, Lin
    Liu, Genggeng
    Lian, Hongfei
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER PHYSICAL SYSTEMS (ICPS 2019), 2019, : 737 - 742
  • [29] Cascade Variational Auto-Encoder for Hierarchical Disentanglement
    Lin, Fudong
    Yuan, Xu
    Peng, Lu
    Tzeng, Nian-Feng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 1248 - 1257
  • [30] A Context-Aware Variational Auto-Encoder Model for Text Generation
    Ma, Zhiqiang
    Wang, Chunyu
    Shen, Ji
    Du, Baoxiang
    2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, : 1176 - 1182