Discrete Auto-regressive Variational Attention Models for Text Modeling

Authors
Fang, Xianghong [1]
Bai, Haoli [1]
Li, Jian [1]
Xu, Zenglin [2]
Lyu, Michael [1]
King, Irwin [1]
Affiliations
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Engn, Shenzhen, Peoples R China
Keywords
Text Modeling; Information Underrepresentation; Posterior Collapse
DOI
10.1109/IJCNN52387.2021.9534375
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Variational autoencoders (VAEs) have been widely applied to text modeling. In practice, however, they face two challenges: information underrepresentation and posterior collapse. The former arises because only the last hidden state of the LSTM encoder is transformed into the latent space, which is generally insufficient to summarize the data. The latter is a long-standing problem in VAE training, in which the optimization becomes trapped in a disastrous local optimum. In this paper, we propose the Discrete Auto-regressive Variational Attention Model (DAVAM) to address both challenges. Specifically, we introduce an auto-regressive variational attention approach that enriches the latent space by effectively capturing the semantic dependency from the input. We further design a discrete latent space for the variational attention and mathematically show that our model is free from posterior collapse. Extensive experiments on language modeling tasks demonstrate the superiority of DAVAM over several VAE counterparts. Code will be released.
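For readers unfamiliar with the term, posterior collapse can be stated against the textbook VAE objective. The block below is a minimal sketch of that standard formulation only; it is not the DAVAM objective, which this record does not give, and the symbols (x for the input text, z for the latent variable, q_phi for the encoder, p_theta for the decoder) are assumed notation rather than the authors' own.

```latex
% Standard evidence lower bound (ELBO) maximized when training a VAE.
% Notation is assumed for illustration; it is not taken from the paper.
\log p_\theta(x) \;\ge\;
  \mathbb{E}_{q_\phi(z \mid x)}\bigl[\log p_\theta(x \mid z)\bigr]
  \;-\; \mathrm{KL}\bigl(q_\phi(z \mid x) \,\|\, p(z)\bigr)

% Posterior collapse: the KL term is driven to (near) zero for every input,
% so the approximate posterior matches the prior and z carries no
% information about x; the decoder then models the text while ignoring z.
\mathrm{KL}\bigl(q_\phi(z \mid x) \,\|\, p(z)\bigr) \approx 0
  \quad \text{for all } x
```

Under this reading, the claim that a model is free from posterior collapse amounts to the latent variable remaining informative about the input throughout training.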
Pages: 8