Discrete Auto-regressive Variational Attention Models for Text Modeling

被引:0
|
作者
Fang, Xianghong [1 ]
Bai, Haoli [1 ]
Li, Jian [1 ]
Xu, Zenglin [2 ]
Lyu, Michael [1 ]
King, Irwin [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Engn, Shenzhen, Peoples R China
关键词
Text Modeling; Information Underrepresentation; Posterior Collapse;
D O I
10.1109/IJCNN52387.2021.9534375
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Variational autoencoders (VAEs) have been widely applied for text modeling. In practice, however, they are troubled by two challenges: information underrepresentation and posterior collapse. The former arises as only the last hidden state of LSTM encoder is transformed into the latent space, which is generally insufficient to summarize the data. The latter is a long-standing problem during the training of VAEs as the optimization is trapped to a disastrous local optimum. In this paper, we propose Discrete Auto-regressive Variational Attention Model (DAVAM) to address the challenges. Specifically, we introduce an auto-regressive variational attention approach to enrich the latent space by effectively capturing the semantic dependency from the input. We further design discrete latent space for the variational attention and mathematically show that our model is free from posterior collapse. Extensive experiments on language modeling tasks demonstrate the superiority of DAVAM against several VAE counterparts. Code will be released.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] On the Modeling of Discrete Time Auto-Regressive Representations
    Moysis, Lazaros
    Karampetakis, Nicholas P.
    2014 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2014, : 381 - 386
  • [2] ESTIMATION AND FORECASTING IN AUTO-REGRESSIVE MODELS
    MALINVAUD, E
    ECONOMETRICA, 1962, 30 (01) : 198 - 201
  • [3] Variational Auto-Regressive Gaussian Processes for Continual Learning
    Kapoor, Sanyam
    Karaletsos, Theofanis
    Bui, Thang D.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [4] Modeling of discrete time auto-regressive systems with given forward and backward behavior
    Moysis, Lazaros
    Karampetakis, Nicholas P.
    2014 22ND MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2014, : 139 - 144
  • [5] Quantile approximations in auto-regressive portfolio models
    Ahcan, Ales
    Masten, Igor
    Polanec, Saso
    Perman, Mihael
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2011, 235 (08) : 1976 - 1983
  • [6] On the use of Auto-Regressive Modeling for Arrhythmia Detection
    Adnane, Mourad
    Belouchrani, Adel
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 2410 - 2414
  • [7] Locally Hierarchical Auto-Regressive Modeling for Image Generation
    You, Tackgeun
    Kim, Saehoon
    Kim, Chiheon
    Lee, Doyup
    Han, Bohyung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [8] Facial expression recognition using auto-regressive models
    Dornaika, Fadi
    Davoine, Franck
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 520 - +
  • [9] Mixed frequency structural vector auto-regressive models
    Foroni, Claudia
    Marcellino, Massimiliano
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2016, 179 (02) : 403 - 425
  • [10] Auto-regressive modeling of the shadowing for RSS mobile tracking
    Noureddine, Hadi
    Gresset, Nicolas
    Castelain, Damien
    Pyndiah, Ramesh
    2011 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2011,