LANGUAGE MODELING WITH NEURAL TRANS-DIMENSIONAL RANDOM FIELDS

Cited by: 0
Authors
Wang, Bin [1]
Ou, Zhijian [1]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
Keywords
Language modeling; Random field; Stochastic approximation
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Trans-dimensional random field language models (TRF LMs) have recently been introduced, where sentences are modeled as a collection of random fields. The TRF approach has been shown to be computationally more efficient in inference than LSTM LMs while achieving close performance, and to be able to flexibly integrate rich features. In this paper we propose neural TRFs, which go beyond the previous discrete TRFs that only use linear potentials with discrete features. The idea is to use nonlinear potentials with continuous features, implemented by neural networks (NNs), in the TRF framework. Neural TRFs combine the advantages of both NNs and TRFs. The benefits of word embedding, nonlinear feature learning and larger context modeling are inherited from the use of NNs. At the same time, the strength of efficient inference by avoiding the expensive softmax is preserved. A number of technical contributions are developed in this work, including employing deep convolutional neural networks (CNNs) to define the potentials and incorporating the joint stochastic approximation (JSA) strategy in the training algorithm, which enable us to successfully train neural TRF LMs. Various LMs are evaluated in terms of speech recognition word error rates (WERs) by rescoring the 1000-best lists of WSJ'92 test data. The results show that neural TRF LMs not only improve over discrete TRF LMs, but also perform slightly better than LSTM LMs with only one fifth of the parameters and 16x faster inference.
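As a rough illustration of why whole-sentence scoring can avoid a per-word softmax, the sketch below assigns one potential value to an entire sentence and uses it to rescore a small n-best list. It assumes the sentence log-probability has the form log pi_l - log Z_l + phi(x), with a length prior pi_l and a per-length normalizer Z_l that would be estimated in training. That form, and every name and number here (`toy_potential`, `LOG_PI`, `LOG_Z`, `WORD_SCORE`, `lm_weight`, the acoustic scores), are illustrative assumptions, not the authors' implementation or data.

```python
import math

# Hypothetical per-length constants: log prior log(pi_l) and log normalizer log(Z_l).
# In a real model these would be estimated during training; the values are made up.
LOG_PI = {2: math.log(0.5), 3: math.log(0.5)}
LOG_Z = {2: 1.0, 3: 1.5}

# Toy stand-in for a neural potential: a fixed score per word, summed over the sentence.
WORD_SCORE = {"the": 0.2, "cat": 0.5, "sat": 0.4, "cap": -0.3}


def toy_potential(words):
    """Unnormalized sentence score phi(x); unknown words score -1.0."""
    return sum(WORD_SCORE.get(w, -1.0) for w in words)


def sentence_logprob(words):
    """log p(l, x) = log pi_l - log Z_l + phi(x); no softmax over the vocabulary."""
    length = len(words)
    return LOG_PI[length] - LOG_Z[length] + toy_potential(words)


def rescore(nbest, lm_weight=1.0):
    """Return the hypothesis maximizing acoustic score + lm_weight * LM log-probability."""
    return max(nbest, key=lambda h: h[1] + lm_weight * sentence_logprob(h[0]))


# Each hypothesis is (words, acoustic log-score); the numbers are made up.
nbest = [
    (["the", "cap", "sat"], -10.0),
    (["the", "cat", "sat"], -10.2),
]
best_words, _ = rescore(nbest)
print(" ".join(best_words))
```

In this toy setup the acoustic score alone slightly prefers the first hypothesis, and adding the sentence-level LM score moves the choice to the second. Each hypothesis costs one potential evaluation plus a table lookup for its length, rather than a normalization over the vocabulary at every word position.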
Pages: 294-300
Number of pages: 7
Related papers
50 in total
  • [1] Trans-dimensional Random Fields for Language Modeling
    Wang, Bin
    Ou, Zhijian
    Tan, Zhiqiang
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 785 - 794
  • [2] Trans-dimensional random fields for language modeling
    Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
    Unknown, NJ 08854, United States
    ACL-IJCNLP - Annu. Meet. Assoc. Comput. Linguist. Int. Jt. Conf. Nat. Lang. Process. Asian Fed. Nat. Lang. Process., Proc. Conf., (785-794):
  • [3] Learning Trans-Dimensional Random Fields with Applications to Language Modeling
    Wang, Bin
    Ou, Zhijian
    Tan, Zhiqiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 876 - 890
  • [4] LEARNING NEURAL TRANS-DIMENSIONAL RANDOM FIELD LANGUAGE MODELS WITH NOISE-CONTRASTIVE ESTIMATION
    Wang, Bin
    Ou, Zhijian
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6134 - 6138
  • [5] IMPROVED TRAINING OF NEURAL TRANS-DIMENSIONAL RANDOM FIELD LANGUAGE MODELS WITH DYNAMIC NOISE-CONTRASTIVE ESTIMATION
    Wang, Bin
    Ou, Zhijian
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 70 - 76
  • [6] INTEGRATING DISCRETE AND NEURAL FEATURES VIA MIXED-FEATURE TRANS-DIMENSIONAL RANDOM FIELD LANGUAGE MODELS
    Gao, Silin
    Ou, Zhijian
    Yang, Wei
    Xu, Huifang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6169 - 6173
  • [7] Trans-Dimensional Generative Modeling via Jump Diffusion Models
    Campbell, Andrew
    Harvey, William
    Weilbach, Christian
    De Bortoli, Valentin
    Rainforth, Tom
    Doucet, Arnaud
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Trans-dimensional geoacoustic inversion
    Dettmer, Jan
    Dosso, Stan E.
    Holland, Charles W.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 128 (06): : 3393 - 3405
  • [9] Beyond trans-dimensional RJMCMC with a case study in impulsive data modeling
    Karakus, O.
    Kuruoglu, E. E.
    Altinkaya, M. A.
    SIGNAL PROCESSING, 2018, 153 : 396 - 410
  • [10] Trans-dimensional imaging of the random inhomogeneity structure in the southern Ryukyu arc, Japan
    Takahashi, Tsutomu
    Kaiho, Yuka
    Ishihara, Yasushi
    Obana, Koichiro
    Miura, Seiichi
    Kodaira, Shuichi
    Kaneda, Yoshiyuki
    GEOPHYSICAL JOURNAL INTERNATIONAL, 2022, 229 (02) : 1392 - 1407