LANGUAGE MODELING WITH NEURAL TRANS-DIMENSIONAL RANDOM FIELDS

被引:0
|
作者
Wang, Bin [1 ]
Ou, Zhijian [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
关键词
Language modeling; Random field; Stochastic approximation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Trans-dimensional random field language models (TRF LMs) have recently been introduced, where sentences are modeled as a collection of random fields. The TRF approach has been shown to have the advantages of being computationally more efficient in inference than LSTM LMs with close performance and being able to flexibly integrate rich features. In this paper we propose neural TRFs, beyond of the previous discrete TRFs that only use linear potentials with discrete features. The idea is to use nonlinear potentials with continuous features, implemented by neural networks (NNs), in the TRF framework. Neural TRFs combine the advantages of both NNs and TRFs. The benefits of word embedding, nonlinear feature learning and larger context modeling are inherited from the use of NNs. At the same time, the strength of efficient inference by avoiding expensive softmax is preserved. A number of technical contributions, including employing deep convolutional neural networks (CNNs) to define the potentials and incorporating the joint stochastic approximation (JSA) strategy in the training algorithm, are developed in this work, which enable us to successfully train neural TRF LMs. Various LMs are evaluated in terms of speech recognition WERs by rescoring the 1000-best lists of WSJ'92 test data. The results show that neural TRF LMs not only improve over discrete TRF LMs, but also perform slightly better than LSTM LMs with only one fifth of parameters and 16x faster inference efficiency.
引用
收藏
页码:294 / 300
页数:7
相关论文
共 50 条
  • [21] Trans-dimensional Bayesian geoacoustic inversion in shallow water
    WU Weiwen
    REN Qunyan
    LU Licheng
    Guo Shengming
    MA Li
    ChineseJournalofAcoustics, 2023, 42 (04) : 407 - 422
  • [22] Trans-Dimensional Surface Reconstruction With Different Classes of Parameterization
    Hawkins, Rhys
    Bodin, Thomas
    Sambridge, Malcolm
    Choblet, Gael
    Husson, Laurent
    GEOCHEMISTRY GEOPHYSICS GEOSYSTEMS, 2019, 20 (01): : 505 - 529
  • [23] Modeling of two-dimensional random fields
    Jankowski, R
    Walukiewicz, H
    PROBABILISTIC ENGINEERING MECHANICS, 1997, 12 (02) : 115 - 121
  • [24] Trans-dimensional Bayesian inference for gravitational lens substructures
    Brewer, Brendon J.
    Huijser, David
    Lewis, Geraint F.
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2016, 455 (02) : 1819 - 1829
  • [25] Development of a Trans-Dimensional Fault Slip Inversion for Geodetic Data
    Tomita, Fumiaki
    Iinuma, Takeshi
    Agata, Ryoichiro
    Hori, Takane
    JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2021, 126 (05)
  • [26] Interacting sequential Monte Carlo samplers for trans-dimensional simulation
    Jasra, Ajay
    Doucet, Arnaud
    Stephens, David A.
    Holmes, Christopher C.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (04) : 1765 - 1791
  • [27] Efficient hierarchical trans-dimensional Bayesian inversion of magnetotelluric data
    Xiang, Enming
    Guo, Rongwen
    Dosso, Stan E.
    Liu, Jianxin
    Dong, Hao
    Ren, Zhengyong
    GEOPHYSICAL JOURNAL INTERNATIONAL, 2018, 213 (03) : 1751 - 1767
  • [28] A distance-based diagnostic for trans-dimensional Markov chains
    Sisson, S. A.
    Fan, Y.
    STATISTICS AND COMPUTING, 2007, 17 (04) : 357 - 367
  • [29] A Bayesian trans-dimensional approach for the fusion of multiple geophysical datasets
    JafarGandomi, Arash
    Binley, Andrew
    JOURNAL OF APPLIED GEOPHYSICS, 2013, 96 : 38 - 54
  • [30] A distance-based diagnostic for trans-dimensional Markov chains
    S. A. Sisson
    Y. Fan
    Statistics and Computing, 2007, 17 : 357 - 367