Large Vocabulary SOUL Neural Network Language Models

Authors
Le, Hai-Son [1, 2]
Oparin, Ilya [2]
Messaoudi, Abdel [2]
Allauzen, Alexandre [1, 2]
Gauvain, Jean-Luc [2]
Yvon, Francois [1, 2]
Affiliations
[1] Univ Paris 11, BP 133, F-91403 Orsay, France
[2] Spoken Language Proc Grp, CNRS, LIMSI, F-91403 Orsay, France
Keywords
Neural Network Language Model; Automatic Speech Recognition; Speech-To-Text
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a continuation of research on Structured OUtput Layer Neural Network language models (SOUL NNLMs) for automatic speech recognition. As SOUL NNLMs allow estimating probabilities for all in-vocabulary words, and not only for those belonging to a limited shortlist, we investigate their performance on a large-vocabulary task. Significant improvements in both perplexity and word error rate over conventional shortlist-based NNLMs are shown on a challenging Arabic GALE task characterized by a recognition vocabulary of about 300k entries. A new training scheme is proposed for SOUL NNLMs that is based on separate training of the out-of-shortlist part of the output layer. It allows more data to be used at each training iteration of the neural network without any considerable slowdown in training, and it brings additional improvements in speech recognition performance.
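The abstract does not spell out how the output layer is structured, so the following is only an illustrative sketch of the general idea of assigning a probability to every in-vocabulary word rather than to a shortlist alone. It assumes a simple two-level factorization in which shortlist words receive direct outputs and each remaining word is reached through a word class, P(w | h) = P(c | h) · P(w | c, h). The class name `TwoLevelOutputLayer`, the random weights, and the toy vocabulary are all hypothetical and are not taken from the paper.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a non-empty list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


class TwoLevelOutputLayer:
    """Hypothetical structured output layer (an assumption, not the paper's model).

    Shortlist words get a direct output unit; every other word is scored
    as P(class | h) * P(word | class, h).
    """

    def __init__(self, shortlist, classes, hidden_size, seed=0):
        rng = random.Random(seed)
        self.shortlist = list(shortlist)
        self.classes = [list(c) for c in classes]  # each class must be non-empty

        def rand_vec():
            return [rng.uniform(-0.1, 0.1) for _ in range(hidden_size)]

        # Top layer: one unit per shortlist word plus one unit per class.
        self.top = [rand_vec() for _ in range(len(self.shortlist) + len(self.classes))]
        # One small softmax layer per class, over that class's member words.
        self.within = [[rand_vec() for _ in members] for members in self.classes]

    def distribution(self, hidden):
        """Return a dict mapping every vocabulary word to its probability."""
        top_probs = softmax([dot(w, hidden) for w in self.top])
        n_short = len(self.shortlist)
        probs = {word: top_probs[i] for i, word in enumerate(self.shortlist)}
        for k, members in enumerate(self.classes):
            inner = softmax([dot(w, hidden) for w in self.within[k]])
            for word, p in zip(members, inner):
                probs[word] = top_probs[n_short + k] * p
        return probs


# Toy usage with made-up words and a made-up hidden vector.
layer = TwoLevelOutputLayer(
    shortlist=["the", "of"],
    classes=[["kitab", "qalam"], ["madrasa"]],
    hidden_size=4,
)
probs = layer.distribution([0.5, -0.2, 0.1, 0.3])
total_probability = sum(probs.values())
```

Because the top-level softmax and each within-class softmax are normalized separately, the resulting probabilities over the whole vocabulary sum to one, while each softmax stays much smaller than a single layer over all words would be.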
Pages: 1480 / +
Page count: 2