Long-Distance Continuous Space Language Modeling for Speech Recognition

Authors
Talaat, Mohamed [1]
Abdou, Sherif [1]
Shoman, Mahmoud [1]
Affiliations
[1] Cairo Univ, Fac Comp & Informat, Giza 12613, Egypt
Keywords
Language model; n-gram; Continuous space; Latent semantic analysis; Word co-occurrence matrix; Long distance; Tied-mixture model; HYBRID
DOI
10.1007/978-3-319-18117-2_41
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
N-gram language models have long been the most frequently used language models because they are easy to build and require minimal effort to integrate into different NLP applications. Despite their popularity, n-gram models suffer from several drawbacks, such as a limited ability to generalize to words unseen in the training data, limited adaptability to new domains, and a focus on only short-distance word relations. To overcome these problems, continuous parameter space LMs were introduced. In these models, words are treated as vectors of real numbers rather than as discrete entities. As a result, semantic relationships between words can be quantified and integrated into the model, and infrequent words are modeled using more frequent ones that are semantically similar. In this paper we present a long-distance continuous language model based on latent semantic analysis (LSA). In the LSA framework, the word-document co-occurrence matrix is commonly used to record how many times a word occurs in a given document; the word-word co-occurrence matrix has also been used in many previous studies. In this research, we introduce a different representation of the text corpus by proposing long-distance word co-occurrence matrices. These matrices represent the long-range co-occurrences between different words at different distances in the corpus. By applying LSA to these matrices, words in the vocabulary are mapped to a continuous vector space. We represent each word with a continuous vector that preserves the word order and position in the sentences. We use tied-mixture HMM modeling (TM-HMM) to robustly estimate the LM parameters and word probabilities. Experiments on the Arabic Gigaword corpus show improvements in perplexity and speech recognition results compared to the conventional n-gram.
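The core idea in the abstract, counting word co-occurrences at several distances and then applying LSA, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `cooccurrence_at_distance` and `lsa_word_vectors` are hypothetical, the toy corpus is invented, and stacking the per-distance matrices side by side before a truncated SVD is an assumption, since the abstract does not say how the matrices are combined. The tied-mixture HMM stage is not covered.

```python
import numpy as np

def cooccurrence_at_distance(sentences, vocab, d):
    """Count how often word j occurs exactly d positions after word i."""
    index = {w: k for k, w in enumerate(vocab)}
    counts = np.zeros((len(vocab), len(vocab)))
    for sent in sentences:
        for pos in range(len(sent) - d):
            left, right = sent[pos], sent[pos + d]
            if left in index and right in index:
                counts[index[left], index[right]] += 1
    return counts

def lsa_word_vectors(sentences, vocab, max_distance, rank):
    """Map each word to a dense vector via truncated SVD.

    Assumption: the matrices for distances 1..max_distance are
    concatenated column-wise, so each word's row keeps separate
    counts for every distance.
    """
    mats = [cooccurrence_at_distance(sentences, vocab, d)
            for d in range(1, max_distance + 1)]
    stacked = np.hstack(mats)  # shape: V x (V * max_distance)
    u, s, _ = np.linalg.svd(stacked, full_matrices=False)
    k = min(rank, len(s))
    return u[:, :k] * s[:k]    # one row per vocabulary word

# Toy corpus, invented for illustration only.
sentences = [["the", "cat", "sat"], ["the", "dog", "sat"]]
vocab = ["the", "cat", "dog", "sat"]

c1 = cooccurrence_at_distance(sentences, vocab, 1)
c2 = cooccurrence_at_distance(sentences, vocab, 2)
vectors = lsa_word_vectors(sentences, vocab, max_distance=2, rank=2)
```

In this toy corpus "cat" and "dog" have identical right-hand contexts at every distance, so they receive the same vector, which is the kind of similarity a continuous space model can use to share statistics between frequent and infrequent words.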
Pages: 549-564
Number of pages: 16