Long-Distance Continuous Space Language Modeling for Speech Recognition

被引:0
|
作者
Talaat, Mohamed [1 ]
Abdou, Sherif [1 ]
Shoman, Mahmoud [1 ]
机构
[1] Cairo Univ, Fac Comp & Informat, Giza 12613, Egypt
关键词
Language model; n-gram; Continuous space; Latent semantic analysis; Word co-occurrence matrix; Long distance; Tied-mixture model; HYBRID;
D O I
10.1007/978-3-319-18117-2_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The n-gram language models has been the most frequently used language model for a long time as they are easy to build models and require the minimum effort for integration in different NLP applications. Although of its popularity, n-gram models suffer from several drawbacks such as its ability to generalize for the unseen words in the training data, the adaptability to new domains, and the focus only on short distance word relations. To overcome the problems of the n-gram models the continuous parameter space LMs were introduced. In these models the words are treated as vectors of real numbers rather than of discrete entities. As a result, semantic relationships between the words could be quantified and can be integrated into the model. The infrequent words are modeled using the more frequent ones that are semantically similar. In this paper we present a long distance continuous language model based on a latent semantic analysis (LSA). In the LSA framework, the word-document co-occurrence matrix is commonly used to tell how many times a word occurs in a certain document. Also, the word-word co-occurrence matrix is used in many previous studies. In this research, we introduce a different representation for the text corpus, this by proposing long-distance word co-occurrence matrices. These matrices to represent the long range co-occurrences between different words on different distances in the corpus. By applying LSA to these matrices, words in the vocabulary are moved to the continuous vector space. We represent each word with a continuous vector that keeps the word order and position in the sentences. We use tied-mixture HMM modeling (TM-HMM) to robustly estimate the LM parameters and word probabilities. Experiments on the Arabic Gigaword corpus show improvements in the perplexity and the speech recognition results compared to the conventional n-gram.
引用
收藏
页码:549 / 564
页数:16
相关论文
共 50 条
  • [41] Efficient Structured Language Modeling for Speech Recognition
    Rastrow, Ariya
    Dredze, Mark
    Khudanpur, Sanjeev
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1658 - 1661
  • [42] Study on the integration of speech and language processing in recognition of Chinese continuous speech
    Zhao, L.
    Zhou, C.R.
    Wu, Z.Y.
    Shengxue Xuebao/Acta Acustica, 2001, 26 (01): : 73 - 78
  • [43] Continuous Variable Entanglement Distribution for Long-Distance Quantum Communication
    Zhao Jun-Jun
    Guo Xiao-Min
    Wang Xu-Yang
    Wang Ning
    Li Yong-Min
    Peng Kun-Chi
    CHINESE PHYSICS LETTERS, 2013, 30 (06)
  • [44] Long-distance rhythmic dependencies and their application to automatic language identification
    Tepperman, Joseph
    Nava, Emily
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1068 - 1071
  • [45] Long-Distance/Environment Face Image Enhancement Method for Recognition
    Wang, Zhengning
    Ma, Shanshan
    Han, Mingyan
    Hu, Guang
    Liu, Shuaicheng
    IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 501 - 511
  • [46] 24 HOUR CONTINUOUS ECG RECORDINGS IN LONG-DISTANCE RUNNERS
    TALAN, DA
    BAUERNFEIND, RA
    ASHLEY, WW
    KANAKIS, C
    ROSEN, KM
    CHEST, 1982, 82 (01) : 19 - 24
  • [47] Continuous Variable Entanglement Distribution for Long-Distance Quantum Communication
    赵军军
    郭晓敏
    王旭阳
    王宁
    李永民
    彭堃墀
    Chinese Physics Letters, 2013, 30 (06) : 17 - 20
  • [48] Possible role of electrodynamic interactions in long-distance biomolecular recognition
    Preto, Jordane
    Pettini, Marco
    Tuszynski, Jack A.
    PHYSICAL REVIEW E, 2015, 91 (05)
  • [49] Research on long-distance hand recognition based on depth information
    Fu, Yuyang
    Miao, Lanfang
    Li, Zhifei
    2018 INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS AND CONTROL ENGINEERING (ISPECE 2018), 2019, 1187
  • [50] Feature sets in continuous speech recognition for the Portuguese language
    dos Santos, SCB
    Alcaim, A
    ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 126 - 129