Efficient Structured Language Modeling for Speech Recognition

被引:0
|
作者
Rastrow, Ariya [1 ]
Dredze, Mark [1 ]
Khudanpur, Sanjeev [1 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The structured language model (SLM) of [1] was one of the first to successfully integrate syntactic structure into language models. We extend the SLM framework in two new directions. First, we propose a new syntactic hierarchical interpolation that improves over previous approaches. Second, we develop a general information-theoretic algorithm for pruning the underlying Jelinek-Mercer interpolated LM used in [1], which substantially reduces the size of the LM, enabling us to train on large data. When combined with hill-climbing [2] the SLM is an accurate model, space-efficient and fast for rescoring large speech lattices. Experimental results on broadcast news demonstrate that the SLM outperforms a large 4-gram LM.
引用
收藏
页码:1658 / 1661
页数:4
相关论文
共 50 条
  • [1] An Evaluation of Structured Language Modeling for Automatic Speech Recognition
    Bjorklund, Johanna
    Cleophas, Loek
    Karlsson, My
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2017, 23 (11) : 1019 - 1034
  • [2] RELEVANCE LANGUAGE MODELING FOR SPEECH RECOGNITION
    Chen, Kuan-Yu
    Chen, Berlin
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5568 - 5571
  • [3] Accurate and Structured Pruning for Efficient Automatic Speech Recognition
    Jiang, Huiqiang
    Zhang, Li Lyna
    Li, Yuang
    Wu, Yu
    Cao, Shijie
    Cao, Ting
    Yang, Yuqing
    Li, Jinyu
    Yang, Mao
    Qiu, Lili
    INTERSPEECH 2023, 2023, : 4104 - 4108
  • [4] Joint acoustic and language modeling for speech recognition
    Chien, Jen-Tzung
    Chueh, Chuang-Hua
    SPEECH COMMUNICATION, 2010, 52 (03) : 223 - 235
  • [5] CONTINUOUS TOPIC LANGUAGE MODELING FOR SPEECH RECOGNITION
    Chueh, Chuang-Hua
    Chien, Jen-Tzung
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 193 - 196
  • [6] Language Modeling for Speech Recognition of Spoken Cantonese
    Yeung, Yu Ting
    Cao, Houwei
    Zheng, N. H.
    Lee, Tan
    Ching, P. C.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1570 - 1573
  • [7] POSITION INFORMATION FOR LANGUAGE MODELING IN SPEECH RECOGNITION
    Chiu, Hsuan-Sheng
    Chen, Guan-Yu
    Lee, Chun-Jen
    Chen, Berlin
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 101 - 104
  • [8] Latent semantic language modeling for speech recognition
    Bellegarda, JR
    MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 73 - 103
  • [9] Efficient structured reporting in radiology using an intelligent dialogue system based on speech recognition and natural language processing
    Tobias Jorg
    Benedikt Kämpgen
    Dennis Feiler
    Lukas Müller
    Christoph Düber
    Peter Mildenberger
    Florian Jungmann
    Insights into Imaging, 14
  • [10] Efficient structured reporting in radiology using an intelligent dialogue system based on speech recognition and natural language processing
    Jorg, Tobias
    Kaempgen, Benedikt
    Feiler, Dennis
    Mueller, Lukas
    Dueber, Christoph
    Mildenberger, Peter
    Jungmann, Florian
    INSIGHTS INTO IMAGING, 2023, 14 (01)