Syllable language models for Mandarin speech recognition: Exploiting character language models

Cited by: 18
Authors
Liu, Xunying [1]
Hieronymus, James L. [2]
Gales, Mark J. F. [1]
Woodland, Philip C. [1]
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[2] Int Comp Sci Inst, Berkeley, CA 94704 USA
Source
Keywords
CHINESE-LANGUAGE; ADAPTATION; ALGORITHM;
DOI
10.1121/1.4768800
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Mandarin Chinese is based on characters which are syllabic in nature and morphological in meaning. All spoken languages have syllabiotactic rules which govern the construction of syllables and their allowed sequences. These constraints are not as restrictive as those learned from word sequences, but they can provide additional useful linguistic information. Hence, it is possible to improve speech recognition performance by appropriately combining these two types of constraints. For the Chinese language considered in this paper, character-level language models (LMs) can be used as a first-level approximation to allowed syllable sequences. To test this idea, word-level and character-level n-gram LMs were trained on 2.8 billion words (equivalent to 4.3 billion characters) of text from a wide range of sources. Both hypothesis-based and model-based techniques were investigated for combining word-level and character-level LMs. Significant character error rate reductions of up to 7.3% relative were obtained on a state-of-the-art Mandarin Chinese broadcast audio recognition task using an adapted history-dependent multi-level LM that performs a log-linear combination of character-level and word-level LMs. This supports the hypothesis that character or syllable sequence models are useful for improving Mandarin speech recognition performance. (C) 2013 Acoustical Society of America. [http://dx.doi.org/10.1121/1.4768800]
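As a rough illustration of what a log-linear combination of word-level and character-level LM scores can look like, the sketch below rescores a list of recognition hypotheses with a single fixed interpolation weight. It is a simplified assumption, not the paper's adapted history-dependent multi-level LM: the function names, the `word_weight` parameter, and the hypothesis scores are all hypothetical, and the combined score is left unnormalised.

```python
def log_linear_combine(word_logprob, char_logprob, word_weight=0.5):
    """Unnormalised log-linear score of two LM log-probabilities.

    Computes w * log P_word + (1 - w) * log P_char, where w is a
    hypothetical fixed weight (the paper's weights are adapted and
    history dependent, which this sketch does not model).
    """
    return word_weight * word_logprob + (1.0 - word_weight) * char_logprob


def rescore(hypotheses, word_weight=0.5):
    """Return the hypothesis with the highest combined LM score."""
    return max(
        hypotheses,
        key=lambda h: log_linear_combine(
            h["word_logprob"], h["char_logprob"], word_weight
        ),
    )


# Hypothetical scores for two competing hypotheses (not real data).
hypotheses = [
    {"text": "A", "word_logprob": -10.0, "char_logprob": -30.0},
    {"text": "B", "word_logprob": -12.0, "char_logprob": -20.0},
]

best_word_only = rescore(hypotheses, word_weight=1.0)
best_combined = rescore(hypotheses, word_weight=0.5)
print(best_word_only["text"], best_combined["text"])
```

With the word LM alone, hypothesis A wins; once the character LM contributes half of the score, hypothesis B is preferred, which shows how character-level constraints can change the ranking.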
Pages: 519 - 528
Number of pages: 10
Related papers
50 in total
  • [21] SEMANTIC LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Bayer, Ali Orkan
    Riccardi, Giuseppe
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 7 - 12
  • [22] PROMPTING LARGE LANGUAGE MODELS WITH SPEECH RECOGNITION ABILITIES
    Fathullah, Yassir
    Wu, Chunyang
    Lakomkin, Egor
    Jia, Junteng
    Shangguan, Yuan
    Li, Ke
    Guo, Jinxi
    Xiong, Wenhan
    Mahadeokar, Jay
    Kalinli, Ozlem
    Fuegen, Christian
    Seltzer, Mike
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 13351 - 13355
  • [23] JOINT LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING
    Bayer, Ali Orkan
    Riccardi, Giuseppe
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 199 - 203
  • [24] On the Strength of Character Language Models for Multilingual Named Entity Recognition
    Yu, Xiaodong
    Mayhew, Stephen
    Sammons, Mark
    Roth, Dan
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3073 - 3077
  • [25] Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition
    Siniscalchi, Sabato Marco
    Reed, Jeremy
    Svendsen, Torbjorn
    Lee, Chin-Hui
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2726 - +
  • [26] Syllable-based Myanmar Language Model for Speech Recognition
    Soe, Wunna
    Thein, Yadana
    2015 IEEE/ACIS 14TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2015, : 291 - 296
  • [27] ANALYZING A SIMPLE LANGUAGE MODEL - SOME GENERAL CONCLUSIONS FOR LANGUAGE MODELS FOR SPEECH RECOGNITION
    UEBERLA, J
    COMPUTER SPEECH AND LANGUAGE, 1994, 8 (02): : 153 - 176
  • [29] MIXED PRECISION QUANTIZATION OF TRANSFORMER LANGUAGE MODELS FOR SPEECH RECOGNITION
    Xu, Junhao
    Hu, Shoukang
    Yu, Jianwei
    Liu, Xunying
    Meng, Helen
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7383 - 7387
  • [30] Automatic Speech Recognition for Irish: testing lexicons and language models
    Qian, Mengjie
    Berthelsen, Harald
    Lonergan, Liam
    Murphy, Andy
    O'Neill, Claire
    Chiarain, Neasa Ni
    Gobl, Christer
    Chasaide, Ailbhe Ni
    2022 33RD IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2022,