An embedded multilingual speech recognition system for Mandarin, Cantonese, and English

被引:0
|
作者
Wang, X [1 ]
Cao, Y [1 ]
Ding, F [1 ]
Tang, YZ [1 ]
机构
[1] Nokia Res Ctr, Audio Visual Syst Lab, Beijing, Peoples R China
关键词
embedded multilingual speech recognition; non-native speech recognition; automatic language identification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a small-footprint, speaker-independent, multilingual system for isolated word recognition of Mandarin. Cantonese, and English. The baseline system got very promising results without any phoneme shared between different languages. By sharing phonemes, the memory and computational complexity was reduced by about 40%. Non-native, accented speech recognition and mixed language words support are the distinguishing features of our system. Automatic language identification (LID) is one of the key elements in language-independent automatic speech recognition (ASR) systems. LID performance is also analyzed in addition to the engine performance of the proposed system. Supervised Bayesian online adaptation was proved to be effective in compensation for accent mismatch, environment mismatch, as well as for modeling inaccuracy introduced by combined training.
引用
收藏
页码:758 / 764
页数:7
相关论文
共 50 条
  • [41] Embedded Speech recognition interaction system research
    Luo, Qiong
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INFORMATION SCIENCES, MACHINERY, MATERIALS AND ENERGY (ICISMME 2015), 2015, 126 : 1035 - 1038
  • [42] Development of Text and Speech Corpus for Designing the Multilingual Recognition System
    Bansal, Shweta
    Agrawal, Shyam S.
    2018 ORIENTAL COCOSDA - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2018, : 1 - 7
  • [43] Language Modeling for Speech Recognition of Spoken Cantonese
    Yeung, Yu Ting
    Cao, Houwei
    Zheng, N. H.
    Lee, Tan
    Ching, P. C.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1570 - 1573
  • [44] Acquisition and Interpretation of Mandarin Speech Prosody by Native Speakers and Cantonese Learners
    Chen, Xi
    Chen, Si
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1800 - 1809
  • [45] A Probe into the Different Approaches Adopted by Cantonese and Mandarin in Translating English Food
    Zhao, Yuan
    Chu, Tian-shu
    Deng, Yi-yun
    Zhao, You-bin
    INTERNATIONAL CONFERENCE ON EDUCATION AND MANAGEMENT SCIENCE (ICEMS 2014), 2014, : 514 - 517
  • [46] Syllabic reduction in Mandarin and English speech
    Burchfield, L. Ann
    Bradlow, Ann R.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (06): : EL270 - EL276
  • [47] A low cost embedded mandarin Speech Recognition system based on 16-bit fixed-point DSP
    He, Q
    ICCC2004: Proceedings of the 16th International Conference on Computer Communication Vol 1and 2, 2004, : 1203 - 1206
  • [48] A bilingual speech recognition system for English and Tamil
    Kumar, CS
    Wei, FS
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1641 - 1644
  • [49] Acoustic data augmentation for Mandarin-English code-switching speech recognition
    Long, Yanhua
    Li, Yijie
    Zhang, Qiaozheng
    Wei, Shuang
    Ye, Hong
    Yang, Jichen
    APPLIED ACOUSTICS, 2020, 161
  • [50] NON-AUTOREGRESSIVE MANDARIN-ENGLISH CODE-SWITCHING SPEECH RECOGNITION
    Chuang, Shun-Po
    Chang, Heng-Jui
    Huang, Sung-Feng
    Lee, Hung-yi
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 465 - 472