An embedded multilingual speech recognition system for Mandarin, Cantonese, and English

被引:0
|
作者
Wang, X [1 ]
Cao, Y [1 ]
Ding, F [1 ]
Tang, YZ [1 ]
机构
[1] Nokia Res Ctr, Audio Visual Syst Lab, Beijing, Peoples R China
关键词
embedded multilingual speech recognition; non-native speech recognition; automatic language identification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a small-footprint, speaker-independent, multilingual system for isolated word recognition of Mandarin. Cantonese, and English. The baseline system got very promising results without any phoneme shared between different languages. By sharing phonemes, the memory and computational complexity was reduced by about 40%. Non-native, accented speech recognition and mixed language words support are the distinguishing features of our system. Automatic language identification (LID) is one of the key elements in language-independent automatic speech recognition (ASR) systems. LID performance is also analyzed in addition to the engine performance of the proposed system. Supervised Bayesian online adaptation was proved to be effective in compensation for accent mismatch, environment mismatch, as well as for modeling inaccuracy introduced by combined training.
引用
收藏
页码:758 / 764
页数:7
相关论文
共 50 条
  • [1] Feature masking in an embedded Mandarin speech recognition system
    Tang, YZ
    Wang, X
    Cao, Y
    Ding, F
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 245 - 248
  • [2] International Students' Investment in Learning Cantonese, English, Mandarin, and Portuguese in Multilingual Macao
    Reynolds, Barry Lee
    Ren, Ning
    Li, Janis Zhiyou
    Fang, Fan
    JOURNAL OF LANGUAGE IDENTITY AND EDUCATION, 2024,
  • [3] Tone recognition for Chinese speech: A comparative study of Mandarin and Cantonese
    Peng, G
    Zheng, HY
    Wang, WSY
    2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 233 - 236
  • [4] Robust Mandarin speech recognition in car environments for embedded navigation system
    Ding, Pei
    He, Lei
    Yan, Xiang
    Zhao, Rui
    Hao, Jie
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (02) : 584 - 590
  • [5] Acoustic modelling for Chinese speech recognition: A comparative study of Mandarin and Cantonese
    Gao, S
    Lee, T
    Wong, YW
    Xu, B
    Ching, PC
    Huang, T
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1261 - 1264
  • [6] The CUHK Dysarthric Speech Recognition Systems for English and Cantonese
    Hu, Shoukang
    Liu, Shansong
    Chang, Heng Fai
    Geng, Mengzhe
    Chen, Jiani
    Chung, Lau Wing
    Hei, To Ka
    Yu, Jianwei
    Wong, Ka Ho
    Liu, Xunying
    Meng, Helen
    INTERSPEECH 2019, 2019, : 3669 - 3670
  • [7] A FIRST SPEECH RECOGNITION SYSTEM FOR MANDARIN-ENGLISH CODE-SWITCH CONVERSATIONAL SPEECH
    Ngoc Thang Vu
    Lyu, Dau-Cheng
    Weiner, Jochen
    Telaar, Dominic
    Schlippe, Tim
    Blaicher, Fabian
    Chng, Eng-Siong
    Schultz, Tanja
    Li, Haizhou
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4889 - 4892
  • [8] A FIRST SPEECH RECOGNITION SYSTEM FOR MANDARIN-ENGLISH CODE-SWITCH CONVERSATIONAL SPEECH
    Ngoc Thang Vu
    Lyu, Dau-Cheng
    Weiner, Jochen
    Telaar, Dominic
    Schlippe, Tim
    Blaicher, Fabian
    Chng, Eng-Siong
    Schultz, Tanja
    Li, Haizhou
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4889 - 4892
  • [9] A scalable architecture for multilingual speech recognition on embedded devices
    Raab, Martin
    Gruhn, Rainer
    Noeth, Elmar
    SPEECH COMMUNICATION, 2011, 53 (01) : 62 - 74
  • [10] Embedded deep learning models for multilingual speech recognition
    Rahmouni, Mohamed Hedi
    Salhi, Mohamed Salah
    Touti, Ezzeddine
    Allagui, Hatem
    Aoudia, Mouloud
    Barr, Mohammad
    COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123