Mandarin-English Code-switching Speech Recognition

被引:0
|
作者
Xu, Haihua [1 ]
Van Tung Pham [1 ,2 ]
Kyaw, Zin Tun [2 ]
Lim, Zhi Hao [1 ]
Chng, Eng Siong [1 ,2 ]
Li, Haizhou [3 ]
机构
[1] Nanyang Technol Univ, Temasek Labs, Singapore, Singapore
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[3] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
关键词
code-switch speech recognition; phone sets; context-aware language model; TDNN;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work presents the development of a Mandarin-English code-switching speech recognition system. We demonstrate three key novelties in our system. First, we increase our lexicon coverage to 360K words, where phone sets of different languages are maintained separately. Secondly, we used over 1000 hours of training data combining both mono-lingual and code-switch corpus to develop the acoustic model. Finally, for language modelling, we applied context-aware text normalization and word-class language model. When testing on our internal code-switch close talk microphone recording, the system achieves recognition performance that can support real applications.
引用
收藏
页码:554 / 555
页数:2
相关论文
共 50 条
  • [31] TEXTUAL DATA AUGMENTATION FOR ARABIC-ENGLISH CODE-SWITCHING SPEECH RECOGNITION
    Hussein, Amir
    Chowdhury, Shammur Absar
    Abdelali, Ahmed
    Dehak, Najim
    Ali, Ahmed
    Khudanpur, Sanjeev
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 777 - 784
  • [32] Mandarin-English bilingual speech recognition for real world music retrieval
    Zhang, Qingqing
    Pan, Jielin
    Yan, Yonghong
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4253 - 4256
  • [33] Spoken Language Identification System for English-Mandarin Code-Switching Child-Directed Speech
    Gupta, Shashi Kant
    Hiray, Sushant
    Kukde, Prashant
    INTERSPEECH 2023, 2023, : 4114 - 4118
  • [34] Speech recognition on code-switching among the Chinese dialects
    Lyu, Dau-cheng
    Lyu, Ren-yuan
    Chiang, Yuang-chin
    Hsu, Chun-nan
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1105 - 1108
  • [35] MECOS: A bilingual Manipuri-English spontaneous code-switching speech corpus for automatic speech recognition
    Singh, Naorem Karline
    Chanu, Yambem Jina
    Pangsatabam, Hoomexsun
    COMPUTER SPEECH AND LANGUAGE, 2024, 87
  • [36] Multi-Task Learning in Deep Neural Networks for Mandarin-English Code-Mixing Speech Recognition
    Chen, Mengzhe
    Pan, Jielin
    Zhao, Qingwei
    Yan, Yonghong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10): : 2554 - 2557
  • [37] Improving speech transcription for Mandarin-English translation
    Tomalin, M.
    Gales, M. J. F.
    Liu, X. A.
    Sim, K. C.
    Sinha, R.
    Wang, L.
    Woodland, P. C.
    Yu, K.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 97 - +
  • [38] Improving End-to-End Modeling For Mandarin-English Code-Switching Using Lightweight Switch-Routing Mixture-of-Experts
    Tan, Fengyun
    Feng, Chaofeng
    Wei, Tao
    Gong, Shuai
    Leng, Jinqiang
    Chu, Wei
    Ma, Jun
    Wang, Shaojun
    Xiao, Jing
    INTERSPEECH 2023, 2023, : 4224 - 4228
  • [39] Language choice and code-switching in bilingual children's interaction under multilingual contexts: evidence from Mandarin-English bilingual preschoolers
    Zhang, Haijing
    Huang, Fangwei
    Wang, Cong
    INTERNATIONAL JOURNAL OF MULTILINGUALISM, 2024,
  • [40] PHONE MODELING AND COMBINING DISCRIMINATIVE TRAINING FOR MANDARIN-ENGLISH BILINGUAL SPEECH RECOGNITION
    Qian, Yanmin
    Liu, Jia
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4918 - 4921