Mandarin-English Code-switching Speech Recognition

被引:0
|
作者
Xu, Haihua [1 ]
Van Tung Pham [1 ,2 ]
Kyaw, Zin Tun [2 ]
Lim, Zhi Hao [1 ]
Chng, Eng Siong [1 ,2 ]
Li, Haizhou [3 ]
机构
[1] Nanyang Technol Univ, Temasek Labs, Singapore, Singapore
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[3] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
关键词
code-switch speech recognition; phone sets; context-aware language model; TDNN;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work presents the development of a Mandarin-English code-switching speech recognition system. We demonstrate three key novelties in our system. First, we increase our lexicon coverage to 360K words, where phone sets of different languages are maintained separately. Secondly, we used over 1000 hours of training data combining both mono-lingual and code-switch corpus to develop the acoustic model. Finally, for language modelling, we applied context-aware text normalization and word-class language model. When testing on our internal code-switch close talk microphone recording, the system achieves recognition performance that can support real applications.
引用
收藏
页码:554 / 555
页数:2
相关论文
共 50 条
  • [11] TALCS: AN OPEN-SOURCE MANDARIN-ENGLISH CODE-SWITCHING CORPUS AND A SPEECH RECOGNITION BASELINE
    Li, Chengfei
    Deng, Shuhao
    Wang, Yaoping
    Wang, Guangjing
    Gong, Yaguang
    Chen, Changbin
    Bai, Jinfeng
    INTERSPEECH 2022, 2022, : 1741 - 1745
  • [12] Language-specific Acoustic Boundary Learning for Mandarin-English Code-switching Speech Recognition
    Fan, Zhiyun
    Dong, Linhao
    Shen, Chen
    Liang, Zhenlin
    Zhang, Jun
    Lu, Lu
    Ma, Zejun
    INTERSPEECH 2023, 2023, : 3322 - 3326
  • [13] A Review of the Mandarin-English Code-switching Corpus: SEAME
    Lee, Grandee
    Ho, Thi-Nga
    Chng, Eng-Siong
    Li, Haizhou
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 210 - 213
  • [14] Mandarin-English code-switching speech corpus in South-East Asia: SEAME
    Lyu, Dau-Cheng
    Tan, Tien-Ping
    Chng, Eng-Siong
    Li, Haizhou
    LANGUAGE RESOURCES AND EVALUATION, 2015, 49 (03) : 581 - 600
  • [15] Bi-encoder Transformer Network for Mandarin-English Code-switching Speech Recognition using Mixture of Experts
    Lu, Yizhou
    Huang, Mingkun
    Li, Hao
    Guo, Jiaqi
    Qian, Yanmin
    INTERSPEECH 2020, 2020, : 4766 - 4770
  • [16] SEAME: a Mandarin-English Code-switching Speech Corpus in South-East Asia
    Lyu, Dau-Cheng
    Tan, Tien-Ping
    Chng, Eng-Siong
    Li, Haizhou
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1986 - +
  • [17] Rnn-transducer With Language Bias For End-to-end Mandarin-English Code-switching Speech Recognition
    Zhang, Shuai
    Yi, Jiangyan
    Tian, Zhengkun
    Tao, Jianhua
    Bai, Ye
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [18] Insertional code-switching as interactional resource in Mandarin-English bilingual conversation
    Wang, Wei
    INTERNATIONAL JOURNAL OF BILINGUALISM, 2024,
  • [19] Hybrid CTC Language Identification Structure for Mandarin-English Code-Switching ASR
    Yin, Hengxin
    Hu, Guangyu
    Wang, Fei
    Ren, Pengfei
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 537 - 541
  • [20] Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-switching Speech Recognition
    Zhang, Haobo
    Xu, Haihua
    Van Tung Pham
    Huang, Hao
    Chng, Eng Siong
    INTERSPEECH 2020, 2020, : 2392 - 2396