An embedded multilingual speech recognition system for Mandarin, Cantonese, and English

被引：0

作者：

Wang, X ^{[1
]}

Cao, Y ^{[1
]}

Ding, F ^{[1
]}

Tang, YZ ^{[1
]}

机构：

[1] Nokia Res Ctr, Audio Visual Syst Lab, Beijing, Peoples R China

来源：

2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS | 2003年

关键词：

embedded multilingual speech recognition; non-native speech recognition; automatic language identification;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a small-footprint, speaker-independent, multilingual system for isolated word recognition of Mandarin. Cantonese, and English. The baseline system got very promising results without any phoneme shared between different languages. By sharing phonemes, the memory and computational complexity was reduced by about 40%. Non-native, accented speech recognition and mixed language words support are the distinguishing features of our system. Automatic language identification (LID) is one of the key elements in language-independent automatic speech recognition (ASR) systems. LID performance is also analyzed in addition to the engine performance of the proposed system. Supervised Bayesian online adaptation was proved to be effective in compensation for accent mismatch, environment mismatch, as well as for modeling inaccuracy introduced by combined training.

引用

页码：758 / 764

页数：7

共 50 条

[1] Feature masking in an embedded Mandarin speech recognition system
Tang, YZ
Wang, X
Cao, Y
Ding, F
2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 245 - 248
[2] International Students' Investment in Learning Cantonese, English, Mandarin, and Portuguese in Multilingual Macao
Reynolds, Barry Lee
Ren, Ning
Li, Janis Zhiyou
Fang, Fan
JOURNAL OF LANGUAGE IDENTITY AND EDUCATION, 2024,
[3] Tone recognition for Chinese speech: A comparative study of Mandarin and Cantonese
Peng, G
Zheng, HY
Wang, WSY
2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 233 - 236
[4] Robust Mandarin speech recognition in car environments for embedded navigation system
Ding, Pei
He, Lei
Yan, Xiang
Zhao, Rui
Hao, Jie
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (02) : 584 - 590
[5] Acoustic modelling for Chinese speech recognition: A comparative study of Mandarin and Cantonese
Gao, S
Lee, T
Wong, YW
Xu, B
Ching, PC
Huang, T
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1261 - 1264
[6] The CUHK Dysarthric Speech Recognition Systems for English and Cantonese
Hu, Shoukang
Liu, Shansong
Chang, Heng Fai
Geng, Mengzhe
Chen, Jiani
Chung, Lau Wing
Hei, To Ka
Yu, Jianwei
Wong, Ka Ho
Liu, Xunying
Meng, Helen
INTERSPEECH 2019, 2019, : 3669 - 3670
[7] A FIRST SPEECH RECOGNITION SYSTEM FOR MANDARIN-ENGLISH CODE-SWITCH CONVERSATIONAL SPEECH
Ngoc Thang Vu
Lyu, Dau-Cheng
Weiner, Jochen
Telaar, Dominic
Schlippe, Tim
Blaicher, Fabian
Chng, Eng-Siong
Schultz, Tanja
Li, Haizhou
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4889 - 4892
[8] A FIRST SPEECH RECOGNITION SYSTEM FOR MANDARIN-ENGLISH CODE-SWITCH CONVERSATIONAL SPEECH
Ngoc Thang Vu
Lyu, Dau-Cheng
Weiner, Jochen
Telaar, Dominic
Schlippe, Tim
Blaicher, Fabian
Chng, Eng-Siong
Schultz, Tanja
Li, Haizhou
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4889 - 4892
[9] A scalable architecture for multilingual speech recognition on embedded devices
Raab, Martin
Gruhn, Rainer
Noeth, Elmar
SPEECH COMMUNICATION, 2011, 53 (01) : 62 - 74
[10] Embedded deep learning models for multilingual speech recognition
Rahmouni, Mohamed Hedi
Salhi, Mohamed Salah
Touti, Ezzeddine
Allagui, Hatem
Aoudia, Mouloud
Barr, Mohammad
COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123

← 1 2 3 4 5 →