An embedded multilingual speech recognition system for Mandarin, Cantonese, and English

被引：0

作者：

Wang, X ^{[1
]}

Cao, Y ^{[1
]}

Ding, F ^{[1
]}

Tang, YZ ^{[1
]}

机构：

[1] Nokia Res Ctr, Audio Visual Syst Lab, Beijing, Peoples R China

来源：

2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS | 2003年

关键词：

embedded multilingual speech recognition; non-native speech recognition; automatic language identification;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a small-footprint, speaker-independent, multilingual system for isolated word recognition of Mandarin. Cantonese, and English. The baseline system got very promising results without any phoneme shared between different languages. By sharing phonemes, the memory and computational complexity was reduced by about 40%. Non-native, accented speech recognition and mixed language words support are the distinguishing features of our system. Automatic language identification (LID) is one of the key elements in language-independent automatic speech recognition (ASR) systems. LID performance is also analyzed in addition to the engine performance of the proposed system. Supervised Bayesian online adaptation was proved to be effective in compensation for accent mismatch, environment mismatch, as well as for modeling inaccuracy introduced by combined training.

引用

页码：758 / 764

页数：7

共 50 条

[41] Embedded Speech recognition interaction system research
Luo, Qiong
PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INFORMATION SCIENCES, MACHINERY, MATERIALS AND ENERGY (ICISMME 2015), 2015, 126 : 1035 - 1038
[42] Development of Text and Speech Corpus for Designing the Multilingual Recognition System
Bansal, Shweta
Agrawal, Shyam S.
2018 ORIENTAL COCOSDA - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2018, : 1 - 7
[43] Language Modeling for Speech Recognition of Spoken Cantonese
Yeung, Yu Ting
Cao, Houwei
Zheng, N. H.
Lee, Tan
Ching, P. C.
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1570 - 1573
[44] Acquisition and Interpretation of Mandarin Speech Prosody by Native Speakers and Cantonese Learners
Chen, Xi
Chen, Si
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1800 - 1809
[45] A Probe into the Different Approaches Adopted by Cantonese and Mandarin in Translating English Food
Zhao, Yuan
Chu, Tian-shu
Deng, Yi-yun
Zhao, You-bin
INTERNATIONAL CONFERENCE ON EDUCATION AND MANAGEMENT SCIENCE (ICEMS 2014), 2014, : 514 - 517
[46] Syllabic reduction in Mandarin and English speech
Burchfield, L. Ann
Bradlow, Ann R.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (06): : EL270 - EL276
[47] A low cost embedded mandarin Speech Recognition system based on 16-bit fixed-point DSP
He, Q
ICCC2004: Proceedings of the 16th International Conference on Computer Communication Vol 1and 2, 2004, : 1203 - 1206
[48] A bilingual speech recognition system for English and Tamil
Kumar, CS
Wei, FS
ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1641 - 1644
[49] Acoustic data augmentation for Mandarin-English code-switching speech recognition
Long, Yanhua
Li, Yijie
Zhang, Qiaozheng
Wei, Shuang
Ye, Hong
Yang, Jichen
APPLIED ACOUSTICS, 2020, 161
[50] NON-AUTOREGRESSIVE MANDARIN-ENGLISH CODE-SWITCHING SPEECH RECOGNITION
Chuang, Shun-Po
Chang, Heng-Jui
Huang, Sung-Feng
Lee, Hung-yi
2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 465 - 472

← 1 2 3 4 5 →