IMPROVED TONE MODELING BY EXPLOITING ARTICULATORY FEATURES FOR MANDARIN SPEECH RECOGNITION

被引：0

作者：

Chao, Hao ^{[1
]}

Yang, Zhanlei ^{[1
]}

Liu, Wenju ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China

来源：

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012年

关键词：

tone modeling; Mandarin; speech recognition;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

For the same tone pattern, different articulatory characteristics may make the pitch contour change. This paper applies articulatory features, which represent the articulatory information, as well as prosodic features to the tone modeling. Three kinds of tone models are trained to verify the effectiveness of articulatory features. Tone recognition experiments indicate significant improvement can be achieved when using both articulatory features and prosodic features. After the first pass search of a speech recognition system, tone models using new tonal features are employed to rescoring the N-best hypotheses, and a 6.5% relative reduction of character error rate is achieved.

引用

页码：4741 / 4744

页数：4

共 50 条

[1] Improved Tone Modeling for Mandarin Broadcast News Speech Recognition
Lei, Xin
Siu, Manhung
Hwang, Mei-Yuh
Ostendorf, Mari
Lee, Tan
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1237 - +
[2] Tone Modeling for Continuous Mandarin Speech Recognition
Cao, Yang
Zhang, Shuwu
Huang, Taiyi
Xu, Bo
International Journal of Speech Technology, 2004, 7 (2-3) : 115 - 128
[3] Pitch tracking and tone features for Mandarin speech recognition
Huang, HCH
Seide, F
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1523 - 1526
[4] Tone articulation modeling for mandarin spontaneous speech recognition
Zhou, JL
Tian, Y
Shi, Y
Huang, C
Chang, E
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 997 - 1000
[5] Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features
Lin, Ju
Xie, Yanlu
Gao, Yingming
Zhang, Jinsong
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
[6] Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features
Zhan, Qingran
Motlicek, Petr
Du, Shixuan
Shan, Yahui
Ma, Sifan
Xie, Xiang
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1912 - 1916
[7] MAXIMUM ENTROPY BASED TONE MODELING FOR MANDARIN SPEECH RECOGNITION
Wang, Xinhao
Yu, Yansuo
Wu, Xihong
Chi, Huisheng
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4850 - 4853
[8] WORD-LEVEL TONE MODELING FOR MANDARIN SPEECH RECOGNITION
Lei, Xin
Ostendorf, Mari
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 665 - +
[9] Improved mandarin speech recognition by lattice rescoring with enhanced tone models
Wang, Huanliang
Qian, Yao
Soong, Frank
Zhou, Jian-Lai
Han, Jiqing
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 445 - +
[10] Integration of Articulatory Knowledge and Voicing Features Based on DNN/HMM for Mandarin Speech Recognition
Tan, Ying-Wei
Liu, Wen-Ju
Jiang, Wei
Zheng, Hao
2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,

← 1 2 3 4 5 →