IMPROVED TONE MODELING BY EXPLOITING ARTICULATORY FEATURES FOR MANDARIN SPEECH RECOGNITION

被引:0
|
作者
Chao, Hao [1 ]
Yang, Zhanlei [1 ]
Liu, Wenju [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
关键词
tone modeling; Mandarin; speech recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
For the same tone pattern, different articulatory characteristics may make the pitch contour change. This paper applies articulatory features, which represent the articulatory information, as well as prosodic features to the tone modeling. Three kinds of tone models are trained to verify the effectiveness of articulatory features. Tone recognition experiments indicate significant improvement can be achieved when using both articulatory features and prosodic features. After the first pass search of a speech recognition system, tone models using new tonal features are employed to rescoring the N-best hypotheses, and a 6.5% relative reduction of character error rate is achieved.
引用
收藏
页码:4741 / 4744
页数:4
相关论文
共 50 条
  • [1] Improved Tone Modeling for Mandarin Broadcast News Speech Recognition
    Lei, Xin
    Siu, Manhung
    Hwang, Mei-Yuh
    Ostendorf, Mari
    Lee, Tan
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1237 - +
  • [2] Tone Modeling for Continuous Mandarin Speech Recognition
    Cao, Yang
    Zhang, Shuwu
    Huang, Taiyi
    Xu, Bo
    International Journal of Speech Technology, 2004, 7 (2-3) : 115 - 128
  • [3] Pitch tracking and tone features for Mandarin speech recognition
    Huang, HCH
    Seide, F
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1523 - 1526
  • [4] Tone articulation modeling for mandarin spontaneous speech recognition
    Zhou, JL
    Tian, Y
    Shi, Y
    Huang, C
    Chang, E
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 997 - 1000
  • [5] Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features
    Lin, Ju
    Xie, Yanlu
    Gao, Yingming
    Zhang, Jinsong
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [6] Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features
    Zhan, Qingran
    Motlicek, Petr
    Du, Shixuan
    Shan, Yahui
    Ma, Sifan
    Xie, Xiang
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1912 - 1916
  • [7] MAXIMUM ENTROPY BASED TONE MODELING FOR MANDARIN SPEECH RECOGNITION
    Wang, Xinhao
    Yu, Yansuo
    Wu, Xihong
    Chi, Huisheng
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4850 - 4853
  • [8] WORD-LEVEL TONE MODELING FOR MANDARIN SPEECH RECOGNITION
    Lei, Xin
    Ostendorf, Mari
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 665 - +
  • [9] Improved mandarin speech recognition by lattice rescoring with enhanced tone models
    Wang, Huanliang
    Qian, Yao
    Soong, Frank
    Zhou, Jian-Lai
    Han, Jiqing
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 445 - +
  • [10] Integration of Articulatory Knowledge and Voicing Features Based on DNN/HMM for Mandarin Speech Recognition
    Tan, Ying-Wei
    Liu, Wen-Ju
    Jiang, Wei
    Zheng, Hao
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,