Study on the consistency analysis between the prosody and the spectrum for Mandarin speech

Cited by: 1
Authors
Yeh, Cheng-Yu [1]
Chen, Kuan-Lin [2]
Hwang, Shaw-Hwa [2]
Yan, Long-Jhe [2]
Affiliations
[1] Natl Chin Yi Univ Technol, Dept Elect Engn, Taichung 41170, Taiwan
[2] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 10608, Taiwan
Keywords
INFORMATION; CONVERSION; ALGORITHM; SYSTEM;
DOI
10.1049/iet-spr.2012.0099
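Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract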
This work presents a consistency analysis between the prosody and the spectrum of Mandarin speech. Based on an inspection of the human pronunciation process, the consistency can be interpreted as a closely correlated warping curve between the spectrum and the prosody within a syllable. The analysis proceeds in three steps. First, a hidden Markov model (HMM) is used to decode the HMM-state sequence within a syllable and, at the same time, to divide it into three segments. Second, for a designated syllable, vector quantisation (VQ) with the Linde-Buzo-Gray algorithm is used to train a VQ codebook for each segment. Third, the prosodic vector of each segment is encoded as an index by the VQ codebooks, and the probability of each possible path is then evaluated as a prerequisite for analysing the consistency. Experiments demonstrate that consistency is clearly obtained when the syllable is located in exactly the same word. These results suggest a research direction: the warping process between the spectrum and the prosody within a syllable must be considered in a text-to-speech system to improve speech quality.
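The second and third steps of the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that the HMM segmentation has already produced, for every token of a designated syllable, one prosodic vector per segment, and that a path's probability is estimated as the relative frequency of its index triple. Neither detail is stated in the abstract. The function names, the codebook size, and the random stand-in data in the usage note are hypothetical.

```python
import numpy as np
from collections import Counter


def encode(vectors, codebook):
    """Return the index of the nearest codeword for every vector."""
    # Squared Euclidean distance between each vector and each codeword.
    dist = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return dist.argmin(axis=1)


def lbg_codebook(vectors, size, eps=0.01, n_iter=20):
    """Train a VQ codebook by LBG splitting (size: a power of two)."""
    codebook = vectors.mean(axis=0, keepdims=True)
    while len(codebook) < size:
        # Split every codeword into a slightly perturbed pair.
        codebook = np.vstack([codebook + eps, codebook - eps])
        # Refine the doubled codebook with nearest-neighbour updates.
        for _ in range(n_iter):
            idx = encode(vectors, codebook)
            for k in range(len(codebook)):
                members = vectors[idx == k]
                if len(members):  # leave empty cells unchanged
                    codebook[k] = members.mean(axis=0)
    return codebook


def path_probabilities(segments, codebooks):
    """Estimate the probability of each path of VQ indices.

    segments: one array of shape (n_tokens, dim) per segment, where
    row t of every array belongs to the same syllable token
    (assumed layout).
    """
    indices = [encode(seg, cb) for seg, cb in zip(segments, codebooks)]
    # One path per token: the tuple of its indices across the segments.
    paths = list(zip(*(i.tolist() for i in indices)))
    counts = Counter(paths)
    return {path: n / len(paths) for path, n in counts.items()}
```

As a usage example with random stand-in data, `segments = [np.random.default_rng(0).normal(size=(40, 2)) for _ in range(3)]` followed by `codebooks = [lbg_codebook(s, 4) for s in segments]` and `path_probabilities(segments, codebooks)` yields a dictionary that maps each observed index triple to its relative frequency. Under the stated assumption, a distribution concentrated on a few paths would indicate strong consistency.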
Pages: 158-165
Number of pages: 8
Related papers
50 in total
  • [1] Study on the consistency analysis between the prosody and the spectrum for Mandarin speech
    Department of Electrical Engineering, National Chin-Yi University of Technology, 57, Sec. 2, Zhongshan Road, Taiping Dist., Taichung 41170, Taiwan
    Unknown, 10608, Taiwan
    IET Signal Proc., 2 (158-165)
  • [2] Consistency analysis of the spectrum and prosody within a syllable for Mandarin speech
    Chen, Kuan-Lin
    Yeh, Cheng-Yu
    Hwang, Shaw-Hwa
    Yan, Long-Jhe
    MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2013, 36 (14) : 1851 - 1861
  • [3] A study on the consistency analysis of energy parameter for Mandarin speech
    Shen, Li-Te
    Yeh, Cheng-Yu
    Hwang, Shaw-Hwa
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012,
  • [4] A study on the consistency analysis of energy parameter for Mandarin speech
    Li-Te Shen
    Cheng-Yu Yeh
    Shaw-Hwa Hwang
    EURASIP Journal on Audio, Speech, and Music Processing, 2012
  • [5] Prosody for Mandarin Speech Recognition: a Comparative Study of Read and Spontaneous Speech
    Yeung, Yu Ting
    Qian, Yao
    Lee, Tan
    Soong, Frank K.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1133 - +
  • [6] Prosody Dependent Mandarin Speech Recognition
    Ni, Chong-Jia
    Liu, Wen-Ju
    Xu, Bo
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 197 - 201
  • [7] PROSODY MODELING FOR MANDARIN EXCLAMATORY SPEECH
    Jia, Huibin
    Tao, Jianhua
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 890 - 893
  • [8] Evaluating Prosody of Mandarin Speech for Language Learning
    Dong, Minghui
    Li, Haizhou
    Nwe, Tin Lay
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1986 - 1989
  • [9] An Automatic Prosody Labeling Method for Mandarin Speech
    Chiang, Chen-Yu
    Yu, Hsiu-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 725 - +
  • [10] Hierarchical prosody modeling for Mandarin spontaneous speech
    Lin, Cheng-Hsien
    You, Chung-Long
    Chiang, Chen-Yu
    Wang, Yih-Ru
    Chen, Sin-Horng
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (04): : 2576 - 2596