Study on the consistency analysis between the prosody and the spectrum for Mandarin speech

Cited by: 1
Authors
Yeh, Cheng-Yu [1]
Chen, Kuan-Lin [2]
Hwang, Shaw-Hwa [2]
Yan, Long-Jhe [2]
Affiliations
[1] Natl Chin Yi Univ Technol, Dept Elect Engn, Taichung 41170, Taiwan
[2] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 10608, Taiwan
Keywords
INFORMATION; CONVERSION; ALGORITHM; SYSTEM;
DOI
10.1049/iet-spr.2012.0099
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic and Communication Technology];
Discipline classification codes
0808 ; 0809 ;
Abstract
This work presents a consistency analysis between the prosody and the spectrum of Mandarin speech. Based on an inspection of the human pronunciation process, the consistency is interpreted as a close correlation, expressed as a warping curve, between the spectrum and the prosody within a syllable. The analysis proceeds in three steps. First, a hidden Markov model (HMM) is used to decode the HMM-state sequence within a syllable and to divide it into three segments. Second, for a designated syllable, vector quantisation (VQ) with the Linde-Buzo-Gray algorithm is used to train a VQ codebook for each segment. Third, the prosodic vector of each segment is encoded as an index by the VQ codebooks, and the probability of each possible path is evaluated as a prerequisite for analysing the consistency. Experiments demonstrate that consistency is clearly obtained when the syllable is located in exactly the same word. These results point to a research direction: the warping process between the spectrum and the prosody within a syllable must be considered in a text-to-speech system to improve speech quality.
Pages: 158-165
Page count: 8
Related papers
50 in total
  • [31] High-Quality Prosody Generation in Mandarin Text-to-Speech System
    Guo, Qing
    Zhang, Jie
    Katae, Nobuyuki
    Yu, Hao
    FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2010, 46 (01) : 40 - 46
  • [32] An Investigation on the Mandarin Prosody of a Parallel Multi-Speaking Rate Speech Corpus
    Chiang, Chen-Yu
    Tang, Cheng-Chang
    Yu, Hsiu-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2009, : 148 - +
  • [33] A parametric prosody coding approach for Mandarin speech using a hierarchical prosodic model
    Chen-Yu Chiang
    EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [34] Perceptual and acoustic analysis of prosody in Mandarin Chinese refusals
    Hao, Yen-Chen
    Su, Yunwen
    Chang, Yufen
    JOURNAL OF PRAGMATICS, 2024, 233 : 3 - 20
  • [35] Can Natural Speech Prosody Distinguish Autism Spectrum Disorders? A Meta-Analysis
    Ma, Wen
    Xu, Lele
    Zhang, Hao
    Zhang, Shurui
    BEHAVIORAL SCIENCES, 2024, 14 (02)
  • [36] Automatic Analysis of Speech Prosody in Dutch
    Hu, Na
    Janssen, Berit
    Hanssen, Judith
    Gussenhoven, Carlos
    Chen, Aoju
    INTERSPEECH 2020, 2020, : 155 - 159
  • [37] A statistical model with hierarchical structure for predicting prosody in a Mandarin text-to-speech system
    Yu, MS
    Pan, NH
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2005, 28 (03) : 385 - 399
  • [38] CONFLICT BETWEEN PROSODY AND SYNTAX IN RESTRAINED SPEECH
    BUTTET, J
    ASSAL, G
    REVUE NEUROLOGIQUE, 1980, 136 (10) : 665 - 667
  • [39] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
    Liu, Rui
    Xi, Jiatian
    Jiang, Ziyue
    Li, Haizhou
    INTERSPEECH 2024, 2024, : 3435 - 3439
  • [40] The relation between musical abilities and speech prosody perception: A meta-analysis
    Jansen, Nelleke
    Harding, Eleanor E.
    Loerts, Hanneke
    Baskent, Deniz
    Lowie, Wander
    JOURNAL OF PHONETICS, 2023, 101