Speech morphing by gradually changing spectrum parameter and fundamental frequency

Cited by: 0
Author
Abe, M
Affiliation
Keywords
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
This paper proposes a new application of speech modification called "speech morphing". In image processing, morphing is a well-known technique that gradually changes one person's face into that of someone else. Speech morphing produces similar results for speech; i.e., one person's speech is gradually changed into that of someone else. Combined with image morphing, speech morphing makes it possible to create movies or multimedia entertainment. The proposed algorithm pitch-synchronously modifies the fundamental frequency (F0) and the DFT spectrum and outputs high-quality speech. To clarify the balance between F0 modification and spectrum modification, listening tests were carried out using 20 male speakers. The results yielded the relationship between the amount of modification and speaker identity. In terms of overall performance, the listening tests show that the proposed algorithm successfully generates smooth, high-quality voice changes.
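The abstract does not state the interpolation rules, so the following is only a minimal sketch of what "gradually changing" F0 and a magnitude spectrum could look like for one analysis frame. The function names (`morph_rates`, `morph_f0`, `morph_spectrum`), the linear ramp of morphing rates, and the log-domain interpolation are all assumptions made for illustration, not the paper's method. Pitch-synchronous analysis and resynthesis are not modeled here.

```python
import math


def morph_rates(n_frames):
    """Hypothetical schedule: linear ramp from 0.0 (speaker A) to 1.0 (speaker B)."""
    if n_frames < 2:
        return [0.0] * n_frames
    return [i / (n_frames - 1) for i in range(n_frames)]


def morph_f0(f0_a, f0_b, rate):
    """Interpolate two F0 values (Hz) on a log scale (an assumed rule)."""
    return math.exp((1.0 - rate) * math.log(f0_a) + rate * math.log(f0_b))


def morph_spectrum(mag_a, mag_b, rate, floor=1e-12):
    """Interpolate two magnitude spectra bin by bin in the log domain.

    `floor` keeps log() defined for zero-valued bins; it is an assumption
    introduced for this sketch.
    """
    if len(mag_a) != len(mag_b):
        raise ValueError("spectra must have the same number of bins")
    return [
        math.exp((1.0 - rate) * math.log(max(a, floor))
                 + rate * math.log(max(b, floor)))
        for a, b in zip(mag_a, mag_b)
    ]


if __name__ == "__main__":
    # Toy values only: F0 moves from 100 Hz to 200 Hz over five frames.
    for rate in morph_rates(5):
        print(rate, morph_f0(100.0, 200.0, rate))
```

Interpolating in the log domain is one common choice because pitch and loudness are perceived roughly logarithmically. A linear-domain interpolation would be an equally valid assumption for a sketch like this.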
Pages: 2235-2238
Number of pages: 4