Speech morphing by gradually changing spectrum parameter and fundamental frequency

被引：0

作者：

Abe, M

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper proposes a new application of speech modification called ''speech morphing''. In image processing, morphing is a well known technique that gradually changes one person's face to that of someone else. Speech morphing produces duces similar results for speech; i.e., one person's speech is gradually changed to that of someone else, Speech morphing makes it possible to create movies or multi-media entertainment together with image morphing. The proposed algorithm pitch-synchronously modifies fundamental frequency (F-0) and DFT spectrum and outputs high quality speech. To clarify the balance of F-0 modification and spectrum modification, listening tests were carried out using 20 male speakers. The results yielded the relationship between the amount of modification and speaker identity. In terms of overall performance, listening tests show that the proposed algorithm successfully generates smooth, high quality voice changes.

引用

页码：2235 / 2238

页数：4

共 50 条

[1] Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency
Kameoka, Hirokazu
Ono, Nobutaka
Sagayama, Shigeki
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1507 - 1516
[2] Pre-processing of fundamental frequency contours of speech for automatic parameter extraction
Fujisaki, H
Narusawa, S
Maruno, M
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 722 - 725
[3] Automatic parameter extraction of fundamental frequency contours of speech based on a generative model
Fujisaki, H
Ohno, S
Tomita, O
ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 729 - 732
[4] Extraction of fundamental frequency of speech based on exponentiated band-limited spectrum
Takagi, H
Shimamura, T
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4165 - 4165
[5] PERCEPTION OF CHANGING FUNDAMENTAL FREQUENCY
LEHISTE, I
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 49 (01): : 86 - &
[6] Speech Segregation in Active Middle Ear Stimulation: Masking Release With Changing Fundamental Frequency
Auinger, Alice Barbara
Liepins, Rudolfs
Kaider, Alexandra
Vyskocil, Erich
Riss, Dominik
Arnoldner, Christoph
EAR AND HEARING, 2021, 42 (03): : 709 - 717
[7] Adaptation of Prosody in Speech Synthesis by Changing Command Values of the Generation Process Model of Fundamental Frequency
Hirose, Keikichi
Ochi, Keiko
Mihara, Ryusuke
Hashimoto, Hiroya
Saito, Daisuke
Minematsu, Nobuaki
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2804 - +
[8] FUNDAMENTAL FREQUENCY OF VOICE IN CONTINUOUS SPEECH
KITZING, P
RUNDQVIST, HE
FOLIA PHONIATRICA, 1976, 28 (4-5): : 253 - 253
[9] FUNDAMENTAL FREQUENCY IN SPEECH OF INFANTS AND CHILDREN
KEATING, P
BUHR, R
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 (02): : 567 - 571
[10] Fundamental frequency variation in parkinsonian speech
Visser, W.
Schlegel, U.
Skodda, S. K.
MOVEMENT DISORDERS, 2007, 22 : S78 - S78

← 1 2 3 4 5 →