Generating fundamental frequency contours for speech synthesis in Yoruba

被引：0

作者：

van Niekerk, Daniel R. ^{[1
]}

Barnard, Etienne ^{[2
]}

机构：

[1] North West Univ, Ctr Text Technol, Potchefstroom, South Africa

[2] North West Univ, Multilingual Speech Technol, Vanderbijlpark, South Africa

来源：

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年

关键词：

speech synthesis; text-to-speech; fundamental frequency; tone language; under-resourced; Yoruba; HTS; TONE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present methods for modelling and synthesising fundamental frequency (F-0) contours suitable for application in text to-speech (TTS) synthesis of Yoffiba (an African tone language). These methods are discussed and compared with a baseline approach using the HMM-based speech synthesis system HTS. Evaluation is done by comparing ten-fold cross validation squared errors on a small corpus of four speakers. We show that the proposed methods are relatively effective at modelling and generating F-0 contours in this context, achieving lower error rates than the baseline. These results suggest that our methods will be useful for the generation of improved synthesis of tone in African languages, which has been a challenge to date.

引用

页码：1026 / 1030

页数：5

共 50 条

[41] Analysis and synthesis of fundamental frequency contours of Standard Chinese using the command-response model
Fujisaki, H
Wang, CF
Ohno, S
Gu, WT
SPEECH COMMUNICATION, 2005, 47 (1-2) : 59 - 70
[42] MEASURING OF THE CONTOURS OF INTENSITY AND FUNDAMENTAL PERIOD OF SPEECH FOR AUTOMATIC SPEAKER RECOGNITION
NEY, H
FREQUENZ, 1981, 35 (10) : 265 - 270
[43] FUNDAMENTAL FREQUENCY OF VOICE IN CONTINUOUS SPEECH
KITZING, P
RUNDQVIST, HE
FOLIA PHONIATRICA, 1976, 28 (4-5): : 253 - 253
[44] FUNDAMENTAL FREQUENCY IN SPEECH OF INFANTS AND CHILDREN
KEATING, P
BUHR, R
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 (02): : 567 - 571
[45] Fundamental frequency variation in parkinsonian speech
Visser, W.
Schlegel, U.
Skodda, S. K.
MOVEMENT DISORDERS, 2007, 22 : S78 - S78
[46] PERCEPTION OF FUNDAMENTAL FREQUENCY OF CONTINUOUS SPEECH
BRANDT, JF
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (02): : 437 - 437
[47] A preliminary study on the modeling of fundamental frequency contours of Thai utterances
Fujisaki, H
Ohno, S
2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 516 - 519
[48] An Optimization of Fundamental Frequency and Length of Syllables for Rule-Based Speech Synthesis
Win, Kyawt Yin
Takara, Tomio
FUTURE GENERATION INFORMATION TECHNOLOGY, 2010, 6485 : 114 - 124
[49] CHANGES IN STUTTERERS FUNDAMENTAL-FREQUENCY CONTOURS FOLLOWING THERAPY
SACCO, PR
METZ, DE
JOURNAL OF FLUENCY DISORDERS, 1987, 12 (01) : 1 - 8
[50] Design of a Yoruba Language Speech Corpus for the Purposes of Text-to-Speech (TTS) Synthesis
Dagba, Theophile K.
Aoga, John O. R.
Fanou, Codjo C.
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT I, 2016, 9621 : 161 - 169

← 1 2 3 4 5 →