Generating fundamental frequency contours for speech synthesis in Yoruba

被引:0
|
作者
van Niekerk, Daniel R. [1 ]
Barnard, Etienne [2 ]
机构
[1] North West Univ, Ctr Text Technol, Potchefstroom, South Africa
[2] North West Univ, Multilingual Speech Technol, Vanderbijlpark, South Africa
关键词
speech synthesis; text-to-speech; fundamental frequency; tone language; under-resourced; Yoruba; HTS; TONE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present methods for modelling and synthesising fundamental frequency (F-0) contours suitable for application in text to-speech (TTS) synthesis of Yoffiba (an African tone language). These methods are discussed and compared with a baseline approach using the HMM-based speech synthesis system HTS. Evaluation is done by comparing ten-fold cross validation squared errors on a small corpus of four speakers. We show that the proposed methods are relatively effective at modelling and generating F-0 contours in this context, achieving lower error rates than the baseline. These results suggest that our methods will be useful for the generation of improved synthesis of tone in African languages, which has been a challenge to date.
引用
收藏
页码:1026 / 1030
页数:5
相关论文
共 50 条
  • [31] EFFECT ON FUNDAMENTAL FREQUENCY CONTOURS OF MODALITY OPERATORS
    ALLEN, J
    OSHAUGHNESSY, D
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 57 : S2 - S2
  • [32] Synthesis of fundamental frequency contours for standard chinese based on superpositional and tone nucleus models
    Hirose, Keikichi
    Sun, Qinghua
    Minematsu, Nobuaki
    ARCHIVES OF ACOUSTICS, 2007, 32 (01) : 41 - 50
  • [33] A transversal study of fundamental frequency contours in parkinsonian voices
    Rodriguez-Perez, Pablo
    Fraile, Ruben
    Garcia-Escrig, Miguel
    Saenz-Lechon, Nicolas
    Gutierrez-Arriola, Juana M.
    Osma-Ruiz, Victor
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2019, 51 : 374 - 381
  • [34] QUANTITATIVE DESCRIPTION AND DIFFERENTIATION OF FUNDAMENTAL-FREQUENCY CONTOURS
    MOORE, CA
    COHN, JF
    KATZ, GS
    COMPUTER SPEECH AND LANGUAGE, 1994, 8 (04): : 385 - 404
  • [35] PERCEPTUAL CHARACTERIZATION OF FUNDAMENTAL FREQUENCY CONTOURS OF DISYLLABIC WORDS
    CAMPBELL, HW
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 58 : S57 - S57
  • [36] DIFFERENCE LIMENS FOR FUNDAMENTAL-FREQUENCY CONTOURS IN SENTENCES
    HARRIS, MS
    UMEDA, N
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 (04): : 1139 - 1145
  • [37] MUSICAL INTERVALS IN FUNDAMENTAL FREQUENCY CONTOURS IN ENGLISH INTONATION
    LEVINE, A
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S20 - S20
  • [38] Use of poisson processes to generate fundamental frequency contours
    Ni, Jinfu
    Nakamura, Satoshi
    2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3, 2007, : 825 - 828
  • [39] FUNDAMENTAL-FREQUENCY CONTOURS OF AUXILIARY PHRASES IN ENGLISH
    ALLEN, J
    OSHAUGHN.D
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 56 : S32 - S32
  • [40] Fundamental frequency modeling for speech synthesis based on a statistical learning technique
    Sakai, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 489 - 495