Generating fundamental frequency contours for speech synthesis in Yoruba

Authors
van Niekerk, Daniel R. [1]
Barnard, Etienne [2]
Affiliations
[1] North West Univ, Ctr Text Technol, Potchefstroom, South Africa
[2] North West Univ, Multilingual Speech Technol, Vanderbijlpark, South Africa
Keywords
speech synthesis; text-to-speech; fundamental frequency; tone language; under-resourced; Yoruba; HTS; TONE
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present methods for modelling and synthesising fundamental frequency (F0) contours suitable for application in text-to-speech (TTS) synthesis of Yoruba (an African tone language). These methods are discussed and compared with a baseline approach using the HMM-based speech synthesis system HTS. Evaluation is done by comparing ten-fold cross-validation squared errors on a small corpus of four speakers. We show that the proposed methods are relatively effective at modelling and generating F0 contours in this context, achieving lower error rates than the baseline. These results suggest that our methods will be useful for improving the synthesis of tone in African languages, which has been a challenge to date.
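The evaluation described in the abstract, ten-fold cross-validation with a squared-error measure, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the helper `k_fold_squared_error`, the toy (tone label, F0 in Hz) pairs, and the two trivial predictors are hypothetical and are not the authors' models, data, or the HTS baseline.

```python
from statistics import mean


def k_fold_squared_error(samples, fit, predict, k=10):
    """Mean squared error over k held-out folds (hypothetical helper)."""
    # Interleaved split: fold i holds samples i, i + k, i + 2k, ...
    folds = [samples[i::k] for i in range(k)]
    errors = []
    for i, held_out in enumerate(folds):
        # Train on every fold except the one currently held out.
        train = [s for j, fold in enumerate(folds) if j != i for s in fold]
        model = fit(train)
        errors.extend((predict(model, x) - y) ** 2 for x, y in held_out)
    return mean(errors)


# Toy data, invented for illustration: (tone label, F0 in Hz) pairs.
data = [("H", 220.0), ("L", 180.0), ("M", 200.0)] * 10


def fit_tone_means(train):
    """Toy model: one mean F0 value per tone label."""
    by_tone = {}
    for tone, f0 in train:
        by_tone.setdefault(tone, []).append(f0)
    return {tone: mean(values) for tone, values in by_tone.items()}


def predict_tone_mean(model, tone):
    return model[tone]


def fit_global_mean(train):
    """Toy reference model: a single mean F0 that ignores tone."""
    return mean(f0 for _, f0 in train)


def predict_global_mean(model, tone):
    return model


mse_tone = k_fold_squared_error(data, fit_tone_means, predict_tone_mean)
mse_global = k_fold_squared_error(data, fit_global_mean, predict_global_mean)
print(f"tone-aware MSE: {mse_tone:.2f}, tone-agnostic MSE: {mse_global:.2f}")
```

Comparing the two printed values mirrors the kind of comparison the abstract reports: the same held-out folds are scored under each model, and the model with the lower squared error is preferred.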
Pages: 1026-1030
Page count: 5