Multi-level Exemplar-Based Duration Generation for Expressive Speech Synthesis

被引:0
|
作者
Abou-Zleikha, Mohamed [1 ]
Szekely, Eva [1 ]
Cahill, Peter [1 ]
Carson-Berndsen, Julie [1 ]
机构
[1] Univ Coll Dublin, Sch Informat & Comp Sci, CNGL, Dublin 2, Ireland
关键词
speech prosody; duration generation; exemplar-based model;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The generation of duration of speech units from linguistic information, as one component of a prosody model, is considered to be a requirement for natural sounding speech synthesis. This paper investigates the use of a multi-level exemplar-based model for duration generation for the purposes of expressive speech synthesis. The multi-level exemplar-based model has been proposed in the literature as a cognitive model for the production of duration. The implementation of this model for duration generation for speech synthesis is not straight forward and requires a set of modifications to the model and that the linguistically related units and the context of the target units should be taken into consideration. The work presented in this paper implements this model and presents a solution to these issues through the use of prosodic-syntactic correlated data, full context information of the input example and corpus exemplars.
引用
收藏
页码:59 / 62
页数:4
相关论文
共 50 条
  • [31] Exemplar-Based Texture Synthesis: the Efros-Leung Algorithm
    Aguerrebere, Cecilia
    Gousseau, Yann
    Tartavel, Guillaume
    IMAGE PROCESSING ON LINE, 2013, 3 : 223 - 241
  • [32] Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
    Xie Sun
    Yunxin Zhao
    EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [33] Real-Time Exemplar-Based Face Sketch Synthesis
    Song, Yibing
    Bao, Linchao
    Yang, Qingxiong
    Yang, Ming-Hsuan
    COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 800 - 813
  • [34] Exemplar-Based Sparse Representations for Detection of Parkinson's Disease From Speech
    Reddy, Mittapalle Kiran
    Alku, Paavo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1386 - 1396
  • [35] Understanding and Improving the Exemplar-based Generation for Open-domain Conversation
    Han, Seungju
    Kim, Beomsu
    Seo, Seokjun
    Erdenee, Enkhbayar
    Chang, Buru
    PROCEEDINGS OF THE 4TH WORKSHOP ON NLP FOR CONVERSATIONAL AI, 2022, : 218 - 230
  • [36] Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition
    Dighe, Pranay
    Asaei, Afsaneh
    Bourlard, Herve
    SPEECH COMMUNICATION, 2016, 76 : 230 - 244
  • [37] Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
    Sun, Xie
    Zhao, Yunxin
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014,
  • [38] SUPERVISED SPEECH DEREVERBERATION IN NOISY ENVIRONMENTS USING EXEMPLAR-BASED SPARSE REPRESENTATIONS
    Baby, Deepak
    Van Hamme, Hugo
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 156 - 160
  • [39] Development of Multi-Level Speech based Person Authentication System
    Rohan Kumar Das
    Sarfaraz Jelil
    S. R. Mahadeva Prasanna
    Journal of Signal Processing Systems, 2017, 88 : 259 - 271
  • [40] Development of Multi-Level Speech based Person Authentication System
    Das, Rohan Kumar
    Jelil, Sarfaraz
    Prasanna, S. R. Mahadeva
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2017, 88 (03): : 259 - 271