Multi-level Exemplar-Based Duration Generation for Expressive Speech Synthesis

被引:0
|
作者
Abou-Zleikha, Mohamed [1 ]
Szekely, Eva [1 ]
Cahill, Peter [1 ]
Carson-Berndsen, Julie [1 ]
机构
[1] Univ Coll Dublin, Sch Informat & Comp Sci, CNGL, Dublin 2, Ireland
关键词
speech prosody; duration generation; exemplar-based model;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The generation of duration of speech units from linguistic information, as one component of a prosody model, is considered to be a requirement for natural sounding speech synthesis. This paper investigates the use of a multi-level exemplar-based model for duration generation for the purposes of expressive speech synthesis. The multi-level exemplar-based model has been proposed in the literature as a cognitive model for the production of duration. The implementation of this model for duration generation for speech synthesis is not straight forward and requires a set of modifications to the model and that the linguistically related units and the context of the target units should be taken into consideration. The work presented in this paper implements this model and presents a solution to these issues through the use of prosodic-syntactic correlated data, full context information of the input example and corpus exemplars.
引用
收藏
页码:59 / 62
页数:4
相关论文
共 50 条
  • [21] Fast Spatially Controllable Multi-dimensional Exemplar-Based Texture Synthesis and Morphing
    Manke, Felix
    Wunsche, Burkhard
    COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS: THEORY AND APPLICATIONS, 2010, 68 : 21 - 34
  • [22] Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition
    Gemmeke, Jort F.
    Virtanen, Tuomas
    Hurmalainen, Antti
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 2067 - 2080
  • [23] FCN based preprocessing for exemplar-based face sketch synthesis
    Lu, Dan
    Chen, Zhenxue
    Wu, Q. M. Jonathan
    Zhang, Xuetao
    NEUROCOMPUTING, 2019, 365 : 113 - 124
  • [24] Expressive Multi-level Modeling for the Semantic Web
    Brasileiro, Freddy
    Almeida, Joao Paulo A.
    Carvalho, Victorio A.
    Guizzardi, Giancarlo
    SEMANTIC WEB - ISWC 2016, PT I, 2016, 9981 : 53 - 69
  • [25] Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion
    Gia-Nhu Nguyen
    Trung-Nghia Phung
    EURASIP Journal on Audio, Speech, and Music Processing, 2017
  • [26] Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion
    Gia-Nhu Nguyen
    Trung-Nghia Phung
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2017,
  • [27] An Exemplar-based CRF for Multi-instance Object Segmentation
    He, Xuming
    Gould, Stephen
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 296 - 303
  • [28] Exemplar-based Pattern Synthesis with Implicit Periodic Field Network
    Chen, Haiwei
    Liu, Jiayi
    Chen, Weikai
    Liu, Shichen
    Zhao, Yajie
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3698 - 3707
  • [29] Multi-level Prosody and Spectrum Conversion for Emotional Speech Synthesis
    Wang, Zexun
    Yu, Yibiao
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 588 - 593
  • [30] Image restoration with morphological erosion and exemplar-based texture synthesis
    Guo, Hao
    An, Jubai
    2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,