Multi-level Exemplar-Based Duration Generation for Expressive Speech Synthesis

被引:0
|
作者
Abou-Zleikha, Mohamed [1 ]
Szekely, Eva [1 ]
Cahill, Peter [1 ]
Carson-Berndsen, Julie [1 ]
机构
[1] Univ Coll Dublin, Sch Informat & Comp Sci, CNGL, Dublin 2, Ireland
关键词
speech prosody; duration generation; exemplar-based model;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The generation of duration of speech units from linguistic information, as one component of a prosody model, is considered to be a requirement for natural sounding speech synthesis. This paper investigates the use of a multi-level exemplar-based model for duration generation for the purposes of expressive speech synthesis. The multi-level exemplar-based model has been proposed in the literature as a cognitive model for the production of duration. The implementation of this model for duration generation for speech synthesis is not straight forward and requires a set of modifications to the model and that the linguistically related units and the context of the target units should be taken into consideration. The work presented in this paper implements this model and presents a solution to these issues through the use of prosodic-syntactic correlated data, full context information of the input example and corpus exemplars.
引用
收藏
页码:59 / 62
页数:4
相关论文
共 50 条
  • [41] Estimating uncertainty to improve exemplar-based feature enhancement for noise robust speech recognition
    1600, Institute of Electrical and Electronics Engineers Inc., United States (22):
  • [42] EXEMPLAR-BASED LARGE VOCABULARY SPEECH RECOGNITION USING K-NEAREST NEIGHBORS
    Xu, Yanbo
    Siohan, Olivier
    Simcha, David
    Kumar, Sanjiv
    Liao, Hank
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5167 - 5171
  • [43] Estimating Uncertainty to Improve Exemplar-Based Feature Enhancement for Noise Robust Speech Recognition
    Kallasjoki, Heikki
    Gemmeke, Jort F.
    Palomaki, Kalle J.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 368 - 380
  • [44] EXEMPLAR-BASED PITCH CONTOUR GENERATION USING DOP FOR SYNTACTIC TREE DECOMPOSITION
    Abou-Zleikha, Mohamed
    Cahill, Peter
    Carson-Berndsen, Julie
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4585 - 4588
  • [45] EXEMPLAR-BASED NOISE ROBUST AUTOMATIC SPEECH RECOGNITION USING MODULATION SPECTROGRAM FEATURES
    Baby, Deepak
    Virtanen, Tuomas
    Gemmeke, Jort F.
    Barker, Tom
    Van Hamme, Hugo
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 519 - 524
  • [46] Multi-class Semantic Video Segmentation with Exemplar-based Object Reasoning
    Liu, Buyu
    He, Xuming
    Gould, Stephen
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 1014 - 1021
  • [47] An Exemplar-Based Multi-View Domain Generalization Framework for Visual Recognition
    Niu, Li
    Li, Wen
    Xu, Dong
    Cai, Jianfei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (02) : 259 - 272
  • [48] Vowel Discrimination by English, French and Turkish Speakers: Evidence for an Exemplar-Based Approach to Speech Perception
    Ettlinger, Marc
    Johnson, Keith
    PHONETICA, 2009, 66 (04) : 222 - 242
  • [49] Constructing multi-level speech database for spontaneous speech processing
    Hahn, M
    Kim, S
    Lee, JC
    Lee, YJ
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1930 - 1933
  • [50] INTEGRATING META-INFORMATION INTO EXEMPLAR-BASED SPEECH RECOGNITION WITH SEGMENTAL CONDITIONAL RANDOM FIELDS
    Demuynck, Kris
    Seppi, Dino
    Van Compernolle, Dirk
    Patrick Nguyen
    Zweig, Geoffrey
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5048 - 5051