Multi-level Exemplar-Based Duration Generation for Expressive Speech Synthesis

被引：0

作者：

Abou-Zleikha, Mohamed ^{[1
]}

Szekely, Eva ^{[1
]}

Cahill, Peter ^{[1
]}

Carson-Berndsen, Julie ^{[1
]}

机构：

[1] Univ Coll Dublin, Sch Informat & Comp Sci, CNGL, Dublin 2, Ireland

来源：

PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II | 2012年

关键词：

speech prosody; duration generation; exemplar-based model;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The generation of duration of speech units from linguistic information, as one component of a prosody model, is considered to be a requirement for natural sounding speech synthesis. This paper investigates the use of a multi-level exemplar-based model for duration generation for the purposes of expressive speech synthesis. The multi-level exemplar-based model has been proposed in the literature as a cognitive model for the production of duration. The implementation of this model for duration generation for speech synthesis is not straight forward and requires a set of modifications to the model and that the linguistically related units and the context of the target units should be taken into consideration. The work presented in this paper implements this model and presents a solution to these issues through the use of prosodic-syntactic correlated data, full context information of the input example and corpus exemplars.

引用

页码：59 / 62

页数：4

共 50 条

[21] Fast Spatially Controllable Multi-dimensional Exemplar-Based Texture Synthesis and Morphing
Manke, Felix
Wunsche, Burkhard
COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS: THEORY AND APPLICATIONS, 2010, 68 : 21 - 34
[22] Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition
Gemmeke, Jort F.
Virtanen, Tuomas
Hurmalainen, Antti
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 2067 - 2080
[23] FCN based preprocessing for exemplar-based face sketch synthesis
Lu, Dan
Chen, Zhenxue
Wu, Q. M. Jonathan
Zhang, Xuetao
NEUROCOMPUTING, 2019, 365 : 113 - 124
[24] Expressive Multi-level Modeling for the Semantic Web
Brasileiro, Freddy
Almeida, Joao Paulo A.
Carvalho, Victorio A.
Guizzardi, Giancarlo
SEMANTIC WEB - ISWC 2016, PT I, 2016, 9981 : 53 - 69
[25] Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion
Gia-Nhu Nguyen
Trung-Nghia Phung
EURASIP Journal on Audio, Speech, and Music Processing, 2017
[26] Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion
Gia-Nhu Nguyen
Trung-Nghia Phung
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2017,
[27] An Exemplar-based CRF for Multi-instance Object Segmentation
He, Xuming
Gould, Stephen
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 296 - 303
[28] Exemplar-based Pattern Synthesis with Implicit Periodic Field Network
Chen, Haiwei
Liu, Jiayi
Chen, Weikai
Liu, Shichen
Zhao, Yajie
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3698 - 3707
[29] Multi-level Prosody and Spectrum Conversion for Emotional Speech Synthesis
Wang, Zexun
Yu, Yibiao
2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 588 - 593
[30] Image restoration with morphological erosion and exemplar-based texture synthesis
Guo, Hao
An, Jubai
2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,

← 1 2 3 4 5 →