Multi-level Exemplar-Based Duration Generation for Expressive Speech Synthesis

被引：0

作者：

Abou-Zleikha, Mohamed ^{[1
]}

Szekely, Eva ^{[1
]}

Cahill, Peter ^{[1
]}

Carson-Berndsen, Julie ^{[1
]}

机构：

[1] Univ Coll Dublin, Sch Informat & Comp Sci, CNGL, Dublin 2, Ireland

来源：

PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II | 2012年

关键词：

speech prosody; duration generation; exemplar-based model;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The generation of duration of speech units from linguistic information, as one component of a prosody model, is considered to be a requirement for natural sounding speech synthesis. This paper investigates the use of a multi-level exemplar-based model for duration generation for the purposes of expressive speech synthesis. The multi-level exemplar-based model has been proposed in the literature as a cognitive model for the production of duration. The implementation of this model for duration generation for speech synthesis is not straight forward and requires a set of modifications to the model and that the linguistically related units and the context of the target units should be taken into consideration. The work presented in this paper implements this model and presents a solution to these issues through the use of prosodic-syntactic correlated data, full context information of the input example and corpus exemplars.

引用

页码：59 / 62

页数：4

共 50 条

[31] Exemplar-Based Texture Synthesis: the Efros-Leung Algorithm
Aguerrebere, Cecilia
Gousseau, Yann
Tartavel, Guillaume
IMAGE PROCESSING ON LINE, 2013, 3 : 223 - 241
[32] Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
Xie Sun
Yunxin Zhao
EURASIP Journal on Audio, Speech, and Music Processing, 2014
[33] Real-Time Exemplar-Based Face Sketch Synthesis
Song, Yibing
Bao, Linchao
Yang, Qingxiong
Yang, Ming-Hsuan
COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 800 - 813
[34] Exemplar-Based Sparse Representations for Detection of Parkinson's Disease From Speech
Reddy, Mittapalle Kiran
Alku, Paavo
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1386 - 1396
[35] Understanding and Improving the Exemplar-based Generation for Open-domain Conversation
Han, Seungju
Kim, Beomsu
Seo, Seokjun
Erdenee, Enkhbayar
Chang, Buru
PROCEEDINGS OF THE 4TH WORKSHOP ON NLP FOR CONVERSATIONAL AI, 2022, : 218 - 230
[36] Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition
Dighe, Pranay
Asaei, Afsaneh
Bourlard, Herve
SPEECH COMMUNICATION, 2016, 76 : 230 - 244
[37] Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
Sun, Xie
Zhao, Yunxin
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014,
[38] SUPERVISED SPEECH DEREVERBERATION IN NOISY ENVIRONMENTS USING EXEMPLAR-BASED SPARSE REPRESENTATIONS
Baby, Deepak
Van Hamme, Hugo
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 156 - 160
[39] Development of Multi-Level Speech based Person Authentication System
Rohan Kumar Das
Sarfaraz Jelil
S. R. Mahadeva Prasanna
Journal of Signal Processing Systems, 2017, 88 : 259 - 271
[40] Development of Multi-Level Speech based Person Authentication System
Das, Rohan Kumar
Jelil, Sarfaraz
Prasanna, S. R. Mahadeva
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2017, 88 (03): : 259 - 271

← 1 2 3 4 5 →