Multi-level Exemplar-Based Duration Generation for Expressive Speech Synthesis

Cited by: 0

Authors:

Abou-Zleikha, Mohamed [1]

Szekely, Eva [1]

Cahill, Peter [1]

Carson-Berndsen, Julie [1]

Affiliations:

[1] Univ Coll Dublin, Sch Informat & Comp Sci, CNGL, Dublin 2, Ireland

Source:

PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II | 2012

Keywords:

speech prosody; duration generation; exemplar-based model

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Generating the durations of speech units from linguistic information, as one component of a prosody model, is considered a requirement for natural-sounding speech synthesis. This paper investigates the use of a multi-level exemplar-based model for duration generation in expressive speech synthesis. The multi-level exemplar-based model has been proposed in the literature as a cognitive model of duration production. Implementing this model for duration generation in speech synthesis is not straightforward: it requires a set of modifications to the model, and both the linguistically related units and the context of the target units must be taken into consideration. The work presented in this paper implements the model and addresses these issues through the use of prosodic-syntactic correlated data, full context information of the input example, and corpus exemplars.


Pages: 59-62

Page count: 4
