Diction based prosody modeling in table-to-speech synthesis

被引：0

作者：

Spiliotopoulos, D ^{[1
]}

Xydas, G ^{[1
]}

Kouroupetroglou, G ^{[1
]}

机构：

[1] Univ Athens, Dept Informat & Telecommun, GR-10679 Athens, Greece

来源：

TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2005年 / 3658卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Transferring a structure from the visual modality to the aural one presents a difficult challenge. In this work we are experimenting with prosody modeling for the synthesized speech representation of tabulated structures. This is achieved by analyzing naturally spoken descriptions of data tables and a following feedback by blind and sighted users. The derived prosodic phrase accent and pause break placement and values are examined in terms of successfully conveying semantically important visual information through prosody control in Table-to-Speech synthesis. Finally, the quality of the information provision of synthesized tables when utilizing the proposed prosody specification is studied against plain synthesis.

引用

页码：294 / 301

页数：8

共 50 条

[31] Estimating Mutual Information in Prosody Representation for Emotional Prosody Transfer in Speech Synthesis
Zhang, Guangyan
Qiu, Shirong
Qin, Ying
Lee, Tan
2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
[32] Modeling prosody for language identification on read and spontaneous speech
Rouas, JL
Farinas, J
Pellegrino, F
André-Obrecht, R
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 753 - 756
[33] Modeling prosody for language identification on read and spontaneous speech
Rouas, JL
Farinas, J
Pellegrino, F
André-Obrecht, R
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 40 - 43
[34] Articulatory-Functional Modeling of Speech Prosody: A Review
Xu, Yi
Prom-on, Santitham
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 46 - +
[35] An Innovative Prosody Modeling Method for Chinese Speech Recognition
Gang Peng
William S.-Y. Wang
International Journal of Speech Technology, 2004, 7 (2-3) : 129 - 140
[36] Modeling prosody in a German concept-to-speech system
Alter, K
Matiasek, J
Niklfeld, G
NATURAL LANGUAGE PROCESSING AND SPEECH TECHNOLOGY: RESULTS OF THE 3RD KONVENS CONFERENCE, 1996, : 156 - 165
[37] Unsupervised joint prosody labeling and modeling for Mandarin speech
Chiang, Chen-Yu
Chen, Sin-Horng
Yu, Hsiu-Min
Wang, Yih-Ru
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (02): : 1164 - 1183
[38] Novel Eigenpitch-based Prosody Model for Text-to-Speech Synthesis
Tian, Jilei
Nurminen, Jani
Kiss, Imre
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 313 - 316
[39] An optimized neural network based prosody model of Chinese speech synthesis system
Tao, JH
Cai, LH
Tropf, H
2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 477 - 480
[40] Mixing HMM-Based Spanish speech synthesis with a CBR for prosody estimation
Gonzalvo, Xavi
Iriondo, Ignasi
Socoro, Joan Claudi
Alias, Francesc
Monzo, Carlos
ADVANCES IN NONLINEAR SPEECH PROCESSING, 2007, 4885 : 78 - 85

← 1 2 3 4 5 →