Diction based prosody modeling in table-to-speech synthesis

被引：0

作者：

Spiliotopoulos, D ^{[1
]}

Xydas, G ^{[1
]}

Kouroupetroglou, G ^{[1
]}

机构：

[1] Univ Athens, Dept Informat & Telecommun, GR-10679 Athens, Greece

来源：

TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2005年 / 3658卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Transferring a structure from the visual modality to the aural one presents a difficult challenge. In this work we are experimenting with prosody modeling for the synthesized speech representation of tabulated structures. This is achieved by analyzing naturally spoken descriptions of data tables and a following feedback by blind and sighted users. The derived prosodic phrase accent and pause break placement and values are examined in terms of successfully conveying semantically important visual information through prosody control in Table-to-Speech synthesis. Finally, the quality of the information provision of synthesized tables when utilizing the proposed prosody specification is studied against plain synthesis.

引用

页码：294 / 301

页数：8

共 50 条

[1] Prosody analysis and modeling for emotional speech synthesis
Jiang, DN
Zhang, W
Shen, LQ
Cai, LH
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 281 - 284
[2] Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
Peng, Yukun
Ling, Zhenhua
INTERSPEECH 2022, 2022, : 4257 - 4261
[3] HIERARCHICAL PROSODY MODELING FOR NON-AUTOREGRESSIVE SPEECH SYNTHESIS
Chien, Chung-Ming
Lee, Hung-yi
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 446 - 453
[4] MEASURING THE EFFECT OF LINGUISTIC RESOURCES ON PROSODY MODELING FOR SPEECH SYNTHESIS
Rosenberg, Andrew
Fernandez, Raul
Ramabhadran, Bhuvana
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5114 - 5118
[5] Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks
Reddy, V. Ramu
Rao, K. Sreenivasa
NEUROCOMPUTING, 2016, 171 : 1323 - 1334
[6] ACCENT GROUP MODELING FOR IMPROVED PROSODY IN STATISTICAL PARAMETERIC SPEECH SYNTHESIS
Anumanchipalli, Gopala Krishna
Oliveira, Luis C.
Black, Alan W.
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6890 - 6894
[7] Study of prosody model on Chinese speech synthesis based on the classification of syllabic prosody features
Tao, Jianhua
Cai, Lianhong
Shengxue Xuebao/Acta Acustica, 2003, 28 (05): : 395 - 402
[8] PROSODY MODELING FOR MANDARIN EXCLAMATORY SPEECH
Jia, Huibin
Tao, Jianhua
ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 890 - 893
[9] Fluent speech prosody: Framework and modeling
Tseng, CY
Pin, SH
Lee, Y
Wang, HM
Chen, YC
SPEECH COMMUNICATION, 2005, 46 (3-4) : 284 - 309
[10] Fluent speech prosody: Framework and modeling
Tseng, Chiu-Yu
Pin, Shao-Huang
Lee, Yehlin
Wang, Hsin-Min
Chen, Yong-Cheng
Speech Commun, 3-4 (284-309):

← 1 2 3 4 5 →