Diction based prosody modeling in table-to-speech synthesis

Cited by: 0
Authors
Spiliotopoulos, D [1]
Xydas, G [1]
Kouroupetroglou, G [1]
Affiliation
[1] Univ Athens, Dept Informat & Telecommun, GR-10679 Athens, Greece
Source
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2005 / Vol. 3658
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Transferring a structure from the visual modality to the aural one presents a difficult challenge. In this work we experiment with prosody modeling for the synthesized speech representation of tabulated structures. This is achieved by analyzing naturally spoken descriptions of data tables, followed by feedback from blind and sighted users. The derived prosodic phrase accent and pause break placement and values are examined in terms of successfully conveying semantically important visual information through prosody control in Table-to-Speech synthesis. Finally, the quality of the information provision of synthesized tables when utilizing the proposed prosody specification is studied against plain synthesis.
Pages: 294 - 301
Page count: 8
Related papers
50 in total
  • [31] Estimating Mutual Information in Prosody Representation for Emotional Prosody Transfer in Speech Synthesis
    Zhang, Guangyan
    Qiu, Shirong
    Qin, Ying
    Lee, Tan
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [32] Modeling prosody for language identification on read and spontaneous speech
    Rouas, JL
    Farinas, J
    Pellegrino, F
    André-Obrecht, R
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 753 - 756
  • [33] Modeling prosody for language identification on read and spontaneous speech
    Rouas, JL
    Farinas, J
    Pellegrino, F
    André-Obrecht, R
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 40 - 43
  • [34] Articulatory-Functional Modeling of Speech Prosody: A Review
    Xu, Yi
    Prom-on, Santitham
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 46 - +
  • [35] An Innovative Prosody Modeling Method for Chinese Speech Recognition
    Gang Peng
    William S.-Y. Wang
    International Journal of Speech Technology, 2004, 7 (2-3) : 129 - 140
  • [36] Modeling prosody in a German concept-to-speech system
    Alter, K
    Matiasek, J
    Niklfeld, G
    NATURAL LANGUAGE PROCESSING AND SPEECH TECHNOLOGY: RESULTS OF THE 3RD KONVENS CONFERENCE, 1996, : 156 - 165
  • [37] Unsupervised joint prosody labeling and modeling for Mandarin speech
    Chiang, Chen-Yu
    Chen, Sin-Horng
    Yu, Hsiu-Min
    Wang, Yih-Ru
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (02): : 1164 - 1183
  • [38] Novel Eigenpitch-based Prosody Model for Text-to-Speech Synthesis
    Tian, Jilei
    Nurminen, Jani
    Kiss, Imre
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 313 - 316
  • [39] An optimized neural network based prosody model of Chinese speech synthesis system
    Tao, JH
    Cai, LH
    Tropf, H
    2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 477 - 480
  • [40] Mixing HMM-Based Spanish speech synthesis with a CBR for prosody estimation
    Gonzalvo, Xavi
    Iriondo, Ignasi
    Socoro, Joan Claudi
    Alias, Francesc
    Monzo, Carlos
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2007, 4885 : 78 - 85