Diction based prosody modeling in table-to-speech synthesis

Cited by: 0
Authors
Spiliotopoulos, D [1]
Xydas, G [1]
Kouroupetroglou, G [1]
Affiliations
[1] Univ Athens, Dept Informat & Telecommun, GR-10679 Athens, Greece
Source
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Transferring a structure from the visual modality to the aural one is a difficult challenge. In this work we experiment with prosody modeling for the synthesized speech representation of tabulated structures. The model is derived by analyzing naturally spoken descriptions of data tables, followed by feedback from blind and sighted users. The derived prosodic phrase accent and pause break placements and values are examined in terms of how successfully they convey semantically important visual information through prosody control in Table-to-Speech synthesis. Finally, the quality of information provision for synthesized tables rendered with the proposed prosody specification is compared against plain synthesis.
Pages: 294-301
Page count: 8