Hearer model based stress prediction for Chinese TTS system

被引:0
|
作者
Hu, GP [1 ]
Liu, QF [1 ]
Hu, Y [1 ]
Wang, RH [1 ]
机构
[1] Univ Sci & Technol China, Ifly Speech Lab, Hefei 230026, Peoples R China
关键词
hearer model; stress prediction; speech synthesis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
People often feel tired if he/she listens synthesized speech for a long time. This is mainly because synthesized speech is too flat and never stresses the focus. Different to traditional TTS research approach of simulating speaker, this paper does the stress prediction research from the point of the hearer. An ideal hearer model is first proposed to predict the stress distribution based on the hypothesis: people speak with limited stress effort and distribute the limited effort to ensure that the hearer can understand the speaker easily. Then according to the limited research resource, this paper modifies the ideal hearer model and presents a practical model. Experiments show that the stress prediction achieves an acceptable rate of 87.36%.
引用
收藏
页码:161 / 164
页数:4
相关论文
共 50 条
  • [1] Prediction of Prosodic Word Boundaries in Chinese TTS Based on Maximum Entropy Markov Model and Transformation Based Learning
    Zhao, Ziping
    Ma, Xirong
    PROCEEDINGS OF THE 2012 EIGHTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2012), 2012, : 258 - 261
  • [2] A maximum entropy Markov model for prediction of prosodic phrase boundaries in Chinese TTS
    Zhao, Ziping
    Zhao, Tingjian
    Zhu, Yaoting
    GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 498 - 501
  • [3] The Power of Special Characters in ProsodicWord Prediction for Chinese TTS
    Zhang, Zhengchen
    Dong, Minghui
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 280 - 283
  • [4] Study on prediction of prosodic phrase boundaries in Chinese TTS
    Zhao, Ziping
    Zhu, Yaoting
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 354 - +
  • [5] Integrated approaches to prosodic word prediction for Chinese TTS
    Fu, GH
    Luke, KK
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 413 - 418
  • [6] The control of juncture and prosody in Chinese TTS system
    Chu, M
    Lu, SA
    Si, HY
    He, L
    Guan, DH
    ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 725 - 728
  • [7] Automatic Stress Annotation and Prediction for Expressive Mandarin TTS
    He, Wendi
    Lin, Yiting
    Ye, Jianhao
    Zhou, Hongbin
    Ren, Kaimeng
    He, Tianwei
    Tan, Pengfei
    Lu, Heng
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2022, 2023, 1765 : 306 - 317
  • [8] Fujisaki Model Based Intonation Modeling for Korean TTS System
    Kim, Byeongchang
    Lee, Jinsik
    Lee, Gary Geunbae
    UBIQUITOUS COMPUTING AND MULTIMEDIA APPLICATIONS, 2010, 75 : 103 - +
  • [9] Sinusoidal model parameterization for HMM-based TTS system
    Shechtman, Slava
    Sorin, Alex
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 805 - 808
  • [10] Active learning for prediction of prosodic word boundaries in Chinese TTS using Maximum Entropy Markov model
    Zhao, Ziping
    Ma, Xirong
    Journal of Software, 2013, 8 (12) : 3222 - 3228