Hearer model based stress prediction for Chinese TTS system

Authors
Hu, GP [1]
Liu, QF [1]
Hu, Y [1]
Wang, RH [1]
Affiliation
[1] University of Science and Technology of China, Ifly Speech Lab, Hefei 230026, People's Republic of China
Keywords
hearer model; stress prediction; speech synthesis
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Listeners often become fatigued when they listen to synthesized speech for a long time. The main reason is that synthesized speech is too flat and never stresses the focus. Unlike the traditional TTS research approach of simulating the speaker, this paper studies stress prediction from the hearer's point of view. An ideal hearer model is first proposed to predict the stress distribution, based on the hypothesis that people speak with limited stress effort and distribute that effort so that the hearer can understand the speaker easily. Because research resources are limited, the paper then modifies the ideal hearer model and presents a practical model. Experiments show that the stress prediction achieves an acceptable rate of 87.36%.
Pages: 161-164
Page count: 4