Hearer model based stress prediction for Chinese TTS system

Cited by: 0

Authors:

Hu, GP [1]

Liu, QF [1]

Hu, Y [1]

Wang, RH [1]

Affiliation:

[1] Univ Sci & Technol China, Ifly Speech Lab, Hefei 230026, Peoples R China

Source:

2004 International Symposium on Chinese Spoken Language Processing, Proceedings | 2004

Keywords:

hearer model; stress prediction; speech synthesis;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Listeners often tire when they listen to synthesized speech for a long time, mainly because synthesized speech is too flat and never stresses the focus. Unlike the traditional TTS research approach of simulating the speaker, this paper studies stress prediction from the hearer's point of view. An ideal hearer model is first proposed to predict the stress distribution, based on the hypothesis that people speak with limited stress effort and distribute that effort so that the hearer can understand the speaker easily. Then, given limited research resources, the paper modifies the ideal hearer model and presents a practical model. Experiments show that the stress prediction achieves an acceptable rate of 87.36%.


Pages: 161-164

Page count: 4
