Prediction of Prosodic Word Boundaries in Chinese TTS Based on Maximum Entropy Markov Model and Transformation Based Learning

被引:1
|
作者
Zhao, Ziping [1 ]
Ma, Xirong [1 ]
机构
[1] Tianjin Normal Univ, Coll Comp & Informat Engn, Tianjin, Peoples R China
关键词
Prosodic Word; Text-to-speech system(TTS); Transformation-based error-driven learning(TBL); Maximum Entropy Markov Model(MEMM);
D O I
10.1109/CIS.2012.64
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical prosody structure generation is a key component for a speech synthesis system. As the basic prosodic unit, the prosodic word plays an important role for the naturalness and the intelligibility for the Chinese TTS system. In this paper we proposed an approach for prediction of Chinese prosodic word boundaries in unrestricted Chinese text, which combines Maximum Entropy Markov Model(MEMM) and TBL model. First MEMM is trained to predict the prosodic word boundaries. After that we apply a TBL based error driven learning approach to amend the initial prediction. A comparison is conducted between the new model and HMM for prosodic word boundaries prediction. Experiments show that the combined approach improves overall performance. The precision and recall ratio are improved.
引用
收藏
页码:258 / 261
页数:4
相关论文
共 50 条
  • [21] Integration of named entity information for Chinese word segmentation based on Maximum Entropy
    Leong, Ka Seng
    Wong, Fai
    Li, Yiping
    Dong, Ming Chui
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 962 - +
  • [22] Chinese Prosodic Structure Prediction Based on a Pretrained Language Representation Model
    Zhang P.
    Lu C.
    Wang R.
    Zhang, Pengyuan (zhangpengyuan@hccl.ioa.ac.cn), 1600, Tianjin University (53): : 265 - 271
  • [23] Enhance the word vector with prosodic information for the recurrent neural network based TTS system
    Wang, Xin
    Takaki, Shinji
    Yamagishi, Junichi
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2856 - 2860
  • [24] Word Alignment Model Based on Maximum Entropy in Foreign Language Translation
    Jun, Chen
    2020 5TH INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2020), 2020, : 635 - 638
  • [25] Research on ore prospecting prediction based on maximum entropy model
    Zhen Chen
    Arabian Journal of Geosciences, 2024, 17 (1)
  • [26] Extrinsic Behavior Prediction of Pedestrians via Maximum Entropy Markov Model and Graph-Based Features Mining
    Ghadi, Yazeed Yasin
    Akhter, Israr
    Aljuaid, Hanan
    Gochoo, Munkhjargal
    Alsuhibany, Suliman A.
    Jalal, Ahmad
    Park, Jeongmin
    APPLIED SCIENCES-BASEL, 2022, 12 (12):
  • [27] Deep Learning-Based Speaking Rate-Dependent Hierarchical Prosodic Model for Mandarin TTS
    Lin, Yen-Ting
    Chiang, Chen-Yu
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1278 - 1283
  • [28] LEARNING AND PREDICTION BASED ON A RELATIONAL HIDDEN MARKOV MODEL
    Elfers, Carsten
    Wagner, Thomas
    ICAART 2010: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1: ARTIFICIAL INTELLIGENCE, 2010, : 211 - 216
  • [29] A seqlet-based maximum entropy Markov approach for protein secondary structure prediction
    Qiwen Dong
    Xiaolong Wang
    Lei Lin
    Yi Guan
    Science in China Series C: Life Sciences, 2005, 48 : 394 - 405