High-quality prosody generation in Mandarin text-to-speech system

Cited by: 0
Authors
Guo, Qing [1]
Zhang, Jie [2, 3]
Katae, Nobuyuki [1 ]
Yu, Hao [1 ]
Affiliations
[1] Fujitsu Research and Development Center Co., Ltd., China
[2] Fujitsu Laboratories Ltd., Japan
[3] Kyushu Institute of Design, Fukuoka, Japan
Pages: 40 - 46
Related papers
50 in total
  • [31] Pitch models of Mandarin text-to-speech
    邵艳秋
    穗志方
    韩纪庆
    Journal of Harbin Institute of Technology(New series), 2009, 16 (02) : 179 - 184
  • [33] An HMM-based Mandarin Chinese Text-to-Speech system
    Qian, Yao
    Soong, Frank
    Chen, Yining
    Chu, Min
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 223 - +
  • [34] Towards a multilingual prosody model for text-to-speech
    Jokisch, O
    Ding, HW
    Kruschke, H
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 421 - 424
  • [35] CAMNet: A controllable acoustic model for efficient, expressive, high-quality text-to-speech
    Alvarez, Jesus Monge
    Francois, Holly
    Sung, Hosang
    Choi, Seungdo
    Jeong, Jonghoon
    Choo, Kihyun
    Min, Kyoungbo
    Park, Sangjun
    APPLIED ACOUSTICS, 2022, 186
  • [36] SpikeVoice: High-Quality Text-to-Speech Via Efficient Spiking Neural Network
    Wang, Kexin
    Zhang, Jiahong
    Ren, Yong
    Yao, Man
    Di Shang
    Xu, Bo
    Li, Guoqi
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 7927 - 7940
  • [37] High-Quality Text-to-Speech Implementation via Active Shallow Diffusion Mechanism
    Deng, Junlin
    Hou, Ruihan
    Deng, Yan
    Long, Yongqiu
    Wu, Ning
    SENSORS, 2025, 25 (03)
  • [38] Towards including prosody in a text-to-speech system for modern standard Arabic
    Ramsay, Allan
    Mansour, Hanady
    COMPUTER SPEECH AND LANGUAGE, 2008, 22 (01) : 84 - 103
  • [39] Chinese Prosody Generation Based on C-ToBI Representation for Text-To-Speech
    Kim, Byeongchang
    ADVANCES IN COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2010, 6059 : 558 - 571
  • [40] Towards high-quality next-generation text-to-speech synthesis: A multidomain approach by automatic domain classification
    Alias, Francesc
    Sevillano, Xavier
    Socoro, Joan Claudi
    Gonzalvo, Xavier
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (07) : 1340 - 1354