HMM-based singing voice synthesis system using pitch-shifted pseudo training data

Authors
Mase, Ayami [1]
Oura, Keiichiro [1]
Nankaku, Yoshihiko [1]
Tokuda, Keiichi [1]
Affiliations
[1] Nagoya Inst Technol, Dept Comp Sci, Nagoya, Aichi, Japan
Keywords
singing voice synthesis; HMM-based speech synthesis; pitch-shift
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Over the last few years, a statistical parametric approach to singing voice synthesis based on hidden Markov models (HMMs) has been developed. In this approach, the spectrum, excitation, and duration of singing voices are simultaneously modeled by context-dependent HMMs, and waveforms are generated from the HMMs themselves. However, pitches that rarely appear in the training data cannot be generated properly because the system cannot model their fundamental frequency (F0) contours. In this paper, we propose a technique for training HMMs using pitch-shifted pseudo data. Subjective listening test results show that the proposed technique improves the naturalness of the synthesized singing voices.
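The abstract does not say how the pitch-shifted pseudo data are produced, so the following is only a minimal sketch of one plausible reading: each training utterance is duplicated with its F0 contour and its score notes shifted by the same number of semitones, so that rarely seen pitches gain training examples. The function names, the semitone shift set, and the convention that unvoiced frames are stored as 0.0 Hz are all assumptions for illustration, not details taken from the paper.

```python
def shift_f0(f0_hz, semitones):
    """Shift an F0 contour (Hz per frame) by a number of semitones.

    Assumption: unvoiced frames are marked as 0.0 and are left unchanged.
    """
    ratio = 2.0 ** (semitones / 12.0)  # one semitone = factor of 2^(1/12)
    return [f * ratio if f > 0.0 else 0.0 for f in f0_hz]


def shift_notes(midi_notes, semitones):
    """Shift the score's MIDI note numbers so labels match the shifted F0."""
    return [n + semitones for n in midi_notes]


def make_pseudo_data(f0_hz, midi_notes, shifts=(-1, 1)):
    """Build pseudo training examples for each shift in `shifts`.

    Returns a list of (shift, shifted_f0, shifted_notes) tuples.
    The default shift set of +/-1 semitone is hypothetical.
    """
    return [(s, shift_f0(f0_hz, s), shift_notes(midi_notes, s)) for s in shifts]


if __name__ == "__main__":
    f0 = [440.0, 0.0, 220.0]   # A4, unvoiced frame, A3
    notes = [69, 57]           # MIDI numbers for A4 and A3
    for shift, new_f0, new_notes in make_pseudo_data(f0, notes, shifts=(-12, 12)):
        print(shift, new_f0, new_notes)
```

Shifting the score labels together with the contour keeps each pseudo example internally consistent; a real system would also have to decide whether and how to modify the waveform or spectral features, which this sketch leaves out.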
Pages: 845-848
Page count: 4