Adaptive End-to-End Text-to-Speech Synthesis Based on Error Correction Feedback from Humans

被引:0
|
作者
Fujii, Kazuki [1 ]
Saito, Yuki [1 ]
Saruwatari, Hiroshi [1 ]
机构
[1] Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo Bunkyo-ku, Tokyo,133-8656, Japan
关键词
Engineering Village;
D O I
2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022
中图分类号
学科分类号
摘要
Correct error - Embeddings - End to end - Errors correction - Human listeners - Human-in-the-loop - State of the art - Synthetic speech - Text to speech - Text-to-speech system
引用
收藏
页码:1702 / 1707
相关论文
共 50 条
  • [21] Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
    Kim, Jaehyeon
    Kong, Jungil
    Son, Juhee
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [22] END-TO-END TEXT-TO-SPEECH USING LATENT DURATION BASED ON VQ-VAE
    Yasuda, Yusuke
    Wang, Xin
    Yamagishi, Junichi
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5694 - 5698
  • [23] On the Training and Testing Data Preparation for End-to-End Text-to-Speech Application
    Duc Chung Tran
    Khan, M. K. A. Ahamed
    Sridevi, S.
    2020 11TH IEEE CONTROL AND SYSTEM GRADUATE RESEARCH COLLOQUIUM (ICSGRC), 2020, : 73 - 75
  • [24] SEMI-SUPERVISED END-TO-END SPEECH RECOGNITION USING TEXT-TO-SPEECH AND AUTOENCODERS
    Karita, Shigeki
    Watanabe, Shinji
    Iwata, Tomoharu
    Delcroix, Marc
    Ogawa, Atsunori
    Nakatani, Tomohiro
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6166 - 6170
  • [25] End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue
    Mitsui, Kentaro
    Zhao, Tianyu
    Sawada, Kei
    Hono, Yukiya
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    INTERSPEECH 2022, 2022, : 2328 - 2332
  • [26] Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework
    Prakash, Anusha
    Murthy, Hema A.
    INTERSPEECH 2020, 2020, : 2962 - 2966
  • [27] Optimization for Low-Resource Speaker Adaptation in End-to-End Text-to-Speech
    Hong, Changi
    Lee, Jung Hyuk
    Jeon, Moongu
    Kim, Hong Kook
    2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 1060 - 1061
  • [28] Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech
    Yoon, Hyungchan
    Um, Seyun
    Kim, Changhwan
    Kang, Hong-Goo
    INTERSPEECH 2023, 2023, : 3023 - 3027
  • [29] SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
    Cho, Hyunjae
    Jung, Wonbin
    Lee, Junhyeok
    Woo, Sang Hoon
    INTERSPEECH 2022, 2022, : 1 - 5
  • [30] Phonetic and Prosodic Information Estimation from Texts for Genuine Japanese End-to-End Text-to-Speech
    Kakegawa, Naoto
    Hara, Sunao
    Abe, Masanobu
    Ijima, Yusuke
    INTERSPEECH 2021, 2021, : 126 - 130