Adaptive End-to-End Text-to-Speech Synthesis Based on Error Correction Feedback from Humans

被引：0

作者：

Fujii, Kazuki ^{[1
]}

Saito, Yuki ^{[1
]}

Saruwatari, Hiroshi ^{[1
]}

机构：

[1] Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo Bunkyo-ku, Tokyo,133-8656, Japan

来源：

Proceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022 | 2022年

关键词：

Engineering Village;

D O I：

2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022

中图分类号：

学科分类号：

摘要：

Correct error - Embeddings - End to end - Errors correction - Human listeners - Human-in-the-loop - State of the art - Synthetic speech - Text to speech - Text-to-speech system

引用

页码：1702 / 1707

共 50 条

[21] Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Kim, Jaehyeon
Kong, Jungil
Son, Juhee
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[22] END-TO-END TEXT-TO-SPEECH USING LATENT DURATION BASED ON VQ-VAE
Yasuda, Yusuke
Wang, Xin
Yamagishi, Junichi
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5694 - 5698
[23] On the Training and Testing Data Preparation for End-to-End Text-to-Speech Application
Duc Chung Tran
Khan, M. K. A. Ahamed
Sridevi, S.
2020 11TH IEEE CONTROL AND SYSTEM GRADUATE RESEARCH COLLOQUIUM (ICSGRC), 2020, : 73 - 75
[24] SEMI-SUPERVISED END-TO-END SPEECH RECOGNITION USING TEXT-TO-SPEECH AND AUTOENCODERS
Karita, Shigeki
Watanabe, Shinji
Iwata, Tomoharu
Delcroix, Marc
Ogawa, Atsunori
Nakatani, Tomohiro
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6166 - 6170
[25] End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue
Mitsui, Kentaro
Zhao, Tianyu
Sawada, Kei
Hono, Yukiya
Nankaku, Yoshihiko
Tokuda, Keiichi
INTERSPEECH 2022, 2022, : 2328 - 2332
[26] Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework
Prakash, Anusha
Murthy, Hema A.
INTERSPEECH 2020, 2020, : 2962 - 2966
[27] Optimization for Low-Resource Speaker Adaptation in End-to-End Text-to-Speech
Hong, Changi
Lee, Jung Hyuk
Jeon, Moongu
Kim, Hong Kook
2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 1060 - 1061
[28] Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech
Yoon, Hyungchan
Um, Seyun
Kim, Changhwan
Kang, Hong-Goo
INTERSPEECH 2023, 2023, : 3023 - 3027
[29] SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
Cho, Hyunjae
Jung, Wonbin
Lee, Junhyeok
Woo, Sang Hoon
INTERSPEECH 2022, 2022, : 1 - 5
[30] Phonetic and Prosodic Information Estimation from Texts for Genuine Japanese End-to-End Text-to-Speech
Kakegawa, Naoto
Hara, Sunao
Abe, Masanobu
Ijima, Yusuke
INTERSPEECH 2021, 2021, : 126 - 130

← 1 2 3 4 5 →