Adaptive End-to-End Text-to-Speech Synthesis Based on Error Correction Feedback from Humans

被引：0

作者：

Fujii, Kazuki ^{[1
]}

Saito, Yuki ^{[1
]}

Saruwatari, Hiroshi ^{[1
]}

机构：

[1] Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo Bunkyo-ku, Tokyo,133-8656, Japan

来源：

Proceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022 | 2022年

关键词：

Engineering Village;

D O I：

2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022

中图分类号：

学科分类号：

摘要：

Correct error - Embeddings - End to end - Errors correction - Human listeners - Human-in-the-loop - State of the art - Synthetic speech - Text to speech - Text-to-speech system

引用

页码：1702 / 1707

共 50 条

[31] End-to-End Speech Synthesis for Bangla with Text Normalization
Pial, Tanzir Islam
Aunti, Shahreen Salim
Ahmed, Shabbir
Heickal, Hasnain
2018 5TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE/ INTELLIGENCE AND APPLIED INFORMATICS (CSII 2018), 2018, : 66 - 71
[32] End-to-End Automatic Speech Recognition with a Reconstruction Criterion Using Speech-to-Text and Text-to-Speech Encoder-Decoders
Masumura, Ryo
Sato, Hiroshi
Tanaka, Tomohiro
Moriya, Takafumi
Ijima, Yusuke
Oba, Takanobu
INTERSPEECH 2019, 2019, : 1606 - 1610
[33] A Novel End-to-End Turkish Text-to-Speech (TTS) System via Deep Learning
Oyucu, Saadin
ELECTRONICS, 2023, 12 (08)
[34] Reinforce-Aligner: Reinforcement Alignment Search for Robust End-to-End Text-to-Speech
Chung, Hyunseung
Lee, Sang-Hoon
Lee, Seong-Whan
INTERSPEECH 2021, 2021, : 3635 - 3639
[35] PREDICTING EXPRESSIVE SPEAKING STYLE FROM TEXT IN END-TO-END SPEECH SYNTHESIS
Stanton, Daisy
Wang, Yuxuan
Skerry-Ryan, R. J.
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 595 - 602
[36] Speaker Adaptation Experiments with Limited Data for End-to-End Text-To-Speech Synthesis using Tacotron2
Mandeel, Ali Raheem
Al-Radhi, Mohammed Salah
Csapo, Tamas Gabor
INFOCOMMUNICATIONS JOURNAL, 2022, 14 (03): : 55 - 62
[37] Boosting subjective quality of Arabic text-to-speech (TTS) using end-to-end deep architecture
Fady K. Fahmy
Hazem M. Abbas
Mahmoud I. Khalil
International Journal of Speech Technology, 2022, 25 : 79 - 88
[38] You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation
Laptev, Aleksandr
Korostik, Roman
Svischev, Aleksey
Andrusenko, Andrei
Medennikov, Ivan
Rybin, Sergey
2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 439 - 444
[39] Boosting subjective quality of Arabic text-to-speech (TTS) using end-to-end deep architecture
Fahmy, Fady K.
Abbas, Hazem M.
Khalil, Mahmoud, I
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 79 - 88
[40] BLSTM-CRF Based End-to-End Prosodic Boundary Prediction with Context Sensitive Embeddings in A Text-to-Speech Front-End
Zheng, Yibin
Tao, Jianhua
Wen, Zhengqi
Li, Ya
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 47 - 51

← 1 2 3 4 5 →