共 50 条
- [21] PROSODIC CLUSTERING FOR PHONEME-LEVEL PROSODY CONTROL IN END-TO-END SPEECH SYNTHESIS 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5719 - 5723
- [22] Myanmar Text-to-Speech System based on Tacotron (End-to-End Generative Model) 11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 572 - 577
- [23] BI-LEVEL STYLE AND PROSODY DECOUPLING MODELING FOR PERSONALIZED END-TO-END SPEECH SYNTHESIS 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6568 - 6572
- [24] IMPROVING PROSODY MODELLING WITH CROSS-UTTERANCE BERT EMBEDDINGS FOR END-TO-END SPEECH SYNTHESIS 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6079 - 6083
- [27] TOWARDS END-TO-END UNSUPERVISED SPEECH RECOGNITION 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 221 - 228
- [28] Speaker Adaptation Experiments with Limited Data for End-to-End Text-To-Speech Synthesis using Tacotron2 INFOCOMMUNICATIONS JOURNAL, 2022, 14 (03): : 55 - 62
- [29] SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2829 - 2837
- [30] Towards End-to-End Speech-to-Text Summarization TEXT, SPEECH, AND DIALOGUE, TSD 2023, 2023, 14102 : 304 - 316