共 50 条
- [1] CCSRD: Content-Centric Speech Representation Disentanglement Learning for End-to-End Speech Translation FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 5920 - 5932
- [3] Prosody-TTS: An End-to-End Speech Synthesis System with Prosody Control Circuits, Systems, and Signal Processing, 2023, 42 : 361 - 384
- [4] Information Sieve: Content Leakage Reduction in End-to-End Prosody Transfer for Expressive Speech Synthesis INTERSPEECH 2021, 2021, : 131 - 135
- [5] Deep End-to-End Representation Learning for Food Type Recognition from Speech ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 574 - 578
- [6] Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
- [7] End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation INTERSPEECH 2022, 2022, : 3819 - 3823
- [8] Speech Representation Learning for Emotion Recognition Using End-to-End ASR with Factorized Adaptation INTERSPEECH 2020, 2020, : 536 - 540
- [9] ROBUST AND FINE-GRAINED PROSODY CONTROL OF END-TO-END SPEECH SYNTHESIS 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5911 - 5915
- [10] Unsupervised Learning of Disentangled Speech Content and Style Representation INTERSPEECH 2021, 2021, : 4089 - 4093