共 50 条
- [42] Streamable Speech Representation Disentanglement and Multi-Level Prosody Modeling for Live One-Shot Voice Conversion INTERSPEECH 2022, 2022, : 2578 - 2582
- [43] Two-stage and Self-supervised Voice Conversion for Zero-Shot Dysarthric Speech Reconstruction 2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 423 - 427
- [44] WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2023, 2023,
- [46] Comparison of Multi-Scale Speaker Vectors and S-Vectors for Zero-Shot Speech Synthesis 2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022, : 247 - 248
- [48] SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model INTERSPEECH 2021, 2021, : 3645 - 3649
- [49] Multi-level Fusion of Multi-modal Semantic Embeddings for Zero Shot Learning PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 310 - 318