50 entries in total
- [41] SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model. INTERSPEECH 2021, 2021: 3645-3649
- [42] Cross-lingual multi-speaker speech synthesis with limited bilingual training data. COMPUTER SPEECH AND LANGUAGE, 2023, 77
- [43] NNSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-Shot Multi-Speaker Text-to-Speech. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022: 4293-4297
- [45] Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022: 1708-1712
- [46] Incorporating Cross-speaker Style Transfer for Multi-language Text-to-Speech. INTERSPEECH 2021, 2021: 1619-1623
- [48] Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image. INTERSPEECH 2020, 2020: 1321-1325
- [49] Gated Recurrent Attention for Multi-Style Speech Synthesis. APPLIED SCIENCES-BASEL, 2020, 10(15)