共 50 条
- [31] Deep End-to-End Representation Learning for Food Type Recognition from Speech ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 574 - 578
- [32] End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation INTERSPEECH 2022, 2022, : 3819 - 3823
- [33] Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition INTERSPEECH 2021, 2021, : 2886 - 2890
- [34] Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework INTERSPEECH 2020, 2020, : 2962 - 2966
- [35] Optimization for Low-Resource Speaker Adaptation in End-to-End Text-to-Speech 2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 1060 - 1061
- [36] M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation INTERSPEECH 2022, 2022, : 111 - 115
- [37] Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech INTERSPEECH 2023, 2023, : 3023 - 3027
- [38] Efficient Adaptation of Spoken Language Understanding based on End-to-End Automatic Speech Recognition INTERSPEECH 2023, 2023, : 3959 - 3963
- [39] Personality-aware Training based Speaker Adaptation for End-to-end Speech Recognition INTERSPEECH 2023, 2023, : 1249 - 1253
- [40] End-to-End Automatic Speech Recognition with a Reconstruction Criterion Using Speech-to-Text and Text-to-Speech Encoder-Decoders INTERSPEECH 2019, 2019, : 1606 - 1610