共 50 条
- [1] Deep Learning-Based End-to-End Speaker Identification Using Time–Frequency Representation of Speech Signal Circuits, Systems, and Signal Processing, 2024, 43 : 1839 - 1861
- [2] End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain INTERSPEECH 2021, 2021, : 3046 - 3050
- [3] End-to-end speech recognition from raw speech: Multi time-frequency resolution CNN architecture for efficient representation learning 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 536 - 540
- [5] END-TO-END DEEP LEARNING-BASED ADAPTATION CONTROL FOR FREQUENCY-DOMAIN ADAPTIVE SYSTEM IDENTIFICATION 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 766 - 770
- [7] End-To-End Speech Emotion Recognition Based on Time and Frequency Information Using Deep Neural Networks ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 969 - 975
- [8] End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning INTERSPEECH 2019, 2019, : 4425 - 4429
- [10] Deep End-to-End Representation Learning for Food Type Recognition from Speech ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 574 - 578