共 50 条
- [31] Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4578 - 4587
- [32] Exploitation of Phase-Based Features for Whispered Speech Emotion Recognition IEEE ACCESS, 2016, 4 : 4299 - 4309
- [33] APPLICATION OF NEURAL NETWORKS IN WHISPERED SPEECH RECOGNITION 2012 20TH TELECOMMUNICATIONS FORUM (TELFOR), 2012, : 728 - 731
- [34] WORD TONE RECOGNITION IN VIETNAMESE WHISPERED SPEECH WORD-JOURNAL OF THE INTERNATIONAL LINGUISTIC ASSOCIATION, 1961, 17 (01): : 11 - 15
- [35] GENERATING SYNTHETIC AUDIO DATA FOR ATTENTION-BASED SPEECH RECOGNITION SYSTEMS 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7069 - 7073
- [38] GENERATIVE ADVERSARIAL NETWORKS BASED DATA AUGMENTATION FOR NOISE ROBUST SPEECH RECOGNITION 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5044 - 5048
- [39] Reinforcement Learning based Data Augmentation for Noise Robust Speech Emotion Recognition INTERSPEECH 2024, 2024, : 1040 - 1044
- [40] Lattice-based Data Augmentation for Code-switching Speech Recognition PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1667 - 1672