共 50 条
- [42] CCC-WAV2VEC 2.0: CLUSTERING AIDED CROSS CONTRASTIVE SELF-SUPERVISED LEARNING OF SPEECH REPRESENTATIONS 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 1 - 8
- [43] Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction INTERSPEECH 2022, 2022, : 4088 - 4092
- [44] Classification of Vocal Intensity Category from Speech using the Wav2vec2 and Whisper Embeddings INTERSPEECH 2023, 2023, : 4134 - 4138
- [45] Automatic Detection of Disfluency Boundaries in Spontaneous Speech of Children Using Audio-Visual Information IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (01): : 2 - 12
- [47] Automatic Classification of Parkinson's Disease Using Wav2vec Embeddings at Phoneme, Syllable, and Word Levels TEXT, SPEECH, AND DIALOGUE, TSD 2024, PT II, 2024, 15049 : 313 - 323
- [48] Automatic detection of Parkinson's disease in running speech spoken in three different languages JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 139 (01): : 481 - 500
- [49] Assessment of Non-Native Speech Intelligibility using Wav2vec2-based Mispronunciation Detection and Multi-level Goodness of Pronunciation Transformer INTERSPEECH 2023, 2023, : 984 - 988