共 50 条
- [32] A Comprehensive Noise Robust Speech Parameterization Algorithm Using Wavelet Packet Decomposition-Based Denoising and Speech Feature Representation Techniques EURASIP Journal on Advances in Signal Processing, 2007
- [33] EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 1144 - 1154
- [35] Bio-Inspired Sparse Representation of Speech and Audio Using Psychoacoustic Adaptive Matching Pursuit SPEECH AND COMPUTER, 2016, 9811 : 156 - 164
- [38] Mi-Go: tool which uses YouTube as data source for evaluating general-purpose speech recognition machine learning models EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01):
- [40] Difusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, : 755 - 762