50 items in total
- [1] Building large-vocabulary speaker-independent lipreading systems. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 2648-2652
- [2] Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention. 2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024
- [3] A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022: 2485-2494
- [4] Large-vocabulary Audio-visual Speech Recognition in Noisy Environments. IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021
- [5] Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021: 341-345
- [6] CROSS-ATTENTION WATERMARKING OF LARGE LANGUAGE MODELS. 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024: 4625-4629