共 50 条
- [41] Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10543 - 10553
- [44] Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition INTERSPEECH 2022, 2022, : 4740 - 4744
- [45] Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1336 - 1345
- [47] Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9723 - 9732
- [48] Audio-to-Image Cross-Modal Generation 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
- [49] Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals 2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 83 - 89