共 50 条
- [21] Learning Multi-Modal Word Representation Grounded in Visual Context THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5626 - 5633
- [23] Multi-Modal Representation Learning with Text-Driven Soft Masks 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2798 - 2807
- [25] MMEarth: Exploring Multi-modal Pretext Tasks for Geospatial Representation Learning COMPUTER VISION - ECCV 2024, PT LXIV, 2025, 15122 : 164 - 182
- [28] Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7661 - 7671
- [30] SSDMM-VAE: variational multi-modal disentangled representation learning Applied Intelligence, 2023, 53 : 8467 - 8481