50 entries in total
- [21] VLP: A Survey on Vision-language Pre-training. Machine Intelligence Research, 2023, 20: 38-56.
- [22] Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023: 6620-6630.
- [23] UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021: 4153-4163.
- [24] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training. Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023.
- [25] RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-training. Findings of the Association for Computational Linguistics (ACL 2023), 2023: 11747-11762.
- [26] CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising. Proceedings of the 29th ACM International Conference on Multimedia (MM 2021), 2021: 5600-5608.
- [27] Pre-training A Prompt Pool for Vision-Language Model. 2023 International Joint Conference on Neural Networks (IJCNN), 2023.
- [30] Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding. Thirty-Eighth AAAI Conference on Artificial Intelligence, Vol. 38 No. 7, 2024: 7296-7304.