50 items in total
- [21] Multimodal detection of hateful memes by applying a vision-language pre-training model. PLOS ONE, 2022, 17 (09):
- [22] Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023: 6967-6977
- [23] Transferable Multimodal Attack on Vision-Language Pre-training Models. 45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024, 2024: 1722-1740
- [24] Enhancing Dynamic Image Advertising with Vision-Language Pre-training. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023: 3310-3314
- [25] Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023: 1073-1083
- [27] Too Large; Data Reduction for Vision-Language Pre-Training. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 3124-3134
- [28] Scaling Up Vision-Language Pre-training for Image Captioning. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 17959-17968
- [29] Towards Adversarial Attack on Vision-Language Pre-training Models. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022: 5005-5013
- [30] MAFA: Managing False Negatives for Vision-Language Pre-training. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024: 27304-27314