50 entries in total
- [31] VLUE: A Multi-Task Benchmark for Evaluating Vision-Language Models INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [32] Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023 : 2634 - 2645
- [33] Multimodal Search on Iconclass using Vision-Language Pre-Trained Models 2023 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, JCDL, 2023 : 285 - 287
- [34] Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024 : 9590 - 9601
- [35] Vision-language models for medical report generation and visual question answering: a review FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
- [36] Generating Robot Action Sequences: An Efficient Vision-Language Models with Visual Prompts 2024 INTERNATIONAL WORKSHOP ON INTELLIGENT SYSTEMS, IWIS 2024, 2024
- [37] Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans? 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023 : 5718 - 5728
- [38] Patch is enough: naturalistic adversarial patch against vision-language pre-training models Visual Intelligence, 2 (1)
- [39] Boosting Transferability in Vision-Language Attacks via Diversification Along the Intersection Region of Adversarial Trajectory COMPUTER VISION-ECCV 2024, PT LVII, 2025, 15115 : 442 - 460
- [40] Concept-Based Analysis of Neural Networks via Vision-Language Models AI VERIFICATION, SAIV 2024, 2024, 14846 : 49 - 77