41 items in total
- [21] Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024: 9848-9858
- [22] Prior-Experience-Based Vision-Language Model for Remote Sensing Image-Text Retrieval. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
- [23] SEER: Backdoor Detection for Vision-Language Models through Searching Target Text and Image Trigger Jointly. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024: 7766-7774
- [25] Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality. 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023: 869-893
- [29] Image caption generation via improved vision-language pre-training model: perception towards image retrieval. IMAGING SCIENCE JOURNAL, 2025
- [30] Reject Decoding via Language-Vision Models for Text-to-Image Synthesis. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023: 2785-2794