共 41 条
- [31] CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 1405 - 1413
- [33] Bridging the Lexical Gap: Generative Text-to-Image Retrieval for Parts-of-Speech Imbalance in Vision-Language Models PROCEEDINGS OF THE 2ND INTERNATIONAL WORKSHOP ON DEEP MULTIMODAL GENERATION AND RETRIEVAL, MMGR 2024, 2024, : 25 - 33
- [34] Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18208 - 18217
- [36] Alzheimer's disease recognition using graph neural network by leveraging image-text similarity from vision language model SCIENTIFIC REPORTS, 2025, 15 (01):
- [37] Open-world driving scene segmentation via multi-stage and multi-modality fusion of vision-language embedding 2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
- [39] OPEN-VOCABULARY SKELETON ACTION RECOGNITION WITH DIFFUSION GRAPH CONVOLUTIONAL NETWORK AND PRE-TRAINED VISION-LANGUAGE MODELS 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3195 - 3199
- [40] End-to-End: A Simple Template for the Long-Tailed-Recognition of Transmission Line Clamps via a Vision-Language Model APPLIED SCIENCES-BASEL, 2023, 13 (05):