共 50 条
- [1] Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models COMPUTER VISION-ECCV 2024, PT XVIII, 2025, 15076 : 147 - 164
- [3] Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5038 - 5047
- [4] VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [5] ViLLA: Fine-Grained Vision-Language Representation Learning from Real-World Data 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22168 - 22178
- [6] GalLoP: Learning Global and Local Prompts for Vision-Language Models COMPUTER VISION - ECCV 2024, PT LXI, 2025, 15119 : 264 - 282
- [7] Leveraging per Image-Token Consistency for Vision-Language Pre-training 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19155 - 19164
- [8] ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15375 - 15385
- [9] Toward Real-world Panoramic Image Enhancement 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2675 - 2684
- [10] Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19175 - 19186