共 50 条
- [2] Eyes Closed, Safety on: Protecting Multimodal LLMs via Image-to-Text Transformation COMPUTER VISION - ECCV 2024, PT XVII, 2025, 15075 : 388 - 404
- [4] Multi-label semantic sharing based on graph convolutional network for image-to-text retrieval VISUAL COMPUTER, 2025, 41 (03): : 1827 - 1840
- [5] Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5517 - 5526
- [7] Evaluating Text-to-Visual Generation with Image-to-Text Generation COMPUTER VISION - ECCV 2024, PT IX, 2025, 15067 : 366 - 384
- [10] Causal image-text retrieval embedded with consensus knowledge Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2024, 46 (02): : 317 - 328