共 50 条
- [41] Scaling Vision-Language Models with Sparse Mixture of Experts FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11329 - 11344
- [43] On Evaluating Adversarial Robustness of Large Vision-Language Models ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [45] Evaluating Object Hallucination in Large Vision-Language Models 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 292 - 305
- [46] Adapting vision-language AI models to cardiology tasks NATURE MEDICINE, 2024, 30 (05) : 1245 - 1246
- [47] BRAVE: Broadening the Visual Encoding of Vision-Language Models COMPUTER VISION - ECCV 2024, PT XVI, 2025, 15074 : 113 - 132
- [48] Multimodal Search on Iconclass using Vision-Language Pre-Trained Models 2023 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, JCDL, 2023, : 285 - 287
- [49] Scale Alone Does not Improve Mechanistic Interpretability in Vision Models ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [50] TEXT-IMAGE DE-CONTEXTUALIZATION DETECTION USING VISION-LANGUAGE MODELS 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8967 - 8971