共 50 条
- [21] Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2634 - 2645
- [22] ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3208 - 3216
- [23] Learning the Visualness of Text Using Large Vision-Language Models 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 2394 - 2408
- [27] VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [28] Fast Certification of Vision-Language Models Using Incremental Randomized Smoothing IEEE CONFERENCE ON SAFE AND TRUSTWORTHY MACHINE LEARNING, SATML 2024, 2024, : 252 - 271
- [29] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 3, 2024, : 1932 - 1940