共 50 条
- [1] Visual Hallucinations of Multi-modal Large Language Models FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 9614 - 9631
- [2] Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 5292 - 5305
- [3] Generative Multi-Modal Knowledge Retrieval with Large Language Models THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18733 - 18741
- [6] Multi-modal Prompts with Feature Decoupling for Open-Vocabulary Object Detection GENERALIZING FROM LIMITED RESOURCES IN THE OPEN WORLD, GLOW-IJCAI 2024, 2024, 2160 : 180 - 194
- [7] Multi-modal Queried Object Detection in the Wild ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [8] MOQAGPT: Zero-Shot Multi-modal Open-domain Question Answering with Large Language Models FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 1195 - 1210
- [9] VIAssist: Adapting Multi-modal Large Language Models for Users with Visual Impairments PROCEEDINGS 2024 IEEE INTERNATIONAL WORKSHOP ON FOUNDATION MODELS FOR CYBER-PHYSICAL SYSTEMS & INTERNET OF THINGS, FMSYS 2024, 2024, : 32 - 37
- [10] LaMI: Large Language Models for Multi-Modal Human-Robot Interaction EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,