50 entries in total
- [42] VCoder: Versatile Vision Encoders for Multimodal Large Language Models. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024: 27992 - 28002
- [43] Multimodal large language models for inclusive collaboration learning tasks. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022: 202 - 210
- [46] Exploring the Transferability of Visual Prompting for Multimodal Large Language Models. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024: 26552 - 26562
- [47] Enhancing Urban Walkability Assessment with Multimodal Large Language Models. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS-ICCSA 2024 WORKSHOPS, PT V, 2024, 14819 : 394 - 411
- [49] UniCode: Learning a Unified Codebook for Multimodal Large Language Models. COMPUTER VISION - ECCV 2024, PT VIII, 2025, 15066 : 426 - 443
- [50] QueryMintAI: Multipurpose Multimodal Large Language Models for Personal Data. IEEE ACCESS, 2024, 12 : 144631 - 144651