共 50 条
- [31] SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models COMPUTER VISION - ECCV 2024, PT LXII, 2025, 15120 : 36 - 55
- [32] MMA: Multi-Modal Adapter for Vision-Language Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 23826 - +
- [33] Incorporating Concreteness in Multi-Modal Language Models with Curriculum Learning APPLIED SCIENCES-BASEL, 2021, 11 (17):
- [34] An Interactive Multi-modal Query Answering System with Retrieval-Augmented Large Language Models PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (12): : 4333 - 4336
- [35] Scene-adaptive and Region-aware Multi-modal Prompt for Open Vocabulary Object Detection 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16741 - 16750
- [36] Open-Set Semi-Supervised Object Detection COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 143 - 159
- [37] Real-time dense small object detection algorithm based on multi-modal tea shoots FRONTIERS IN PLANT SCIENCE, 2023, 14
- [38] Multi-task Multi-modal Models for Collective Anomaly Detection 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 177 - 186
- [39] Multi-modal object detection and localization for high integrity driving assistance Machine Vision and Applications, 2014, 25 : 583 - 598