共 50 条
- [31] Multimodal Pre-training Method for Vision-language Understanding and Generation Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): : 2024 - 2034
- [32] Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5038 - 5047
- [34] Knowledge Boosting: Rethinking Medical Contrastive Vision-Language Pre-training MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT I, 2023, 14220 : 405 - 415
- [35] Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [36] Vision-Language Pre-Training: Basics, Recent Advances, and Future Trends FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2022, 14 (3-4): : 163 - 352
- [37] Position-guided Text Prompt for Vision-Language Pre-training 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23242 - 23251
- [38] Kaleido-BERT: Vision-Language Pre-training on Fashion Domain 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12642 - 12652
- [40] Subsampling of Frequent Words in Text for Pre-training a Vision-Language Model PROCEEDINGS OF THE 1ST WORKSHOP ON LARGE GENERATIVE MODELS MEET MULTIMODAL APPLICATIONS, LGM3A 2023, 2023, : 61 - 67