共 41 条
- [1] Revisiting Classifier: Transferring Vision-Language Models for Video Recognition THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2847 - 2855
- [2] Vision-Language Pre-Training for Boosting Scene Text Detectors 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15660 - 15670
- [4] Unified Vision-Language Pre-Training for Image Captioning and VQA THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13041 - 13049
- [5] Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 284 - 302
- [6] BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [10] Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15586 - 15595