共 50 条
- [31] VLMO: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [32] Pre-training for Spoken Language Understanding with Joint Textual and Phonetic Representation Learning INTERSPEECH 2021, 2021, : 1244 - 1248
- [33] Survey on Vision-language Pre-training Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): : 2000 - 2023
- [34] Speech Model Pre-training for End-to-End Spoken Language Understanding INTERSPEECH 2019, 2019, : 814 - 818
- [35] Pre-training Language Models for Comparative Reasoning 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 12421 - 12433
- [36] Sigmoid Loss for Language Image Pre-Training 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11941 - 11952
- [37] MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6078 - 6087
- [38] Grounded Language-Image Pre-training 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10955 - 10965
- [39] VILA: On Pre-training for Visual Language Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 26679 - 26689
- [40] RELATION ENHANCED VISION LANGUAGE PRE-TRAINING 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2286 - 2290