共 50 条
- [1] Video-Text Pre-training with Learned Regions for Retrieval THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3100 - 3108
- [2] VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 6787 - 6800
- [3] LocVTP: Video-Text Pre-training for Temporal Localization COMPUTER VISION, ECCV 2022, PT XXVI, 2022, 13686 : 38 - 56
- [4] MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 691 - 708
- [5] Stitching Segments and Sentences towards Generalization in Video-Text Pre-training THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4080 - 4088
- [8] Contrastive Transformer Cross-Modal Hashing for Video-Text Retrieval PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 1227 - 1235
- [10] MimCo: Masked Image Modeling Pre-training with Contrastive Teacher PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4487 - 4495