共 50 条
- [1] X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
- [2] Multi-event Video-Text Retrieval 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22056 - 22066
- [4] Deep learning for video-text retrieval: a review International Journal of Multimedia Information Retrieval, 2023, 12
- [5] Fine-Grained Cross-Modal Contrast Learning for Video-Text Retrieval ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14866 : 298 - 310
- [7] MGSGA: Multi-grained and Semantic-Guided Alignment for Text-Video Retrieval Neural Processing Letters, 56
- [9] SViTT: Temporal Learning of Sparse Video-Text Transformers 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18919 - 18929
- [10] Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 10635 - 10644