共 50 条
- [31] Joint embeddings with multimodal cues for video-text retrieval International Journal of Multimedia Information Retrieval, 2019, 8 : 3 - 18
- [36] X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4996 - 5005
- [38] Text-video retrieval method based on enhanced self-attention and multi-task learning Multimedia Tools and Applications, 2023, 82 : 24387 - 24406
- [39] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6540 - 6548
- [40] A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4385 - 4394