50 items in total
- [41] Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023: 396-404
- [42] Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval. ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018: 19-27
- [44] Multi-grained unsupervised evidence retrieval for question answering. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (28): 21247-21257
- [45] Video-Text Pre-training with Learned Regions for Retrieval. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023: 3100-3108
- [46] Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023: 3847-3856
- [47] Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 18962-18972
- [48] Adaptive Token Excitation with Negative Selection for Video-Text Retrieval. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260: 349-361
- [49] Uncertainty-Aware with Negative Samples for Video-Text Retrieval. PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035: 318-332