共 50 条
- [21] VTLayout: A Multi-Modal Approach for Video Text Layout PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2775 - 2784
- [22] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6565 - 6574
- [23] KnowER: Knowledge enhancement for efficient text-video retrieval Intelligent and Converged Networks, 2023, 4 (02): : 93 - 105
- [24] TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11563 - 11573
- [25] UATVR: Uncertainty-Adaptive Text-Video Retrieval 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13677 - 13687
- [26] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2470 - 2481
- [28] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 970 - 981
- [30] Everything at Once - Multi-modal Fusion Transformer for Video Retrieval 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19988 - 19997