50 items in total
- [21] Video Question Answering with Iterative Video-Text Co-tokenization. COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696: 76-94
- [22] Bridging Video-text Retrieval with Multiple Choice Questions. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 16146-16155
- [23] Video text extraction from images for character recognition. 2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006: 95+
- [26] SViTT: Temporal Learning of Sparse Video-Text Transformers. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 18919-18929
- [27] HANet: Hierarchical Alignment Networks for Video-Text Retrieval. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 3518-3527
- [28] Dual Encoding Integrating Key Frame Extraction for Video-text Cross-modal Entity Resolution. Binggong Xuebao/Acta Armamentarii, 2022, 43(05): 1107-1116
- [29] Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
- [30] Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 18962-18972