50 entries in total
- [41] Deep learning for video-text retrieval: a review. International Journal of Multimedia Information Retrieval, 2023, 12
- [42] Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), 2023: 2930-2940
- [44] Retrieval-augmented Image Captioning. 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), 2023: 3666-3681
- [48] Compositional Learning of Image-Text Query for Image Retrieval. 2021 IEEE Winter Conference on Applications of Computer Vision (WACV 2021), 2021: 1139-1148
- [49] SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models. Computer Vision - ECCV 2024, Pt XLII, 2025, 15100: 330-348
- [50] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023: 6914-6924