50 entries in total
- [42] A Vision Enhanced Framework for Indonesian Multimodal Abstractive Text-Image Summarization. Proceedings of the 2024 27th International Conference on Computer Supported Cooperative Work in Design (CSCWD 2024), 2024: 61-66
- [43] Animating Images to Transfer CLIP for Video-Text Retrieval. Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '22), 2022: 1906-1911
- [44] Multilevel Language and Vision Integration for Text-to-Clip Retrieval. Thirty-Third AAAI Conference on Artificial Intelligence / Thirty-First Innovative Applications of Artificial Intelligence Conference / Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, 2019: 9062-9069
- [47] A System of Multimodal Image-Text Retrieval Based on Pre-Trained Models Fusion. Concurrency and Computation-Practice & Experience, 2025, 37(03)
- [50] Multimodal medical image retrieval system. Multimedia Tools and Applications, 2017, 76: 2955-2978