共 50 条
- [42] Fusing BO and LiDAR for SAR Image Translation with Multi-Modal Generative Adversarial Networks 2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
- [43] TraVL: Transferring Pre-trained Visual-Linguistic Models for Cross-Lingual Image Captioning WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 341 - 355
- [44] Cross-Modal Retrieval Algorithm for Image and Text Based on Pre-Trained Models and Encoders Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (05): : 112 - 117
- [46] TED TALK TEASER GENERATION WITH PRE-TRAINED MODELS 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8067 - 8071
- [47] MaxFusion: Plug&Play Multi-modal Generation in Text-to-Image Diffusion Models COMPUTER VISION-ECCV 2024, PT XXXVIII, 2025, 15096 : 93 - 110
- [50] Multi-modal lung ultrasound image classification by fusing image-based features and probe information 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE 2022), 2022, : 45 - 50