50 entries in total
- [41] Bridging the Gap between Vision and Language Domains for Improved Image Captioning. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020: 4153-4161
- [42] Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
- [43] Scaling Data Generation in Vision-and-Language Navigation. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023: 11975-11986
- [44] AerialVLN: Vision-and-Language Navigation for UAVs. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023: 15338-15348
- [45] Language Features Matter: Effective Language Representations for Vision-Language Tasks. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 7473-7482
- [46] Vision-and-Language Navigation via Causal Learning. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024: 13139-13150
- [47] Image Captioning with Pretrained Language Generators. CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021: 427
- [48] Unpaired Image Captioning by Language Pivoting. COMPUTER VISION - ECCV 2018, PT I, 2018, 11205: 519-535
- [49] VL-ADAPTER: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 5217-5227