共 50 条
- [32] RSAdapter: Adapting Multimodal Models for Remote Sensing Visual Question Answering IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
- [33] Multimodal Cross-guided Attention Networks for Visual Question Answering PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTER MODELING, SIMULATION AND ALGORITHM (CMSA 2018), 2018, 151 : 347 - 353
- [34] Multimodal Graph Transformer for Multimodal Question Answering 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 189 - 200
- [35] Multimodal Graph Transformer for Multimodal Question Answering EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, 2023, : 189 - 200
- [37] Visual Question Answering 2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 6 - 10
- [38] Dual-Branch Collaborative Learning for Visual Question Answering ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 : 96 - 107
- [40] Modular dual-stream visual fusion network for visual question answering VISUAL COMPUTER, 2025, 41 (01): : 549 - 562