50 entries in total
- [11] Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing. IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXVIII, 2022, 12267
- [13] Cross-Modal Multistep Fusion Network With Co-Attention for Visual Question Answering. IEEE ACCESS, 2018, 6: 31516-31524
- [14] Human Guided Cross-Modal Reasoning With Semantic Attention Learning for Visual Question Answering. 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024: 2775-2779
- [15] Cross-Modal Feature Distribution Calibration for Few-Shot Visual Question Answering. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024: 7151-7159
- [18] Visual Question Answering From Remote Sensing Images. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019: 4951-4954
- [19] Language Transformers for Remote Sensing Visual Question Answering. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022: 4855-4858
- [20] Multistep Question-Driven Visual Question Answering for Remote Sensing. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61