共 50 条
- [32] Dual-Key Multimodal Backdoors for Visual Question Answering 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15354 - 15364
- [33] FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 1330 - 1350
- [35] Contrastive training of a multimodal encoder for medical visual question answering INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 18
- [36] Multimodal Graph Networks for Compositional Generalization in Visual Question Answering ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [37] Fusion of Detected Objects in Text for Visual Question Answering 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2131 - 2140
- [38] Visual Question Answering based on multimodal triplet knowledge accumuation 2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 81 - 84
- [40] CONTEXT RELATION FUSION MODEL FOR VISUAL QUESTION ANSWERING 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2112 - 2116