共 48 条
- [1] Scene Text Visual Question Answering 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4290 - 4300
- [2] Compositional Substitutivity of Visual Reasoning for Visual Question Answering COMPUTER VISION - ECCV 2024, PT XLVIII, 2025, 15106 : 143 - 160
- [3] Maintaining Reasoning Consistency in Compositional Visual Question Answering 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5089 - 5098
- [4] A Multilingual Approach to Scene Text Visual Question Answering DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 65 - 79
- [5] Lightweight Visual Question Answering using Scene Graphs PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3353 - 3357
- [7] Visual Causal Scene Refinement for Video Question Answering PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 377 - 386
- [8] Multimodal Graph Networks for Compositional Generalization in Visual Question Answering ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [10] Towards Reasoning Ability in Scene Text Visual Question Answering PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2281 - 2289