50 items in total
- [1] A Multilingual Approach to Scene Text Visual Question Answering. DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237: 65-79
- [3] Towards Reasoning Ability in Scene Text Visual Question Answering. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 2281-2289
- [5] An Empirical Study of Multilingual Scene-Text Visual Question Answering. PROCEEDINGS OF THE 2ND WORKSHOP ON USER-CENTRIC NARRATIVE SUMMARIZATION OF LONG VIDEOS, NARSUM 2023, 2023: 3-8
- [6] Improving visual question answering by combining scene-text information. Multimedia Tools and Applications, 2022, 81: 12177-12208
- [7] Transductive Cross-Lingual Scene-Text Visual Question Answering. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT VI, 2024, 14452: 452-467
- [10] Lightweight Visual Question Answering using Scene Graphs. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021: 3353-3357