50 entries in total
- [21] CAT: Re-Conv Attention in Transformer for Visual Question Answering 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1471 - 1477
- [23] Visual Question Answering 2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 6 - 10
- [24] MUTANT: A Training Paradigm for Out-of-Distribution Generalization in Visual Question Answering PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 878 - 892
- [25] Question Modifiers in Visual Question Answering LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1472 - 1479
- [26] Surgical-VQA: Visual Question Answering in Surgical Scenes Using Transformer MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VII, 2022, 13437 : 33 - 43
- [27] Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXVIII, 2022, 12267
- [29] ST-VQA: shrinkage transformer with accurate alignment for visual question answering Applied Intelligence, 2023, 53 : 20967 - 20978
- [30] Transformer Gate Attention Model: An Improved Attention Model for Visual Question Answering 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022