50 items in total
- [1] Local self-attention in transformer for visual question answering. Applied Intelligence, 2023, 53: 16706-16723
- [2] Stacked Self-Attention Networks for Visual Question Answering. ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019: 207-211
- [5] Intra-Modality Feature Interaction Using Self-attention for Visual Question Answering. NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V, 2019, 1143: 215-222
- [7] TRAR: Routing the Attention Spans in Transformer for Visual Question Answering. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 2054-2064
- [8] Multi-page Document Visual Question Answering Using Self-attention Scoring Mechanism. DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT VI, 2024, 14809: 219-232
- [9] SAFFNet: self-attention based on Fourier frequency domain filter network for visual question answering. VISUAL COMPUTER, 2025
- [10] Transformer Gate Attention Model: An Improved Attention Model for Visual Question Answering. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022