50 entries in total
- [41] Fusing Attention with Visual Question Answering. 2017 International Joint Conference on Neural Networks (IJCNN), 2017, pp. 949-953
- [42] Transformer Gate Attention Model: An Improved Attention Model for Visual Question Answering. 2022 International Joint Conference on Neural Networks (IJCNN), 2022
- [44] Language and Visual Relations Encoding for Visual Question Answering. 2019 IEEE International Conference on Image Processing (ICIP), 2019, pp. 3307-3311
- [45] MDAnet: Multiple Fusion Network with Double Attention for Visual Question Answering. ICVIP 2019: Proceedings of 2019 3rd International Conference on Video and Image Processing, 2019, pp. 143-147
- [46] OECA-Net: A co-attention network for visual question answering based on OCR scene text feature enhancement. Multimedia Tools and Applications, 2024, 83: 7085-7096
- [48] Visual Question Answering with Textual Representations for Images. 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW 2021), 2021, pp. 3147-3150
- [49] Focal Visual-Text Attention for Visual Question Answering. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 6135-6143