共 50 条
- [11] Multi-modal co-attention relation networks for visual question answering The Visual Computer, 2023, 39 : 5783 - 5795
- [12] Multi-modal co-attention relation networks for visual question answering VISUAL COMPUTER, 2023, 39 (11): : 5783 - 5795
- [14] MoQA - A Multi-modal Question Answering Architecture COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 106 - 113
- [15] Text-Guided Object Detector for Multi-modal Video Question Answering 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 1032 - 1042
- [16] Gated Multi-modal Fusion with Cross-modal Contrastive Learning for Video Question Answering ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 427 - 438
- [18] A Survey of Multi-modal Question Answering Systems for Robotics 2017 2ND INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM), 2017, : 189 - 194
- [19] Multi-Modal Correlated Network with Emotional Reasoning Knowledge for Social Intelligence Question-Answering 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3067 - 3073
- [20] The Multi-Modal Video Reasoning and Analyzing Competition 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 806 - 813