共 50 条
- [21] Multi-level, multi-modal interactions for visual question answering over text in images WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (04): : 1607 - 1623
- [22] A Survey of Multi-modal Question Answering Systems for Robotics 2017 2ND INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM), 2017, : 189 - 194
- [23] Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question Answering 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2941 - 2950
- [25] Differentiated Attention with Multi-modal Reasoning for Video Question Answering 2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 525 - 530
- [27] Answer-checking in Context: A Multi-modal Fully Attention Network for Visual Question Answering 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1173 - 1180
- [28] Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1839 - 1848