50 entries in total
- [11] Progressive Graph Attention Network for Video Question Answering. Proceedings of the 29th ACM International Conference on Multimedia (MM 2021), 2021: 2871-2879
- [12] Discovering the Real Association: Multimodal Causal Reasoning in Video Question Answering. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023: 19027-19036
- [13] Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), 2019: 1999-2007
- [14] Video Question Answering Using a Forget Memory Network. Computer Vision, Pt I, 2017, 771: 404-415
- [15] Improving Visual Question Answering by Multimodal Gate Fusion Network. 2023 International Joint Conference on Neural Networks (IJCNN), 2023
- [16] Multimodal Graph Transformer for Multimodal Question Answering. 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), 2023: 189-200
- [19] Question-Aware Tube-Switch Network for Video Question Answering. Proceedings of the 27th ACM International Conference on Multimedia (MM'19), 2019: 1184-1192
- [20] Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer. Thirty-Seventh AAAI Conference on Artificial Intelligence, Vol. 37 No. 2, 2023: 2038-2046