共 50 条
- [22] Hierarchical Recurrent Contextual Attention Network for Video Question Answering ARTIFICIAL INTELLIGENCE, CICAI 2022, PT II, 2022, 13605 : 280 - 290
- [25] Multimodal Attention for Visual Question Answering INTELLIGENT COMPUTING, VOL 1, 2019, 858 : 783 - 792
- [26] MIMOQA: Multimodal Input Multimodal Output Question Answering 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5317 - 5332
- [28] Video Graph Transformer for Video Question Answering COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 39 - 58
- [29] Video Reference: A Video Question Answering Engine ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 799 - +
- [30] Hypergraph Convolutional Network for Multi-Hop Knowledge Base Question Answering (Student Abstract) THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13801 - 13802