共 50 条
- [31] Complementary spatiotemporal network for video question answering Multimedia Systems, 2022, 28 : 161 - 169
- [32] Measuring Compositional Consistency for Video Question Answering 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5036 - 5045
- [34] Remember and forget: video and text fusion for video question answering Multimedia Tools and Applications, 2018, 77 : 29269 - 29282
- [35] Video question answering via traffic knowledge database and question classification Multimedia Systems, 2024, 30
- [37] Question Difficulty Estimation with Directional Modality Association in Video Question Answering ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 287 - 299
- [38] Learning Question-Guided Video Representation for Multi-Turn Video Question Answering 20TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2019), 2019, : 215 - 225
- [39] ViLA: Efficient Video-Language Alignment for Video Question Answering COMPUTER VISION - ECCV 2024, PT LXII, 2025, 15120 : 186 - 204
- [40] Knowledge Proxy Intervention for Deconfounded Video Question Answering 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2770 - 2781