50 records in total
- [21] Discovering the Real Association: Multimodal Causal Reasoning in Video Question Answering. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023: 19027-19036
- [23] Dynamic Spatio-Temporal Modular Network for Video Question Answering. Proceedings of the 30th ACM International Conference on Multimedia (MM 2022), 2022: 4466-4477
- [24] From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022: 21241-21250
- [25] Tree-of-Reasoning Question Decomposition for Complex Question Answering with Large Language Models. Thirty-Eighth AAAI Conference on Artificial Intelligence, Vol. 38, No. 17, 2024: 19560-19568
- [28] HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering. 2021 IEEE/CVF International Conference on Computer Vision (ICCV 2021), 2021: 1678-1687
- [30] Chain of Reasoning for Visual Question Answering. Advances in Neural Information Processing Systems 31 (NIPS 2018), 2018, 31