共 50 条
- [31] NuScenes-QA: A Multi-Modal Visual Question Answering Benchmark for Autonomous Driving Scenario THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4542 - 4550
- [33] Advancing Video Question Answering with a Multi-modal and Multi-layer Question Enhancement Network PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3985 - 3993
- [34] RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 547 - 556
- [35] Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 169 - 176
- [36] K-armed Bandit based Multi-modal Network Architecture Search for Visual Question Answering MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1245 - 1254
- [37] TASK-ORIENTED MULTI-MODAL QUESTION ANSWERING FOR COLLABORATIVE APPLICATIONS 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1426 - 1430
- [38] MMTF: Multi-Modal Temporal Fusion for Commonsense Video Question Answering 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4659 - 4664
- [39] Multi-modal Question Answering System Driven by Domain Knowledge Graph 5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM 2019), 2019, : 43 - 47
- [40] Multi-Modal Knowledge-Aware Attention Network for Question Answering Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (05): : 1037 - 1045