- [32] CMU-MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020: 1801 - 1812
- [33] Benchmarking Large Language Models on CFLUE - A Chinese Financial Language Understanding Evaluation Dataset FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024: 5673 - 5693
- [34] MultiModal Language Modelling on Knowledge Graphs for Deep Video Understanding PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 4868 - 4872
- [35] Improving Vision and Language Concepts Understanding with Multimodal Counterfactual Samples COMPUTER VISION - ECCV 2024, PT LXIX, 2025, 15127: 174 - 191
- [36] Multimodal Analysis for Deep Video Understanding with Video Language Transformer PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022: 7165 - 7169
- [37] Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (08): 7094 - 7101
- [38] Popular Hooks: A Multimodal Dataset of Musical Hooks for Music Understanding and Generation 2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024