50 entries in total
- [32] Task-Oriented Multi-Modal Mutual Learning for Vision-Language Models. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023: 21902-21912
- [33] VEMO: A Versatile Elastic Multi-modal Model for Search-Oriented Multi-task Learning. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608: 56-72
- [34] Corpus Analysis of Spoken Smart-Home Interactions with Older Users. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008: 735-740
- [36] MultiMAE: Multi-modal Multi-task Masked Autoencoders. COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697: 348-367
- [38] Exploiting Multi-Modal Interactions: A Unified Framework. 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009: 1120-1125
- [39] FARMI: A FrAmework for Recording Multi-Modal Interactions. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018: 3969-3974
- [40] Multi-Modal Interactions of Mixed Reality Framework. 17TH IEEE DALLAS CIRCUITS AND SYSTEMS CONFERENCE, DCAS 2024, 2024