共 50 条
- [21] Multimodal Speech Recognition for Language-Guided Embodied Agents INTERSPEECH 2023, 2023, : 1608 - 1612
- [22] ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-Guided Optimization THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5390 - 5400
- [23] Learning by Planning: Language-Guided Global Image Editing 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13585 - 13594
- [24] A Simple Recipe for Language-guided Domain Generalized Segmentation 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 23428 - 23437
- [25] Enhancing Visual Continual Learning with Language-Guided Supervision 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 24068 - 24077
- [26] Learning Visual Representations via Language-Guided Sampling 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19208 - 19220
- [27] Language-guided Human Motion Synthesis with Atomic Actions PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5262 - 5271
- [28] LANGUAGE-GUIDED ZERO-SHOT OBJECT COUNTING 2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024,
- [29] Video Clip Growth: A General Algorithm for Multi-view Video Summarization ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 112 - 122