共 50 条
- [21] Multi-Modal Prompting for Open-Vocabulary Video Visual Relationship Detection THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6513 - 6521
- [22] VTMF2N: Towards Accurate Visual-Tactile Slip Detection via Multi-modal Feature Fusion in Robotic Grasping PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 103 - 117
- [23] Visual Relation Extraction via Multi-modal Translation Embedding Based Model ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT I, 2018, 10937 : 538 - 548
- [25] Lightweight video salient object detection via channel-shuffle enhanced multi-modal fusion network Multimedia Tools and Applications, 2024, 83 : 1025 - 1039
- [26] Multi-modal Gait Recognition via Effective Spatial-Temporal Feature Fusion 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17949 - 17957
- [29] Multi-modal voice pathology detection architecture based on deep and handcrafted feature fusion ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2022, 36
- [30] Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 691 - 707