共 50 条
- [31] Self-Supervised Object Detection from Egocentric Videos 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5202 - 5214
- [33] Self-Supervised Visual Descriptor Learning for Dense Correspondence IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (02): : 420 - 427
- [34] Induction Network: Audio-Visual Modality Gap-Bridging for Self-Supervised Sound Source Localization PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4042 - 4052
- [35] Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1456 - 1463
- [36] Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3884 - 3892
- [37] Self-Supervised Audio-Visual Feature Learning for Single-Modal Incremental Terrain Type Clustering IEEE ACCESS, 2021, 9 : 64346 - 64357
- [38] Object category detection using audio-visual cues COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 539 - 548
- [39] Temporal structure and complexity affect audio-visual correspondence detection FRONTIERS IN PSYCHOLOGY, 2013, 3
- [40] Object Detection with Self-Supervised Scene Adaptation 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21589 - 21599