共 50 条
- [1] Egocentric Audio-Visual Object Localization 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22910 - 22921
- [2] Audio-Visual Event Localization in Unconstrained Videos COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 : 252 - 268
- [3] Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10534 - 10542
- [4] Binaural Audio-Visual Localization THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2961 - 2968
- [5] A JOINT AUDIO-VISUAL APPROACH TO AUDIO LOCALIZATION 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 454 - 458
- [7] Listen to Look Into the Future: Audio-Visual Egocentric Gaze Anticipation COMPUTER VISION - ECCV 2024, PT IX, 2025, 15067 : 192 - 210
- [10] AVQA: A Dataset for Audio-Visual Question Answering on Videos PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3480 - 3491