50 entries in total
- [42] Dynamic interactive learning network for audio-visual event localization Applied Intelligence, 2023, 53 : 30431 - 30442
- [43] Probabilistic speaker localization in noisy environments by audio-visual integration 2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 4704 - +
- [45] Audio-Visual Clustering for 3D Speaker Localization MACHINE LEARNING FOR MULTIMODAL INTERACTION, PROCEEDINGS, 2008, 5237 : 86 - 97
- [47] Learning Event-Specific Localization Preferences for Audio-Visual Event Localization PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3446 - 3454
- [48] Self-Supervised Audio-Visual Representation Learning for in-the-wild Videos 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5671 - 5672
- [49] Audio-Visual Model for Generating Eating Sounds Using Food ASMR Videos IEEE ACCESS, 2021, 9 : 50106 - 50111
- [50] TIME-DOMAIN AUDIO-VISUAL SPEECH SEPARATION ON LOW QUALITY VIDEOS 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 256 - 260