50 entries in total
- [44] AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. INTERSPEECH 2021, 2021: 1584-1588
- [45] Spherical World-Locking for Audio-Visual Localization in Egocentric Videos. COMPUTER VISION - ECCV 2024, PT XXIV, 2025, 15082: 256-274
- [48] Neural responses to sounds presented on and off the beat of ecologically valid music. FRONTIERS IN SYSTEMS NEUROSCIENCE, 2013, 7
- [49] SEGMENTATION OF MUSIC VIDEO STREAMS IN MUSIC PIECES THROUGH AUDIO-VISUAL ANALYSIS. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014