50 entries in total
- [21] Distinctive feature fusion for improved audio-visual phoneme recognition. ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2005: 62-65
- [22] Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020: 4848-4854
- [23] A JOINT AUDIO-VISUAL APPROACH TO AUDIO LOCALIZATION. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015: 454-458
- [25] Temporal Cross-Modal Attention for Audio-Visual Event Localization. Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2022, 88(03): 263-268
- [26] Masked co-attention model for audio-visual event localization. Applied Intelligence, 2024, 54: 1691-1705