共 50 条
- [34] Learning Representations from Audio-Visual Spatial Alignment ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [36] AUDIO-VISUAL SPEECH RECOGNITION WITH A HYBRID CTC/ATTENTION ARCHITECTURE 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 513 - 520
- [39] Selective Attention Modulates the Direction of Audio-Visual Temporal Recalibration PLOS ONE, 2014, 9 (07):