共 50 条
- [31] Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10534 - 10542
- [32] Audio-visual speaker identification based on the use of dynamic audio and visual features AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 743 - 751
- [33] Audio-visual integration of emotional cues in song COGNITION & EMOTION, 2008, 22 (08) : 1457 - 1470
- [35] Audio-visual Cues for Cloud Service Monitoring CLOSER: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, 2017, : 439 - 446
- [36] A Visual Signal Reliability for Robust Audio-Visual Speaker Identification IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (10): : 2052 - 2055
- [37] Multimodal SpeakerBeam: Single channel target speech extraction with audio-visual speaker clues INTERSPEECH 2019, 2019, : 2718 - 2722
- [38] BEST OF BOTH WORLDS: MULTI-TASK AUDIO-VISUAL AUTOMATIC SPEECH RECOGNITION AND ACTIVE SPEAKER DETECTION 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6047 - 6051
- [39] Particle Filtering for Bearing-Only Audio-Visual Speaker Detection and Tracking 2009 3RD INTERNATIONAL CONFERENCE ON SIGNALS, CIRCUITS AND SYSTEMS (SCS 2009), 2009, : 161 - +
- [40] E-Talk: Accelerating Active Speaker Detection with Audio-Visual Fusion and Edge-Cloud Computing 2023 20TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING, SECON, 2023,