共 50 条
- [31] Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter INTERSPEECH 2022, 2022, : 3704 - 3708
- [32] Single-speaker/multi-speaker co-channel speech classification 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2322 - 2325
- [33] 3D Audio-Visual Speaker Tracking with A Novel Particle Filter 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7343 - 7348
- [34] 3D AUDIO-VISUAL SPEAKER TRACKING WITH AN ADAPTIVE PARTICLE FILTER 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2896 - 2900
- [36] A MULTI-VIEW APPROACH TO AUDIO-VISUAL SPEAKER VERIFICATION 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6194 - 6198
- [37] A CLOSER LOOK AT AUDIO-VISUAL MULTI-PERSON SPEECH RECOGNITION AND ACTIVE SPEAKER SELECTION 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6863 - 6867
- [38] J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022, 2022-September : 2358 - 2362
- [39] J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis INTERSPEECH 2022, 2022, : 2358 - 2362
- [40] LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus INTERSPEECH 2023, 2023, : 5496 - 5500