共 50 条
- [31] Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor INTERSPEECH 2023, 2023, : 3552 - 3556
- [32] Improved Relation Networks for End-to-End Speaker Verification and Identification INTERSPEECH 2022, 2022, : 5085 - 5089
- [33] Investigation of Training Mute-Expressive End-to-End Speech Separation Networks for an Unknown Number of Speakers INTERSPEECH 2023, 2023, : 3764 - 3768
- [34] On the Success and Limitations of Auxiliary Network Based Word-Level End-to-End Neural Speaker Diarization INTERSPEECH 2024, 2024, : 32 - 36
- [35] FRAME-LEVEL SPEAKER EMBEDDINGS FOR TEXT-INDEPENDENT SPEAKER RECOGNITION AND ANALYSIS OF END-TO-END MODEL 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 1007 - 1013
- [36] GENERATIVE ADVERSARIAL SPEAKER EMBEDDING NETWORKS FOR DOMAIN ROBUST END-TO-END SPEAKER VERIFICATION 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6226 - 6230
- [37] Tied Hidden Factors in Neural Networks for End-to-End Speaker Recognition 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2819 - 2823
- [38] Uncertainty-Guided End-to-End Audio-Visual Speaker Diarization for Far-Field Recordings PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4031 - 4041
- [39] SPEAKER-AWARE TRAINING OF ATTENTION-BASED END-TO-END SPEECH RECOGNITION USING NEURAL SPEAKER EMBEDDINGS 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7064 - 7068
- [40] End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors INTERSPEECH 2022, 2022, : 5090 - 5094