Enhancing Speaker Diarization with Deep Neural Network Embeddings and Spectral Clustering

Cited by: 0
Authors
Yanshan University, China [1]
Institution
Source
Keywords
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Deep neural networks - Network embeddings
Related papers
50 in total
  • [41] Deep Neural Network Embeddings with Gating Mechanisms for Text-Independent Speaker Verification
    You, Lanhua
    Guo, Wu
    Dai, Li-Rong
    Du, Jun
    INTERSPEECH 2019, 2019, : 1168 - 1172
  • [42] Prosodic and Phonetic Features for Speaker Clustering in Speaker Diarization Systems
    Zibert, Janez
    Mihelic, France
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1040 - +
  • [43] Discriminative Training for Hierarchical Clustering in Speaker Diarization
    Vinyals, Oriol
    Friedland, Gerald
    Morgan, Nelson
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2326 - +
  • [44] Speech refinement using Bi-LSTM and improved spectral clustering in speaker diarization
    Gupta, Aishwarya
    Purwar, Archana
    Multimedia Tools and Applications, 2024, 83 : 54433 - 54448
  • [45] Online Neural Speaker Diarization With Target Speaker Tracking
    Wang, Weiqing
    Li, Ming
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 5078 - 5091
  • [46] A Triplet Ranking-based Neural Network for Speaker Diarization and Linking
    Le Lan, Gael
    Charlet, Delphine
    Larcher, Anthony
    Meignier, Sylvain
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3572 - 3576
  • [47] Speech refinement using Bi-LSTM and improved spectral clustering in speaker diarization
    Gupta, Aishwarya
    Purwar, Archana
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54433 - 54448
  • [48] Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap
    Park, Tae Jin
    Han, Kyu J.
    Kumar, Manoj
    Narayanan, Shrikanth
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 381 - 385
  • [49] ATTENTION-BASED NEURAL NETWORK FOR JOINT DIARIZATION AND SPEAKER EXTRACTION
    Chazan, Shlomo E.
    Gannot, Sharon
    Goldberger, Jacob
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 301 - 305
  • [50] A Comparison of Distance Measures for Clustering in Speaker Diarization
    Niero, Marcelo de Campos
    Veiga Filho, Alvaro de Lima
    Adami, Andre Gustavo
    2014 INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM (ITS), 2014,