Enhancing Speaker Diarization with Deep Neural Network Embeddings and Spectral Clustering

Cited by: 0
Authors
Yanshan University, China [1]
Institution
Source
Keywords
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
DOI
Not available
CLC number
Subject classification code
Abstract
Deep neural networks - Network embeddings
Related papers
50 in total
  • [31] Similarity Measurement of Segment-Level Speaker Embeddings in Speaker Diarization
    Wang, Weiqing
    Lin, Qingjian
    Cai, Danwei
    Li, Ming
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2645 - 2658
  • [32] An Attention-based Neural Network on Multiple Speaker Diarization
    Cheng, Shao Wen
    Hung, Kai Jyun
    Chang, Hsie Chia
    Liao, Yen Chin
    2022 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2022): INTELLIGENT TECHNOLOGY IN THE POST-PANDEMIC ERA, 2022 : 431 - 434
  • [33] Optimal trained artificial neural network for Telugu speaker diarization
    Sethuram, V.
    Prasad, Ande
    Rao, R. Rajeshwara
    Evolutionary Intelligence, 2020, 13 : 631 - 648
  • [34] Speaker Diarization using Leave-one-out Gaussian PLDA Clustering of DNN Embeddings
    McCree, Alan
    Sell, Gregory
    Garcia-Romero, Daniel
    INTERSPEECH 2019, 2019 : 381 - 385
  • [35] Metaheuristic adapted convolutional neural network for Telugu speaker diarization
    Sethuram, V
    Prasad, Ande
    Rao, R. Rajeswara
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2021, 15 (04) : 561 - 577
  • [36] Optimal trained artificial neural network for Telugu speaker diarization
    Sethuram, V.
    Prasad, Ande
    Rao, R. Rajeshwara
    EVOLUTIONARY INTELLIGENCE, 2020, 13 (04) : 631 - 648
  • [37] SPEAKER DIARIZATION USING LATENT SPACE CLUSTERING IN GENERATIVE ADVERSARIAL NETWORK
    Pal, Monisankha
    Kumar, Manoj
    Peri, Raghuveer
    Park, Tae Jin
    Kim, So Hyun
    Lord, Catherine
    Bishop, Somer
    Narayanan, Shrikanth
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020 : 6504 - 6508
  • [38] SPEAKER EMBEDDINGS FOR DIARIZATION OF BROADCAST DATA IN THE ALLIES CHALLENGE
    Larcher, Anthony
    Mehrish, Ambuj
    Tahon, Marie
    Meignier, Sylvain
    Carrive, Jean
    Doukhan, David
    Galibert, Olivier
    Evans, Nicholas
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021 : 5799 - 5803
  • [39] Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition
    Xiang, Xu
    Wang, Shuai
    Huang, Houjun
    Qian, Yanmin
    Yu, Kai
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019 : 1652 - 1656
  • [40] Optimized Deep Embedded Clustering-Based Speaker Diarization with Speech Enhancement
    Revathy, S. Merlin
    Kumar, S. S.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025