Local Distribution Based Density Clustering for Speaker Diarization

被引:0
|
作者
Rho, Jinsang
Shon, Suwon
Kim, Sung Soo
Lee, Jae-Won
Ko, Hanseok
机构
来源
关键词
Density based clustering; Speaker diarization; DBSCAN; Local density; Over-clustering;
D O I
10.7776/ASK.2015.34.4.303
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speaker diarization is the task of determining the speakers for unlabeled data, and DBSCAN (Density-Based Spatial Clustering of Applications with Noise) has been widely used in the field of speaker diarization for its simplicity and computational efficiency. One challenging issue, however, is that if different clusters in non-spatial dataset are adjacent to each other, over-clustering may occur which subsequently degrades the performance of DBSCAN. In this paper, we identify the drawbacks of DBSCAN and propose a new density clustering algorithm based on local distribution property around object. Variable density criterions for local density and spreadness of object are used for effective data clustering. We compare the proposed algorithm to DBSCAN in terms of clustering accuracy. Experimental results confirm that the proposed algorithm exhibits higher accuracy than DBSCAN without over-clustering and confirm that the new approach based on local density and object spreadness is efficient.
引用
收藏
页码:303 / 309
页数:7
相关论文
共 50 条
  • [1] Active Learning Based Constrained Clustering For Speaker Diarization
    Yu, Chengzhu
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (11) : 2188 - 2198
  • [2] Spectral Clustering Approach to Speaker Diarization
    Ning, Huazhong
    Liu, Ming
    Tang, Hao
    Huang, Thomas
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2178 - 2181
  • [3] LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization
    Lin, Qingjian
    Yin, Ruiqing
    Li, Ming
    Bredin, Herve
    Barras, Claude
    INTERSPEECH 2019, 2019, : 366 - 370
  • [4] PLDA-based Clustering for Speaker Diarization of Broadcast Streams
    Silovsky, Jan
    Prazak, Jan
    Cerva, Petr
    Zdansky, Jindrich
    Nouza, Jan
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2920 - +
  • [5] Clustering Initialization Based on Spatial Information for Speaker Diarization of Meetings
    Luque, J.
    Segura, C.
    Hernando, J.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 383 - 386
  • [6] Prosodic and Phonetic Features for Speaker Clustering in Speaker Diarization Systems
    Zibert, Janez
    Mihelic, France
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1040 - +
  • [7] Discriminative Training for Hierarchical Clustering in Speaker Diarization
    Vinyals, Oriol
    Friedland, Gerald
    Morgan, Nelson
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2326 - +
  • [8] A Comparison of Distance Measures for Clustering in Speaker Diarization
    Niero, Marcelo de Campos
    Veiga Filho, Alvaro de Lima
    Adami, Andre Gustavo
    2014 INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM (ITS), 2014,
  • [9] Combination of agglomerative and sequential clustering for speaker diarization
    Vijayasenan, Deepu
    Valente, Fabio
    Bourlard, Herve
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4361 - +
  • [10] Interrelate Training and Clustering for Online Speaker Diarization
    Chen, Yifan
    Cheng, Gaofeng
    Yang, Runyan
    Zhang, Pengyuan
    Yan, Yonghong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1352 - 1364