The SAIL Speaker Diarization System for Analysis of Spontaneous Meetings

Authors
Han, Kyu J. [1]
Georgiou, Panayiotis G. [1]
Narayanan, Shrikanth S. [1]
Affiliations
[1] Univ So Calif, Viterbi Sch Engn, Ming Hsieh Dept Elect Engn, SAIL, Los Angeles, CA 90089 USA
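Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract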
In this paper, we propose a novel approach to speaker diarization of spontaneous meetings in our own multimodal SmartRoom environment. The proposed system first applies a sequential clustering concept to segment a given audio source, and then performs agglomerative hierarchical clustering to group the resulting speech segments by speaker (speaker clustering). The speaker clustering algorithm uses an incremental Gaussian mixture cluster modeling strategy and a stopping point estimation method based on information change rate. In experiments on various meeting conversation data totaling approximately 200 minutes, the system achieves an average diarization error rate of 18.90%.
Pages: 970-975
Page count: 6
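
Illustrative sketch (not from the paper): the abstract describes a speaker clustering stage that merges speech segments bottom-up and stops at an estimated stopping point. The Python below is a minimal, hedged sketch of that general idea only. It assumes each segment is a list of feature vectors, models each cluster by its centroid rather than the incremental Gaussian mixture models used by the authors, and replaces the information-change-rate stopping estimate with a fixed distance threshold. The names `centroid`, `agglomerative_cluster`, and `stop_threshold` are hypothetical.

```python
from itertools import combinations
from math import dist


def centroid(points):
    """Mean vector of a non-empty list of equal-length feature vectors."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def agglomerative_cluster(segments, stop_threshold):
    """Bottom-up clustering of speech segments (simplified stand-in).

    segments: list of segments, each a list of feature vectors (tuples).
    stop_threshold: assumed fixed distance threshold; it stands in for the
        paper's information-change-rate stopping estimate, which is not
        reproduced here.
    Returns a sorted list of clusters, each a sorted list of segment indices.
    """
    # Start with one cluster per segment: (member indices, pooled vectors).
    clusters = [([i], list(seg)) for i, seg in enumerate(segments)]
    while len(clusters) > 1:
        # Find the closest pair of clusters by centroid distance.
        gap, a, b = min(
            (dist(centroid(clusters[i][1]), centroid(clusters[j][1])), i, j)
            for i, j in combinations(range(len(clusters)), 2)
        )
        # Stop merging once even the closest pair is too far apart.
        if gap > stop_threshold:
            break
        merged = (clusters[a][0] + clusters[b][0],
                  clusters[a][1] + clusters[b][1])
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return sorted(sorted(members) for members, _ in clusters)


# Toy data: two segments near the origin, two near (5, 5).
toy_segments = [
    [(0.0, 0.0), (0.2, 0.0)],
    [(0.1, 0.1)],
    [(5.0, 5.0)],
    [(5.1, 4.9), (4.9, 5.1)],
]
print(agglomerative_cluster(toy_segments, stop_threshold=1.0))
```

On the toy data, a threshold of 1.0 leaves two clusters, one per simulated speaker. In a real system the threshold would be replaced by a data-driven stopping rule, since a fixed value does not transfer across recordings.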