Post-processing techniques for a speaker diarization system

被引:0
|
作者
Tavarez, David [1 ]
Navas, Eva [1 ]
Erro, Daniel [1 ]
Saratxaga, Ibon [1 ]
Hernaez, Inma [1 ]
机构
[1] Univ Basque Country, Alda Urquijo S N, Bilbao, Spain
来源
关键词
Speaker diarization; segmentation; rich transcription;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper presents the post-processing techniques designed to improve the results of a speaker diarization system. Three different techniques are proposed: refinement of speech vs. non speech segmentation, assimilation of short speech segments and fusion of clusters from the same speaker. These techniques have been implemented in a post-processing module that improves the result of the baseline system by 22.3 %. The same module has been applied to another speaker diarization system with a similar architecture to that of the baseline system with a DER improvement of 21 % and to another one with a very different architecture where no improvement has been achieved. It has also been used with another database with an improvement of 17 %. These experiments prove the validity of the techniques developed.
引用
收藏
页码:109 / 115
页数:7
相关论文
共 50 条
  • [1] PURE SEGMENT SELECTION AS SPEAKER DIARIZATION POST-PROCESSING
    Ben-Harush, Oshry
    Guterman, Hugo
    Lapidot, Itshak
    2008 IEEE 25TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, VOLS 1 AND 2, 2008, : 461 - +
  • [2] END-TO-END SPEAKER DIARIZATION AS POST-PROCESSING
    Horiguchi, Shota
    Garcia, Paola
    Fujita, Yusuke
    Watanabe, Shinji
    Nagamatsu, Kenji
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7188 - 7192
  • [3] IMPROVING SEPARATION-BASED SPEAKER DIARIZATION VIA ITERATIVE MODEL REFINEMENT AND SPEAKER EMBEDDING BASED POST-PROCESSING
    Niu, Shu-Tong
    Du, Jun
    Sun, Lei
    Lee, Chin-Hui
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8387 - 8391
  • [4] Improvement of a speaker authentication system through MLP's post-processing
    Rodríguez-Liñares, L
    García-Mateo, C
    Alba-Castro, JL
    NEURAL NETWORKS FOR SIGNAL PROCESSING XI, 2001, : 461 - 470
  • [5] Introduction to post-processing techniques
    Jiru, Filip
    EUROPEAN JOURNAL OF RADIOLOGY, 2008, 67 (02) : 202 - 217
  • [6] X-Vector-Based Speaker Diarization Using Bi-LSTM and Interim Voting-Driven Post-processing
    Mala, J. B.
    Raj, S. M. Alex
    Rajan, Rajeev
    TEXT, SPEECH, AND DIALOGUE, TSD 2024, PT II, 2024, 15049 : 161 - 173
  • [7] POST-PROCESSING TECHNIQUES FOR RADIOMETRIC IMAGES
    Siegenthaler, Stefan
    Canavero, Marco
    Murk, Axel
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 2316 - 2319
  • [8] An Improved Speaker Diarization System
    Fu, Rong
    Benest, Ian D.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1253 - 1256
  • [9] Benchmarking Post-processing Techniques for Offline Arabic Text Recognition System
    Jemni, Sana Khamekhem
    Kesentini, Yousri
    Kanoun, Slim
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS 2016), 2017, 552 : 267 - 277
  • [10] Speaker identification by anchor models with PCA/LDA post-processing
    Mami, Y
    Charlet, D
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 180 - 183