Post-processing techniques for a speaker diarization system

被引：0

作者：

Tavarez, David ^{[1
]}

Navas, Eva ^{[1
]}

Erro, Daniel ^{[1
]}

Saratxaga, Ibon ^{[1
]}

Hernaez, Inma ^{[1
]}

机构：

[1] Univ Basque Country, Alda Urquijo S N, Bilbao, Spain

来源：

PROCESAMIENTO DEL LENGUAJE NATURAL | 2012年 / 49期

关键词：

Speaker diarization; segmentation; rich transcription;

D O I：

暂无

中图分类号：

H0 [语言学];

学科分类号：

030303 ; 0501 ; 050102 ;

摘要：

This paper presents the post-processing techniques designed to improve the results of a speaker diarization system. Three different techniques are proposed: refinement of speech vs. non speech segmentation, assimilation of short speech segments and fusion of clusters from the same speaker. These techniques have been implemented in a post-processing module that improves the result of the baseline system by 22.3 %. The same module has been applied to another speaker diarization system with a similar architecture to that of the baseline system with a DER improvement of 21 % and to another one with a very different architecture where no improvement has been achieved. It has also been used with another database with an improvement of 17 %. These experiments prove the validity of the techniques developed.

引用

页码：109 / 115

页数：7

共 50 条

[31] System output combination for improved speaker diarization
Bozonnet, Simon
Evans, Nicholas
Anguera, Xavier
Vinyals, Oriol
Friedland, Gerald
Fredouille, Corinne
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2650 - +
[32] NeMo Open Source Speaker Diarization System
Park, Tae Jin
Koluguri, Nithin Rao
Jia, Fei
Balam, Jagadeesh
Ginsburg, Boris
INTERSPEECH 2022, 2022, : 853 - 854
[33] Speech Enhancement for Multimodal Speaker Diarization System
Ahmad, Rehan
Zubair, Syed
Alquhayz, Hani
IEEE ACCESS, 2020, 8 : 126671 - 126680
[34] IMPROVED BINARY KEY SPEAKER DIARIZATION SYSTEM
Delgado, Hector
Anguera, Xavier
Fredouille, Corinne
Serrano, Javier
2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2087 - 2091
[35] A Cluster Purification Algorithm for Speaker Diarization System
Xiang, Zhang
2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2, 2014,
[36] Developing On-Line Speaker Diarization System
Dimitriadis, Dimitrios
Fousek, Petr
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2739 - 2743
[37] The X-Lance Speaker Diarization System for the Conversational Short-phrase Speaker Diarization Challenge 2022
Liu, Tao
Xiang, Xu
Chen, Zhengyang
Han, Bing
Yu, Kai
Qian, Yanmin
2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 498 - 501
[38] The BUCEA Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2022
Zhou, Ruohua
Du, Yuxuan
Hu, Chenlei
arXiv, 2022,
[39] MICROSOFT SPEAKER DIARIZATION SYSTEM FOR THE VOXCELEB SPEAKER RECOGNITION CHALLENGE 2020
Xiao, Xiong
Kanda, Naoyuki
Chen, Zhuo
Zhou, Tianyan
Yoshioka, Takuya
Chen, Sanyuan
Zhao, Yong
Liu, Gang
Wu, Yu
Wu, Jian
Liu, Shujie
Li, Jinyu
Gong, Yifan
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5824 - 5828
[40] Image filtering techniques for medical image post-processing: an overview
Behrenbruch, CP
Petroudi, S
Bond, S
DeClerck, JD
Leong, FJ
Brady, JM
BRITISH JOURNAL OF RADIOLOGY, 2004, 77 : S126 - S132

← 1 2 3 4 5 →