Speaker diarization: Towards a more robust and portable system

Cited by: 0

Authors:

El Khoury, Elie ^{[1]}

Senac, Christine ^{[1]}

Andre-Obrecht, Regine ^{[1]}

Affiliations:

[1] CNRS, UMR 5505, IRIT, SAMoVA Team, Toulouse, France

Source:

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3 | 2007

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

In this paper, we describe a new method for speaker segmentation and clustering of an audio document. For the segmentation phase, we combine the Generalized Likelihood Ratio (GLR) and the Bayesian Information Criterion (BIC) in a way that avoids most parameter tuning. For the clustering phase, we use an existing approach that applies the Eigen Vector Space Model (EVSM) with bottom-up hierarchical grouping, which we improve by introducing prosodic information. Evaluation is done on the audio database of the ESTER evaluation campaign for the rich transcription of French broadcast news. Results show that our method, which operates without any a priori knowledge about speakers, is suitable for speaker diarization, as it outperforms the traditional methods with an overall Diarization Error Rate (DER) of 16.72%.


Pages: 489 / +

Page count: 2

50 items in total

[21] Robust acoustic domain identification with its application to speaker diarization
Kumar A.K.
Waldekar S.
Sahidullah M.
Saha G.
International Journal of Speech Technology, 2022, 25 (04) : 933 - 945
[22] NeMo Open Source Speaker Diarization System
Park, Tae Jin
Koluguri, Nithin Rao
Jia, Fei
Balam, Jagadeesh
Ginsburg, Boris
INTERSPEECH 2022, 2022, : 853 - 854
[23] Robust Statistical Processing of TDOA Estimates for Distant Speaker Diarization
Parada, Pablo Peso
Sharma, Dushyant
van Waterschoot, Toon
Naylor, Patrick A.
2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 86 - 90
[24] Speech Enhancement for Multimodal Speaker Diarization System
Ahmad, Rehan
Zubair, Syed
Alquhayz, Hani
IEEE ACCESS, 2020, 8 : 126671 - 126680
[25] IMPROVED BINARY KEY SPEAKER DIARIZATION SYSTEM
Delgado, Hector
Anguera, Xavier
Fredouille, Corinne
Serrano, Javier
2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2087 - 2091
[26] A Cluster Purification Algorithm for Speaker Diarization System
Xiang, Zhang
2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2, 2014,
[27] Developing On-Line Speaker Diarization System
Dimitriadis, Dimitrios
Fousek, Petr
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2739 - 2743
[28] The X-Lance Speaker Diarization System for the Conversational Short-phrase Speaker Diarization Challenge 2022
Liu, Tao
Xiang, Xu
Chen, Zhengyang
Han, Bing
Yu, Kai
Qian, Yanmin
2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 498 - 501
[29] The BUCEA Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2022
Zhou, Ruohua
Du, Yuxuan
Hu, Chenlei
arXiv, 2022,
[30] MICROSOFT SPEAKER DIARIZATION SYSTEM FOR THE VOXCELEB SPEAKER RECOGNITION CHALLENGE 2020
Xiao, Xiong
Kanda, Naoyuki
Chen, Zhuo
Zhou, Tianyan
Yoshioka, Takuya
Chen, Sanyuan
Zhao, Yong
Liu, Gang
Wu, Yu
Wu, Jian
Liu, Shujie
Li, Jinyu
Gong, Yifan
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5824 - 5828
