Enhancing speaker identification in criminal investigations through clusterization and rank-based scoring

被引：0

作者：

Moura, Antonio Artur ^{[1
]}

Nepomuceno, Napoleao ^{[1
]}

Furtado, Vasco ^{[1
,2
]}

机构：

[1] Univ Fortaleza, Grad Program Appl Informat, Ave Washington Soares 1321, BR-60811905 Fortaleza, Ceara, Brazil

[2] Empresa Tecnol Informacao Ceara, Ave Pontes Vieira 220, BR-60130240 Fortaleza, Ceara, Brazil

来源：

FORENSIC SCIENCE INTERNATIONAL-DIGITAL INVESTIGATION | 2024年 / 49卷

关键词：

Digital forensic; Audio analytics; Speaker recognition;

D O I：

10.1016/j.fsidi.2024.301765

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper introduces an approach that supports speaker identification in criminal investigations, specifically addressing challenges associated with large volumes of audio recordings featuring unknown speaker identities. Our approach clusters related recordings - potentially from the same person - based on representative voice embeddings extracted using the ECAPA-TDNN speaker recognition model. Grouping audio recordings from the same person enhances variability and richness in voice patterns, thereby improving confidence in automatic speaker recognition. We propose a combination of cosine similarity and a rank-based adjustment function to determine matches of audio clusters with individuals in an enrollment database. Our approach was validated through experiments on a Common Voice-based synthesized dataset and a real-life application involving cell phones seized in prisons, which contained thousands of conversational audio recordings. Results demonstrated satisfactory performance and stability, consistently reducing the pool of candidate speakers for subsequent analysis by a human investigator.

引用

页数：13

共 50 条

[21] Reconstruction of Missing Features Based on a Low-Rank Assumption for Robust Speaker Identification
Tzagkarakis, Christos
Mouchtaris, Athanasios
5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, 2014, : 432 - 437
[22] Enhancing GMM speaker identification by incorporating SVM speaker verification for intelligent web-based speech applications
Ding, Ing-Jr
Yen, Chih-Ta
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (14) : 5131 - 5140
[23] Enhancing GMM speaker identification by incorporating SVM speaker verification for intelligent web-based speech applications
Ing-Jr Ding
Chih-Ta Yen
Multimedia Tools and Applications, 2015, 74 : 5131 - 5140
[24] Identification of a rank-based radiomic signature with individualized prognostic value for lung adenocarcinoma in a multi-cohort study
Liu, Yixin
Wang, Zhihui
Yang, Liping
Zhang, Meng
Li, Mengyue
Zhang, Juxuan
Tang, Lefan
Jiang, Zhiyun
Li, Xin
Deng, Jiaxing
Meng, Qingwei
Liu, Shilong
Wang, Kezheng
Qi, Lishuang
EUROPEAN JOURNAL OF RADIOLOGY, 2024, 181
[25] Cross-platform comparison of immune signatures in immunotherapy-treated patients with advanced melanoma using a rank-based scoring approach
Mao Y.
Gide T.N.
Adegoke N.A.
Quek C.
Maher N.
Potter A.
Patrick E.
Saw R.P.M.
Thompson J.F.
Spillane A.J.
Shannon K.F.
Carlino M.S.
Lo S.N.
Menzies A.M.
da Silva I.P.
Long G.V.
Scolyer R.A.
Wilmott J.S.
Journal of Translational Medicine, 21 (1)
[26] Enhancing the Performance of Gaussian Mixture Model-Based Text Independent Speaker Identification
M.A. El-Gamal
M.F. Abu El-Yazeed
M.M.H. El Ayadi
International Journal of Speech Technology, 2005, 8 (1) : 93 - 103
[27] Enhancing the Performance of Gaussian Mixture Model-Based Text Independent Speaker Identification
El-Gamal, M. A.
Abu El-Yazeed, M. F.
El Ayadi, M. M. H.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2005, 8 (01) : 93 - 103
[28] Topic modeling through rank-based aggregation and LLMs: An approach for AI and human-generated scientific texts
Celikten, Tugba
Onan, Aytug
KNOWLEDGE-BASED SYSTEMS, 2025, 314
[29] scDetect: a rank-based ensemble learning algorithm for cell type identification of single-cell RNA sequencing in cancer
Shen, Yifei
Chu, Qinjie
Timko, Michael P.
Fan, Longjiang
BIOINFORMATICS, 2021, 37 (22) : 4115 - 4122
[30] Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement
Ravi, Vijay
Wang, Jinhan
Flint, Jonathan
Alwan, Abeer
COMPUTER SPEECH AND LANGUAGE, 2024, 86

← 1 2 3 4 5 →