Enhancing speaker identification in criminal investigations through clusterization and rank-based scoring

被引:0
|
作者
Moura, Antonio Artur [1 ]
Nepomuceno, Napoleao [1 ]
Furtado, Vasco [1 ,2 ]
机构
[1] Univ Fortaleza, Grad Program Appl Informat, Ave Washington Soares 1321, BR-60811905 Fortaleza, Ceara, Brazil
[2] Empresa Tecnol Informacao Ceara, Ave Pontes Vieira 220, BR-60130240 Fortaleza, Ceara, Brazil
关键词
Digital forensic; Audio analytics; Speaker recognition;
D O I
10.1016/j.fsidi.2024.301765
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces an approach that supports speaker identification in criminal investigations, specifically addressing challenges associated with large volumes of audio recordings featuring unknown speaker identities. Our approach clusters related recordings - potentially from the same person - based on representative voice embeddings extracted using the ECAPA-TDNN speaker recognition model. Grouping audio recordings from the same person enhances variability and richness in voice patterns, thereby improving confidence in automatic speaker recognition. We propose a combination of cosine similarity and a rank-based adjustment function to determine matches of audio clusters with individuals in an enrollment database. Our approach was validated through experiments on a Common Voice-based synthesized dataset and a real-life application involving cell phones seized in prisons, which contained thousands of conversational audio recordings. Results demonstrated satisfactory performance and stability, consistently reducing the pool of candidate speakers for subsequent analysis by a human investigator.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Reconstruction of Missing Features Based on a Low-Rank Assumption for Robust Speaker Identification
    Tzagkarakis, Christos
    Mouchtaris, Athanasios
    5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, 2014, : 432 - 437
  • [22] Enhancing GMM speaker identification by incorporating SVM speaker verification for intelligent web-based speech applications
    Ding, Ing-Jr
    Yen, Chih-Ta
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (14) : 5131 - 5140
  • [23] Enhancing GMM speaker identification by incorporating SVM speaker verification for intelligent web-based speech applications
    Ing-Jr Ding
    Chih-Ta Yen
    Multimedia Tools and Applications, 2015, 74 : 5131 - 5140
  • [24] Identification of a rank-based radiomic signature with individualized prognostic value for lung adenocarcinoma in a multi-cohort study
    Liu, Yixin
    Wang, Zhihui
    Yang, Liping
    Zhang, Meng
    Li, Mengyue
    Zhang, Juxuan
    Tang, Lefan
    Jiang, Zhiyun
    Li, Xin
    Deng, Jiaxing
    Meng, Qingwei
    Liu, Shilong
    Wang, Kezheng
    Qi, Lishuang
    EUROPEAN JOURNAL OF RADIOLOGY, 2024, 181
  • [25] Cross-platform comparison of immune signatures in immunotherapy-treated patients with advanced melanoma using a rank-based scoring approach
    Mao Y.
    Gide T.N.
    Adegoke N.A.
    Quek C.
    Maher N.
    Potter A.
    Patrick E.
    Saw R.P.M.
    Thompson J.F.
    Spillane A.J.
    Shannon K.F.
    Carlino M.S.
    Lo S.N.
    Menzies A.M.
    da Silva I.P.
    Long G.V.
    Scolyer R.A.
    Wilmott J.S.
    Journal of Translational Medicine, 21 (1)
  • [26] Enhancing the Performance of Gaussian Mixture Model-Based Text Independent Speaker Identification
    M.A. El-Gamal
    M.F. Abu El-Yazeed
    M.M.H. El Ayadi
    International Journal of Speech Technology, 2005, 8 (1) : 93 - 103
  • [27] Enhancing the Performance of Gaussian Mixture Model-Based Text Independent Speaker Identification
    El-Gamal, M. A.
    Abu El-Yazeed, M. F.
    El Ayadi, M. M. H.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2005, 8 (01) : 93 - 103
  • [28] Topic modeling through rank-based aggregation and LLMs: An approach for AI and human-generated scientific texts
    Celikten, Tugba
    Onan, Aytug
    KNOWLEDGE-BASED SYSTEMS, 2025, 314
  • [29] scDetect: a rank-based ensemble learning algorithm for cell type identification of single-cell RNA sequencing in cancer
    Shen, Yifei
    Chu, Qinjie
    Timko, Michael P.
    Fan, Longjiang
    BIOINFORMATICS, 2021, 37 (22) : 4115 - 4122
  • [30] Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement
    Ravi, Vijay
    Wang, Jinhan
    Flint, Jonathan
    Alwan, Abeer
    COMPUTER SPEECH AND LANGUAGE, 2024, 86