Speaker Anonymity and Voice Conversion Vulnerability: A Speaker Recognition Analysis

被引:2
|
作者
Saini, Shalini [1 ]
Saxena, Nitesh [1 ]
机构
[1] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77834 USA
关键词
Voice Anonymity; Voice Conversion; Speaker Recognition; Privacy and Security;
D O I
10.1109/CNS59707.2023.10289030
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Anonymized voice data is essential for maintaining privacy in voice-based exchanges of sensitive information, particularly in healthcare. However, voice conversion methods prioritize target voice identity over completely obscuring the source speaker's voice features. Recent advancements in speaker recognition systems have increased their ability to detect subtle residual voice features of the source speaker in voice-converted samples with greater precision and accuracy, posing potential risks to voice anonymity. Balancing speaker anonymity and recognition accuracy is a persistent challenge in voice-based applications, where maintaining voice anonymity and correctly identifying the speaker are critical but vulnerable. Violating voice anonymity can result in privacy and security threats. In this work, we examine multiple voice conversion and speaker recognition systems to explore the threats to voice anonymity. Our findings demonstrate the significant risk of identifying source speakers from converted voice samples. We discovered that voice anonymity is more vulnerable to breaking with one-to-one conversions compared to many-to-many and any-to-any conversions. The likelihood of identifying the original speaker from anonymized speech data depends on target voice features, voice conversion techniques, and speaker recognition methods.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Comparison Between Speaker Dependent Mode and Speaker Independent Mode for Voice Recognition
    Mrvaljevic, Nikola
    Sun, Ying
    2009 35TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE, 2009, : 187 - 188
  • [22] Voice conversion based on static speaker characteristics
    Schwardt, LC
    du Preez, JA
    PROCEEDINGS OF THE 1998 SOUTH AFRICAN SYMPOSIUM ON COMMUNICATIONS AND SIGNAL PROCESSING: COMSIG '98, 1998, : 57 - 62
  • [23] SVCGAN: Speaker Voice Conversion Generative Adversarial Network for Children's Speech Conversion and Recognition
    Xie, Chenghuan
    Zhou, Aimin
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 2182 - 2196
  • [24] Voice articulator for Thai speaker recognition system
    Limpanakorn, S
    Tanprasert, C
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 2396 - 2400
  • [25] Improving speaker recognition by biometric voice deconstruction
    Miguel Mazaira-Fernandez, Luis
    Alvarez-Marquina, Agustin
    Gomez-Vilda, Pedro
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2015, 3
  • [26] Noise Robust Voice Detector for Speaker Recognition
    Hernandez, Gabriel
    Calvo, Jose R.
    Fernandez, Rafael
    Rodes, Ivis
    Martinez, Rafael
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2605 - 2608
  • [27] Vulnerability of Speaker Verification Systems Against Voice Conversion Spoofing Attacks: the Case of Telephone Speech
    Kinnunen, Tomi
    Wu, Zhi-Zheng
    Lee, Kong Aik
    Sedlak, Filip
    Chng, Eng Siong
    Li, Haizhou
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4401 - 4404
  • [28] Voice conversion and spoofing attack on speaker verification systems
    Wu, Zhizheng
    Li, Haizhou
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [29] From Speaker Recognition to Forensic Speaker Recognition
    Drygajlo, Andrzej
    BIOMETRIC AUTHENTICATION (BIOMET 2014), 2014, 8897 : 93 - 104
  • [30] Emotion-state conversion for speaker recognition
    Li, DD
    Yang, YC
    Wu, ZH
    Wu, T
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 403 - 410