AntiFake: Using Adversarial Audio to Prevent Unauthorized Speech Synthesis

被引:3
|
作者
Yu, Zhiyuan [1 ]
Zhai, Shixuan [1 ]
Zhang, Ning [1 ]
机构
[1] Washington Univ, St Louis, MO 63110 USA
关键词
Adversarial Machine Learning; Generative AI; Speech Synthesis; DeepFake Defense;
D O I
10.1145/3576915.3623209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid development of deep neural networks and generative AI has catalyzed growth in realistic speech synthesis. While this technology has great potential to improve lives, it also leads to the emergence of "DeepFake" where synthesized speech can be misused to deceive humans and machines for nefarious purposes. In response to this evolving threat, there has been a significant amount of interest in mitigating this threat by DeepFake detection. Complementary to the existing work, we propose to take the preventative approach and introduce AntiFake, a defense mechanism that relies on adversarial examples to prevent unauthorized speech synthesis. To ensure the transferability to attackers' unknown synthesis models, an ensemble learning approach is adopted to improve the generalizability of the optimization process. To validate the efficacy of the proposed system, we evaluated AntiFake against five state-of-the-art synthesizers using real-world DeepFake speech samples. The experiments indicated that AntiFake achieved over 95% protection rate even to unknown black-box models. We have also conducted usability tests involving 24 human participants to ensure the solution is accessible to diverse populations.
引用
收藏
页码:460 / 474
页数:15
相关论文
共 50 条
  • [41] DeepAdversaryDefense: A Deep Model to Identify and Prevent Adversarial Attacks against Medical Speech Recognition
    Panwar, Kirtee
    Singh, Akansha
    Singh, Krishna Kant
    5TH INTERNATIONAL CONFERENCE ON INFORMATICS & DATA-DRIVEN MEDICINE, IDDM 2022, 2022, 3302
  • [42] Noisy audio feature enhancement using audio-visual speech data
    Goecke, R
    Potamianos, G
    Neti, C
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2025 - 2028
  • [43] Anomaly Detection of Deepfake Audio Based on Real Audio Using Generative Adversarial Network Model
    Song, Daeun
    Lee, Nayoung
    Kim, Jiwon
    Choi, Eunjung
    IEEE ACCESS, 2024, 12 : 184311 - 184326
  • [44] Acoustic-decoy: Detection of adversarial examples through audio modification on speech recognition system
    Kwon, Hyun
    Yoon, Hyunsoo
    Park, Ki-Woong
    NEUROCOMPUTING, 2020, 417 (417) : 357 - 370
  • [45] Towards the universal defense for query-based audio adversarial attacks on speech recognition system
    Guo, Feng
    Sun, Zheng
    Chen, Yuxuan
    Ju, Lei
    CYBERSECURITY, 2023, 6 (01)
  • [46] Generative Adversarial Networks (GANs) for Audio-Visual Speech Recognition in Artificial Intelligence IoT
    He, Yibo
    Seng, Kah Phooi
    Ang, Li Minn
    INFORMATION, 2023, 14 (10)
  • [47] Towards the universal defense for query-based audio adversarial attacks on speech recognition system
    Feng Guo
    Zheng Sun
    Yuxuan Chen
    Lei Ju
    Cybersecurity, 6
  • [48] StyleWaveGAN: Style-based synthesis of drum sounds using generative adversarial networks for higher audio quality
    Lavault, Antoine
    Roebel, Axel
    Voiry, Matthieu
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 234 - 238
  • [49] Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks
    Saito, Yuki
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (01) : 84 - 96
  • [50] Transforming the Emotion in Speech using a Generative Adversarial Network
    Yasuda, Kenji
    Orihara, Ryohei
    Sei, Yuichi
    Tahara, Yasuyuki
    Ohsuga, Akihiko
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 427 - 434