AntiFake: Using Adversarial Audio to Prevent Unauthorized Speech Synthesis

Cited by: 3
Authors
Yu, Zhiyuan [1]
Zhai, Shixuan [1]
Zhang, Ning [1]
Affiliations
[1] Washington Univ, St Louis, MO 63110 USA
Keywords
Adversarial Machine Learning; Generative AI; Speech Synthesis; DeepFake Defense
DOI
10.1145/3576915.3623209
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The rapid development of deep neural networks and generative AI has catalyzed growth in realistic speech synthesis. While this technology has great potential to improve lives, it has also led to the emergence of "DeepFake" speech, where synthesized speech is misused to deceive humans and machines for nefarious purposes. In response to this evolving threat, there has been significant interest in mitigation through DeepFake detection. Complementary to existing work, we propose a preventative approach and introduce AntiFake, a defense mechanism that relies on adversarial examples to prevent unauthorized speech synthesis. To ensure transferability to attackers' unknown synthesis models, an ensemble learning approach is adopted to improve the generalizability of the optimization process. To validate the efficacy of the proposed system, we evaluated AntiFake against five state-of-the-art synthesizers using real-world DeepFake speech samples. The experiments indicated that AntiFake achieved a protection rate of over 95%, even against unknown black-box models. We also conducted usability tests involving 24 human participants to ensure the solution is accessible to diverse populations.
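The general idea of ensemble-based adversarial perturbation mentioned in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the random linear maps below are hypothetical stand-ins for neural speaker encoders, and the function names, the L-infinity bound `eps`, the step size, and the signed-gradient update are all assumptions chosen for illustration. The sketch perturbs a signal within a small bound so that its embedding moves away from the original across several surrogate encoders, then checks the shift on an encoder that was not part of the ensemble.

```python
import random


def make_encoder(dim_in, dim_out, rng):
    """Hypothetical stand-in for a speaker encoder: a fixed random linear map."""
    return [[rng.gauss(0.0, 1.0) for _ in range(dim_in)] for _ in range(dim_out)]


def embed(encoder, signal):
    """Map a signal (list of floats) to an embedding (list of floats)."""
    return [sum(w * s for w, s in zip(row, signal)) for row in encoder]


def embedding_shift(encoder, original, protected):
    """Squared distance between the embeddings of the two signals."""
    a, b = embed(encoder, original), embed(encoder, protected)
    return sum((p - q) ** 2 for p, q in zip(a, b))


def ensemble_gradient(encoders, delta):
    """Gradient of the mean embedding shift with respect to the perturbation.

    For a linear encoder W the shift is ||W delta||^2, whose gradient is
    2 * W^T (W delta); the ensemble gradient averages this over all encoders.
    """
    grad = [0.0] * len(delta)
    for encoder in encoders:
        projected = embed(encoder, delta)
        for row, value in zip(encoder, projected):
            for j, weight in enumerate(row):
                grad[j] += 2.0 * weight * value
    return [g / len(encoders) for g in grad]


def protect(signal, encoders, eps=0.01, step=0.002, iters=20, seed=0):
    """Signed-gradient ascent on the ensemble loss under an L-infinity bound."""
    rng = random.Random(seed)
    # Start from a small random perturbation: the gradient is zero at delta = 0.
    delta = [rng.uniform(-eps, eps) for _ in signal]
    for _ in range(iters):
        grad = ensemble_gradient(encoders, delta)
        for j, g in enumerate(grad):
            direction = (g > 0) - (g < 0)
            # Take a step, then clip back into [-eps, eps].
            delta[j] = max(-eps, min(eps, delta[j] + step * direction))
    return [s + d for s, d in zip(signal, delta)]


rng = random.Random(42)
surrogates = [make_encoder(64, 8, rng) for _ in range(3)]  # models the defender holds
unseen = make_encoder(64, 8, rng)                          # model the defender never saw
clean = [rng.uniform(-1.0, 1.0) for _ in range(64)]
protected = protect(clean, surrogates)
print("max sample change:", max(abs(p - c) for p, c in zip(protected, clean)))
print("embedding shift on unseen encoder:", embedding_shift(unseen, clean, protected))
```

Averaging the gradient over several surrogate encoders is what stands in for the ensemble idea here: the perturbation is not tuned to any single model, which is the intuition behind transfer to models the defender has not seen. Real speech encoders are nonlinear, so an actual system would compute gradients by automatic differentiation rather than in closed form.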
Pages: 460-474
Page count: 15