Augmented Neural Fine-Tuning for Efficient Backdoor Purification

Authors
Karim, Nazmul [1]
Al Arafat, Abdullah [2]
Khalid, Umar [1]
Guo, Zhishan [2]
Rahnavard, Nazanin [1]
Affiliations
[1] Univ Cent Florida, Orlando, FL 32816 USA
[2] North Carolina State Univ, Raleigh, NC USA
Source
Funding
National Science Foundation (USA)
Keywords
ATTACK
DOI
10.1007/978-3-031-72989-8_23
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent studies have revealed the vulnerability of deep neural networks (DNNs) to various backdoor attacks, in which the behavior of a DNN is compromised through certain types of triggers or poisoning mechanisms. State-of-the-art (SOTA) defenses employ overly sophisticated mechanisms that require either a computationally expensive adversarial search module for reverse-engineering the trigger distribution or an over-sensitive hyper-parameter selection module. Moreover, they offer sub-par performance in challenging scenarios, e.g., limited validation data and strong attacks. In this paper, we propose Neural mask Fine-Tuning (NFT), which aims to optimally re-organize the neuron activities so that the effect of the backdoor is removed. Utilizing a simple data augmentation like MixUp, NFT relaxes the trigger synthesis process and eliminates the requirement of the adversarial search module. Our study further reveals that direct weight fine-tuning under limited validation data results in poor post-purification clean test accuracy, primarily due to overfitting. To overcome this, we propose to fine-tune neural masks instead of model weights. In addition, a mask regularizer has been devised to further mitigate model drift during the purification process. These characteristics render NFT highly efficient in both runtime and sample usage, as it can remove the backdoor even when only a single sample is available from each class. We validate the effectiveness of NFT through extensive experiments covering the tasks of image classification, object detection, video action recognition, 3D point cloud, and natural language processing. We evaluate our method against 14 different attacks (LIRA, WaNet, etc.) on 11 benchmark data sets (ImageNet, UCF101, Pascal VOC, ModelNet, OpenSubtitles2012, etc.). Our code is available online in a GitHub repository.
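
The abstract's two main ingredients, MixUp augmentation and fine-tuning a neural mask while the model weights stay frozen, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper or its repository: the function names (`mixup`, `masked_forward`, `mask_regularizer`, `mask_step`) are hypothetical, the model is a toy single linear layer, squared error stands in for the training loss, and the regularizer is a simple L1 penalty that pulls the mask toward all-ones.

```python
import random


def mixup(x_a, y_a, x_b, y_b, alpha=1.0, lam=None):
    """Convex combination of two samples and their one-hot labels (MixUp)."""
    if lam is None:
        lam = random.betavariate(alpha, alpha)  # mixing coefficient in [0, 1]
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y_a, y_b)]
    return x, y, lam


def masked_forward(weights, mask, x):
    """Toy linear layer: frozen weights scaled per input neuron by `mask`."""
    return [sum(w * m * xi for w, m, xi in zip(row, mask, x)) for row in weights]


def mask_regularizer(mask):
    """Assumed L1 penalty that keeps the mask close to all-ones."""
    return sum(abs(1.0 - m) for m in mask)


def mask_step(weights, mask, x, y, lr=0.1, reg=0.01):
    """One gradient step on the mask only; `weights` are never modified."""
    out = masked_forward(weights, mask, x)
    err = [o - t for o, t in zip(out, y)]  # d(0.5 * squared error) / d(out)
    new_mask = []
    for j, m in enumerate(mask):
        # Chain rule: d(out_k) / d(mask_j) = weights[k][j] * x[j]
        grad = sum(e * row[j] * x[j] for e, row in zip(err, weights))
        # Subgradient of |1 - m|: -1 below one, +1 above one, 0 at one
        grad += reg * (1.0 if m > 1.0 else -1.0 if m < 1.0 else 0.0)
        new_mask.append(min(1.0, max(0.0, m - lr * grad)))  # clip to [0, 1]
    return new_mask


if __name__ == "__main__":
    weights = [[1.0, 2.0]]  # frozen, hypothetical pretrained weights
    mask = [1.0, 1.0]       # an all-ones mask leaves the model unchanged
    x, y, lam = mixup([1.0, 1.0], [1.0], [1.0, 1.0], [1.0], lam=0.5)
    for _ in range(5):
        mask = mask_step(weights, mask, x, y)
    print("mask:", mask, "penalty:", mask_regularizer(mask))
```

Because only the mask is updated, the number of trainable values equals the number of masked neurons rather than the number of weights, which is one plausible reading of why the abstract reports less overfitting with few validation samples.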
Pages: 401-418
Page count: 18