A Novel Backdoor Detection Approach Using Entropy-Based Measures

被引:0
|
作者
Surendrababu, Hema Karnam [1 ,2 ]
Nagaraj, Nithin [3 ]
机构
[1] Univ Transdisciplinary Hlth Sci & Technol, Bengaluru 560012, Karnataka, India
[2] Natl Inst Adv Studies, Indian Inst Sci Campus, Sch Conflict & Secur Studies, Bengaluru 560012, Karnataka, India
[3] Natl Inst Adv Studies, Indian Inst Sci Campus, Sch Humanities, Consciousness Studies Programme, Bengaluru 560012, Karnataka, India
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Entropy; Complexity theory; Training; Data models; Computational modeling; Vectors; Time series analysis; Artificial intelligence; Detection algorithms; Data integrity; Data poisoning; backdoor attacks; backdoor defenses; approximate entropy; sample entropy; TIME-SERIES ANALYSIS; APPROXIMATE ENTROPY; COMPLEXITY; COMPRESSION;
D O I
10.1109/ACCESS.2024.3444273
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Amidst the recent technological breakthroughs and increased integration of Artificial Intelligence (AI) technologies across various domains, it is imperative to consider the myriad security threats posed by AI. One of the significant attack vectors on AI models is the backdoor attack, which involves maliciously manipulating the model's behaviour by inserting hidden patterns or triggers into training datasets. In this paper our primary focus is on the defenses for the backdoor attacks mounted via poisoned training datasets. While many backdoor defense mechanisms have been proposed in the context of text, image, and audio domains, a majority of these defense mechanisms focus on training a specific model to detect backdoor triggers. Our current work proposes a novel model agnostic backdoor detection approach that utilizes complexity/entropy-based measures. In this study, we demonstrate the limitations of currently existing entropy measures - Sample Entropy and Approximate Entropy in detecting backdoor triggers in poisoned datasets. Consequently, we propose a novel modification of the Manhattan metric in the Entropy calculation and incorporate it in the complexity measures. This modified approach is shown to successfully detect backdoor triggers in datasets from not only the Natural Language Processing (NLP) domain, but also from the Financial and Geological domains. The effectiveness of the proposed approach was further substantiated with the high F1 scores in the range of 0.92 to 1.00 across the datasets, and with zero false negatives for the real-world datasets from the Financial and the Geological domains.
引用
收藏
页码:114057 / 114072
页数:16
相关论文
共 50 条
  • [31] An Entropy-based TextWatermarking Detection Method
    Lu, Yijian
    Liu, Aiwei
    Yu, Dianzhi
    Li, Jingjing
    King, Irwin
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 11724 - 11735
  • [32] Entropy-Based Anomaly Detection in a Network
    Shukla, Ajay Shankar
    Maurya, Rohit
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 99 (04) : 1487 - 1501
  • [33] Entropy-Based Watermarking Approach for Sensitive Tamper Detection of Arabic Text
    Al-Wesabi, Fahd N.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (03): : 3635 - 3648
  • [34] A novel subspace outlier detection method by entropy-based clustering algorithm
    Zuo, Zheng
    Li, Ziqiang
    Cheng, Pengsen
    Zhao, Jian
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [35] An entropy-based approach for shape description
    Bruni, Vittoria
    Della Cioppa, Lorenzo
    Vitulano, Domenico
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 603 - 607
  • [36] An Entropy-Based Approach to Portfolio Optimization
    Mercurio, Peter Joseph
    Wu, Yuehua
    Xie, Hong
    ENTROPY, 2020, 22 (03)
  • [37] Improved Memory-based Collaborative Filtering Using Entropy-based Similarity Measures
    Kwon, Hyeong-Joon
    Lee, Tae-Hoon
    Hong, Kwang-Seok
    2009 INTERNATIONAL SYMPOSIUM ON WEB INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 29 - 34
  • [38] New entropy-based measures of gene significance and epistasis
    Seo, DI
    Kim, YH
    Moon, BR
    GENETIC AND EVOLUTIONARY COMPUTATION - GECCO 2003, PT II, PROCEEDINGS, 2003, 2724 : 1345 - 1356
  • [39] Inequalities for entropy-based measures of network information content
    Dehmer, Matthias
    Mowshowitz, Abbe
    APPLIED MATHEMATICS AND COMPUTATION, 2010, 215 (12) : 4263 - 4271
  • [40] Entropy-based complexity measures for dynamic decision processes
    Wang, H
    Efstathiou, J
    Yang, JB
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2005, 12 (5-6): : 829 - 848