Explanation-based data-free model extraction attacks

被引:3
|
作者
Yan, Anli [1 ,2 ]
Hou, Ruitao [2 ]
Yan, Hongyang [2 ]
Liu, Xiaozhang [3 ]
机构
[1] Hainan Univ, Sch Cyberspace Secur, Sch Cryptol, Haikou, Peoples R China
[2] Guangzhou Univ, Inst Artificial Intelligence & Blockchain, Guangzhou, Peoples R China
[3] Hainan Univ, Sch Comp Sci & Technol, Haikou, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep neural network; Model explanation; Black-box; Model extraction attack; FRAMEWORK; EFFICIENT;
D O I
10.1007/s11280-023-01150-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning (DL) has dramatically pushed the previous limits of various tasks, ranging from computer vision to natural language processing. Despite its success, the lack of model explanations thwarts the usage of these techniques in life-critical domains, e.g., medical diagnosis and self-driving systems. To date, the core technology to solve the explainable issue is explainable artificial intelligence (XAI). XAI methods have been developed to produce human-understandable explanations by leveraging intermediate results of the DL models, e.g., gradients and model parameters. While the effectiveness of XAI methods has been demonstrated in benign environments, their privacy against model extraction attacks (i.e., attacks at the model confidentially) requires to be studied. To this end, this paper proposes DMEAE, a data-free model extraction attack using explanation-guided, to explore XAI privacy threats. Compared with previous works, DMEAE does not require collecting any data and utilizes model explanation loss. Specifically, DMEAE creates synthetic data using a generative model with model explanation loss items. Extensive evaluations verify the effectiveness and efficiency of the proposed attack strategy on SVHN and CIFAR-10 datasets. We hope that our research can provide insights for the development of practical tools to trade off the relationship between privacy and model explanations.
引用
收藏
页码:3081 / 3092
页数:12
相关论文
共 50 条
  • [21] Explanation-Based Feature Construction
    Lim, Shiau Hong
    Wang, Li-Lun
    DeJong, Gerald
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 931 - 936
  • [22] Explanation-based learning in infancy
    Baillargeon, Renee
    DeJong, Gerald F.
    PSYCHONOMIC BULLETIN & REVIEW, 2017, 24 (05) : 1511 - 1526
  • [23] Explanation-based learning in infancy
    Renée Baillargeon
    Gerald F. DeJong
    Psychonomic Bulletin & Review, 2017, 24 : 1511 - 1526
  • [24] EXPLANATION-BASED LEARNING FOR DIAGNOSIS
    ELFATTAH, Y
    ORORKE, P
    MACHINE LEARNING, 1993, 13 (01) : 35 - 70
  • [25] EXPLANATION-BASED LEARNING - A SURVEY
    WUSTEMAN, J
    ARTIFICIAL INTELLIGENCE REVIEW, 1992, 6 (03) : 243 - 262
  • [26] Explanation-Based Approximate Weighted Model Counting for Probabilistic Logics
    Renkens, Joris
    Kimmig, Angelika
    Van den Broeck, Guy
    De Raedt, Luc
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2490 - 2496
  • [27] Exploring and Exploiting Data-Free Model Stealing
    Hong, Chi
    Huang, Jiyue
    Birke, Robert
    Chen, Lydia Y.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT V, 2023, 14173 : 20 - 35
  • [28] Data-Free Network Pruning for Model Compression
    Tang, Jialiang
    Liu, Mingjin
    Jiang, Ning
    Cai, Huan
    Yu, Wenxin
    Zhou, Jinjia
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [29] ELEVATED SERUM ENZYME-ACTIVITY - AN EXPLANATION-BASED MODEL
    MORTON, RH
    CARTER, MR
    JOURNAL OF APPLIED PHYSIOLOGY, 1992, 73 (05) : 2192 - 2200
  • [30] A STRUCTURAL THEORY OF EXPLANATION-BASED LEARNING
    ETZIONI, O
    ARTIFICIAL INTELLIGENCE, 1993, 60 (01) : 93 - 139