Explanation-based data-free model extraction attacks

被引:3
|
作者
Yan, Anli [1 ,2 ]
Hou, Ruitao [2 ]
Yan, Hongyang [2 ]
Liu, Xiaozhang [3 ]
机构
[1] Hainan Univ, Sch Cyberspace Secur, Sch Cryptol, Haikou, Peoples R China
[2] Guangzhou Univ, Inst Artificial Intelligence & Blockchain, Guangzhou, Peoples R China
[3] Hainan Univ, Sch Comp Sci & Technol, Haikou, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep neural network; Model explanation; Black-box; Model extraction attack; FRAMEWORK; EFFICIENT;
D O I
10.1007/s11280-023-01150-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning (DL) has dramatically pushed the previous limits of various tasks, ranging from computer vision to natural language processing. Despite its success, the lack of model explanations thwarts the usage of these techniques in life-critical domains, e.g., medical diagnosis and self-driving systems. To date, the core technology to solve the explainable issue is explainable artificial intelligence (XAI). XAI methods have been developed to produce human-understandable explanations by leveraging intermediate results of the DL models, e.g., gradients and model parameters. While the effectiveness of XAI methods has been demonstrated in benign environments, their privacy against model extraction attacks (i.e., attacks at the model confidentially) requires to be studied. To this end, this paper proposes DMEAE, a data-free model extraction attack using explanation-guided, to explore XAI privacy threats. Compared with previous works, DMEAE does not require collecting any data and utilizes model explanation loss. Specifically, DMEAE creates synthetic data using a generative model with model explanation loss items. Extensive evaluations verify the effectiveness and efficiency of the proposed attack strategy on SVHN and CIFAR-10 datasets. We hope that our research can provide insights for the development of practical tools to trade off the relationship between privacy and model explanations.
引用
收藏
页码:3081 / 3092
页数:12
相关论文
共 50 条
  • [31] On the Pros and Cons of Explanation-Based Ranking
    Muhammad, Khalil
    Lawlor, Aonghus
    Smyth, Barry
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2017, 2017, 10339 : 227 - 241
  • [32] DEFINING OPERATIONALITY FOR EXPLANATION-BASED LEARNING
    KELLER, RM
    ARTIFICIAL INTELLIGENCE, 1988, 35 (02) : 227 - 241
  • [33] THE EXPLANATION-BASED LEARNING (EBL) SYMPOSIUM
    不详
    AI MAGAZINE, 1988, 9 (02) : 141 - 141
  • [34] Sequencing via explanation-based learning
    Tianfield, H
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2001, 16 (02) : 237 - 262
  • [35] EXPLANATION-BASED GENERALIZATION = PARTIAL EVALUATION
    VANHARMELEN, F
    BUNDY, A
    ARTIFICIAL INTELLIGENCE, 1988, 36 (03) : 401 - 412
  • [36] Explanation-based large neighborhood search
    Prud'homme, Charles
    Lorca, Xavier
    Jussien, Narendra
    CONSTRAINTS, 2014, 19 (04) : 339 - 379
  • [37] Explanation-based large neighborhood search
    Charles Prud’homme
    Xavier Lorca
    Narendra Jussien
    Constraints, 2014, 19 : 339 - 379
  • [38] A new method for explanation-based learning
    Yang, J
    Shi, PF
    EXPERT SYSTEMS WITH APPLICATIONS, 1996, 10 (3-4) : 435 - 439
  • [39] SUPPORTING MODEL-BASED DIAGNOSIS WITH EXPLANATION-BASED LEARNING AND ANALOGICAL INFERENCES
    SPECHT, D
    WEISS, S
    LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, 1992, 604 : 314 - 323
  • [40] Contrastive Model Inversion for Data-Free Knowledge Distillation
    Fang, Gongfan
    Song, Jie
    Wang, Xinchao
    Shen, Chengchao
    Wang, Xingen
    Song, Mingli
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2374 - 2380