Explanation-based data-free model extraction attacks

被引：3

作者：

Yan, Anli ^{[1
,2
]}

Hou, Ruitao ^{[2
]}

Yan, Hongyang ^{[2
]}

Liu, Xiaozhang ^{[3
]}

机构：

[1] Hainan Univ, Sch Cyberspace Secur, Sch Cryptol, Haikou, Peoples R China

[2] Guangzhou Univ, Inst Artificial Intelligence & Blockchain, Guangzhou, Peoples R China

[3] Hainan Univ, Sch Comp Sci & Technol, Haikou, Peoples R China

来源：

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2023年 / 26卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Deep neural network; Model explanation; Black-box; Model extraction attack; FRAMEWORK; EFFICIENT;

D O I：

10.1007/s11280-023-01150-6

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep learning (DL) has dramatically pushed the previous limits of various tasks, ranging from computer vision to natural language processing. Despite its success, the lack of model explanations thwarts the usage of these techniques in life-critical domains, e.g., medical diagnosis and self-driving systems. To date, the core technology to solve the explainable issue is explainable artificial intelligence (XAI). XAI methods have been developed to produce human-understandable explanations by leveraging intermediate results of the DL models, e.g., gradients and model parameters. While the effectiveness of XAI methods has been demonstrated in benign environments, their privacy against model extraction attacks (i.e., attacks at the model confidentially) requires to be studied. To this end, this paper proposes DMEAE, a data-free model extraction attack using explanation-guided, to explore XAI privacy threats. Compared with previous works, DMEAE does not require collecting any data and utilizes model explanation loss. Specifically, DMEAE creates synthetic data using a generative model with model explanation loss items. Extensive evaluations verify the effectiveness and efficiency of the proposed attack strategy on SVHN and CIFAR-10 datasets. We hope that our research can provide insights for the development of practical tools to trade off the relationship between privacy and model explanations.

引用

页码：3081 / 3092

页数：12

共 50 条

[31] On the Pros and Cons of Explanation-Based Ranking
Muhammad, Khalil
Lawlor, Aonghus
Smyth, Barry
CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2017, 2017, 10339 : 227 - 241
[32] DEFINING OPERATIONALITY FOR EXPLANATION-BASED LEARNING
KELLER, RM
ARTIFICIAL INTELLIGENCE, 1988, 35 (02) : 227 - 241
[33] THE EXPLANATION-BASED LEARNING (EBL) SYMPOSIUM
不详
AI MAGAZINE, 1988, 9 (02) : 141 - 141
[34] Sequencing via explanation-based learning
Tianfield, H
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2001, 16 (02) : 237 - 262
[35] EXPLANATION-BASED GENERALIZATION = PARTIAL EVALUATION
VANHARMELEN, F
BUNDY, A
ARTIFICIAL INTELLIGENCE, 1988, 36 (03) : 401 - 412
[36] Explanation-based large neighborhood search
Prud'homme, Charles
Lorca, Xavier
Jussien, Narendra
CONSTRAINTS, 2014, 19 (04) : 339 - 379
[37] Explanation-based large neighborhood search
Charles Prud’homme
Xavier Lorca
Narendra Jussien
Constraints, 2014, 19 : 339 - 379
[38] A new method for explanation-based learning
Yang, J
Shi, PF
EXPERT SYSTEMS WITH APPLICATIONS, 1996, 10 (3-4) : 435 - 439
[39] SUPPORTING MODEL-BASED DIAGNOSIS WITH EXPLANATION-BASED LEARNING AND ANALOGICAL INFERENCES
SPECHT, D
WEISS, S
LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, 1992, 604 : 314 - 323
[40] Contrastive Model Inversion for Data-Free Knowledge Distillation
Fang, Gongfan
Song, Jie
Wang, Xinchao
Shen, Chengchao
Wang, Xingen
Song, Mingli
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2374 - 2380

← 1 2 3 4 5 →