Explanation-based data-free model extraction attacks

被引：3

作者：

Yan, Anli ^{[1
,2
]}

Hou, Ruitao ^{[2
]}

Yan, Hongyang ^{[2
]}

Liu, Xiaozhang ^{[3
]}

机构：

[1] Hainan Univ, Sch Cyberspace Secur, Sch Cryptol, Haikou, Peoples R China

[2] Guangzhou Univ, Inst Artificial Intelligence & Blockchain, Guangzhou, Peoples R China

[3] Hainan Univ, Sch Comp Sci & Technol, Haikou, Peoples R China

来源：

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2023年 / 26卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Deep neural network; Model explanation; Black-box; Model extraction attack; FRAMEWORK; EFFICIENT;

D O I：

10.1007/s11280-023-01150-6

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep learning (DL) has dramatically pushed the previous limits of various tasks, ranging from computer vision to natural language processing. Despite its success, the lack of model explanations thwarts the usage of these techniques in life-critical domains, e.g., medical diagnosis and self-driving systems. To date, the core technology to solve the explainable issue is explainable artificial intelligence (XAI). XAI methods have been developed to produce human-understandable explanations by leveraging intermediate results of the DL models, e.g., gradients and model parameters. While the effectiveness of XAI methods has been demonstrated in benign environments, their privacy against model extraction attacks (i.e., attacks at the model confidentially) requires to be studied. To this end, this paper proposes DMEAE, a data-free model extraction attack using explanation-guided, to explore XAI privacy threats. Compared with previous works, DMEAE does not require collecting any data and utilizes model explanation loss. Specifically, DMEAE creates synthetic data using a generative model with model explanation loss items. Extensive evaluations verify the effectiveness and efficiency of the proposed attack strategy on SVHN and CIFAR-10 datasets. We hope that our research can provide insights for the development of practical tools to trade off the relationship between privacy and model explanations.

引用

页码：3081 / 3092

页数：12

共 50 条

[21] Explanation-Based Feature Construction
Lim, Shiau Hong
Wang, Li-Lun
DeJong, Gerald
20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 931 - 936
[22] Explanation-based learning in infancy
Baillargeon, Renee
DeJong, Gerald F.
PSYCHONOMIC BULLETIN & REVIEW, 2017, 24 (05) : 1511 - 1526
[23] Explanation-based learning in infancy
Renée Baillargeon
Gerald F. DeJong
Psychonomic Bulletin & Review, 2017, 24 : 1511 - 1526
[24] EXPLANATION-BASED LEARNING FOR DIAGNOSIS
ELFATTAH, Y
ORORKE, P
MACHINE LEARNING, 1993, 13 (01) : 35 - 70
[25] EXPLANATION-BASED LEARNING - A SURVEY
WUSTEMAN, J
ARTIFICIAL INTELLIGENCE REVIEW, 1992, 6 (03) : 243 - 262
[26] Explanation-Based Approximate Weighted Model Counting for Probabilistic Logics
Renkens, Joris
Kimmig, Angelika
Van den Broeck, Guy
De Raedt, Luc
PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2490 - 2496
[27] Exploring and Exploiting Data-Free Model Stealing
Hong, Chi
Huang, Jiyue
Birke, Robert
Chen, Lydia Y.
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT V, 2023, 14173 : 20 - 35
[28] Data-Free Network Pruning for Model Compression
Tang, Jialiang
Liu, Mingjin
Jiang, Ning
Cai, Huan
Yu, Wenxin
Zhou, Jinjia
2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
[29] ELEVATED SERUM ENZYME-ACTIVITY - AN EXPLANATION-BASED MODEL
MORTON, RH
CARTER, MR
JOURNAL OF APPLIED PHYSIOLOGY, 1992, 73 (05) : 2192 - 2200
[30] A STRUCTURAL THEORY OF EXPLANATION-BASED LEARNING
ETZIONI, O
ARTIFICIAL INTELLIGENCE, 1993, 60 (01) : 93 - 139

← 1 2 3 4 5 →