Private Model Compression via Knowledge Distillation

Cited by: 0
Authors
Wang, Ji [1 ]
Bao, Weidong [1 ]
Sun, Lichao [2 ]
Zhu, Xiaomin [1 ,3 ]
Cao, Bokai [4 ]
Yu, Philip S. [2 ,5 ]
Affiliations
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha, Hunan, Peoples R China
[2] Univ Illinois, Dept Comp Sci, Chicago, IL USA
[3] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha, Hunan, Peoples R China
[4] Facebook Inc, Menlo Pk, CA USA
[5] Tsinghua Univ, Inst Data Sci, Beijing, Peoples R China
Keywords
DOI
Not available
CLC number
TP18 [Artificial Intelligence Theory];
Discipline codes
081104; 0812; 0835; 1405;
Abstract
The soaring demand for intelligent mobile applications calls for deploying powerful deep neural networks (DNNs) on mobile devices. However, the outstanding performance of DNNs notoriously relies on increasingly complex models, whose computational expense far surpasses the capacity of mobile devices. Worse still, app service providers need to collect and utilize a large volume of users' data, which contain sensitive information, to build the sophisticated DNN models. Directly deploying these models on public mobile devices presents a prohibitive privacy risk. To benefit from on-device deep learning without the capacity and privacy concerns, we design a private model compression framework, RONA. Following the knowledge distillation paradigm, we jointly use hint learning, distillation learning, and self learning to train a compact and fast neural network. The knowledge distilled from the cumbersome model is adaptively bounded and carefully perturbed to enforce differential privacy. We further propose an elegant query sample selection method to reduce the number of queries and control the privacy loss. A series of empirical evaluations, as well as an implementation on an Android mobile device, show that RONA can not only compress cumbersome models efficiently but also provide a strong privacy guarantee. For example, on SVHN, when a meaningful (9.83, 10^-6)-differential privacy is guaranteed, the compact model trained by RONA obtains a 20x compression ratio and a 19x speed-up with merely 0.97% accuracy loss.
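As a rough illustration of the mechanism described in the abstract (not the authors' released implementation), the sketch below shows one differentially private distillation step in PyTorch: the teacher's logits are norm-bounded and perturbed with Gaussian noise before the compact student is trained against the resulting soft labels. The toy networks and the constants clip_bound, noise_scale, and temperature are illustrative assumptions; RONA additionally combines this with hint learning, self learning, and query sample selection, which are omitted here.

```python
# Minimal sketch, assuming a toy teacher/student pair and hand-picked
# privacy constants; it is NOT the authors' RONA code.
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Toy "cumbersome" teacher and "compact" student for a 10-class task.
teacher = nn.Sequential(nn.Linear(32, 128), nn.ReLU(), nn.Linear(128, 10))
student = nn.Sequential(nn.Linear(32, 16), nn.ReLU(), nn.Linear(16, 10))
optimizer = torch.optim.SGD(student.parameters(), lr=0.1)

clip_bound = 4.0    # bound on each teacher logit vector (limits sensitivity) -- assumed value
noise_scale = 0.5   # Gaussian noise std, chosen from the (epsilon, delta) budget -- assumed value
temperature = 4.0   # softening temperature for distillation -- assumed value

def private_teacher_targets(x):
    """Query the teacher, bound its logits, then perturb them with noise."""
    with torch.no_grad():
        logits = teacher(x)
        # Bound the distilled knowledge: rescale any logit vector whose L2 norm
        # exceeds clip_bound, so a single example's influence is limited.
        norms = logits.norm(dim=1, keepdim=True).clamp(min=1e-12)
        logits = logits * torch.clamp(clip_bound / norms, max=1.0)
        # Perturb the bounded logits with Gaussian noise for differential privacy.
        logits = logits + noise_scale * torch.randn_like(logits)
    return F.softmax(logits / temperature, dim=1)

# One distillation step on a batch of query samples (random data here).
x = torch.randn(64, 32)
soft_targets = private_teacher_targets(x)
student_log_probs = F.log_softmax(student(x) / temperature, dim=1)
loss = F.kl_div(student_log_probs, soft_targets, reduction="batchmean")

optimizer.zero_grad()
loss.backward()
optimizer.step()
print(f"distillation loss: {loss.item():.4f}")
```

Bounding the logits before adding noise is what makes the noise scale meaningful: the clip bound caps the sensitivity of each teacher query, so the privacy loss per query, and hence the benefit of querying fewer samples, can be accounted for.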
Pages: 1190+
Number of pages: 9
Related papers
50 records in total
  • [1] Compression of Acoustic Model via Knowledge Distillation and Pruning. Li, Chenxing; Zhu, Lei; Xu, Shuang; Gao, Peng; Xu, Bo. 2018 24th International Conference on Pattern Recognition (ICPR), 2018: 2785-2790.
  • [2] Model Compression Algorithm via Reinforcement Learning and Knowledge Distillation. Liu, Botao; Hu, Bing-Bing; Zhao, Ming; Peng, Sheng-Lung; Chang, Jou-Ming; Tsoulos, Ioannis G. Mathematics, 2023, 11(22).
  • [3] PQK: Model Compression via Pruning, Quantization, and Knowledge Distillation. Kim, Jangho; Chang, Simyung; Kwak, Nojun. Interspeech 2021, 2021: 4568-4572.
  • [4] Private Knowledge Transfer via Model Distillation with Generative Adversarial Networks. Gao, Di; Zhuo, Cheng. ECAI 2020: 24th European Conference on Artificial Intelligence, 2020, 325: 1794-1801.
  • [5] Knowledge Distillation Beyond Model Compression. Sarfraz, Fahad; Arani, Elahe; Zonooz, Bahram. 2020 25th International Conference on Pattern Recognition (ICPR), 2021: 6136-6143.
  • [6] Model compression via pruning and knowledge distillation for person re-identification. Xie, Haonan; Jiang, Wei; Luo, Hao; Yu, Hongyan. Journal of Ambient Intelligence and Humanized Computing, 2021, 12(02): 2149-2161.
  • [7] Model Selection - Knowledge Distillation Framework for Model Compression. Chen, Renhai; Yuan, Shimin; Wang, Shaobo; Li, Zhenghan; Xing, Meng; Feng, Zhiyong. 2021 IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2021), 2021.
  • [8] Patient Knowledge Distillation for BERT Model Compression. Sun, Siqi; Cheng, Yu; Gan, Zhe; Liu, Jingjing. 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019), 2019: 4323-4332.
  • [9] Triplet Knowledge Distillation Networks for Model Compression. Tang, Jialiang; Jiang, Ning; Yu, Wenxin; Wu, Wenqin. 2021 International Joint Conference on Neural Networks (IJCNN), 2021.