Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation

被引:158
作者
Kim, Miran [1 ]
Song, Yongsoo [2 ,3 ]
Wang, Shuang [1 ]
Xia, Yuhou [4 ]
Jiang, Xiaoqian [1 ]
机构
[1] Univ Calif San Diego, Div Biomed Informat, 9500 Gilman Dr, San Diego, CA 92093 USA
[2] Seoul Natl Univ, Dept Math Sci, Seoul, South Korea
[3] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92093 USA
[4] Princeton Univ, Dept Math, Princeton, NJ 08544 USA
基金
新加坡国家研究基金会;
关键词
homomorphic encryption; machine learning; logistic regression; gradient descent; SEARCH; MODELS;
D O I
10.2196/medinform.8805
中图分类号
R-058 [];
学科分类号
摘要
Background: Learning a model without accessing raw data has been an intriguing idea to security and machine learning researchers for years. In an ideal setting, we want to encrypt sensitive data to store them on a commercial cloud and run certain analyses without ever decrypting the data to preserve privacy. Homomorphic encryption technique is a promising candidate for secure data outsourcing, but it is a very challenging task to support real-world machine learning tasks. Existing frameworks can only handle simplified cases with low-degree polynomials such as linear means classifier and linear discriminative analysis. Objective: The goal of this study is to provide a practical support to the mainstream learning models (eg, logistic regression). Methods: We adapted a novel homomorphic encryption scheme optimized for real numbers computation. We devised (1) the least squares approximation of the logistic function for accuracy and efficiency (ie, reduce computation cost) and (2) new packing and parallelization techniques. Results: Using real-world datasets, we evaluated the performance of our model and demonstrated its feasibility in speed and memory consumption. For example, it took approximately 116 minutes to obtain the training model from the homomorphically encrypted Edinburgh dataset. In addition, it gives fairly accurate predictions on the testing dataset. Conclusions: We present the first homomorphically encrypted logistic regression outsourcing model based on the critical observation that the precision loss of classification models is sufficiently small so that the decision plan stays still.
引用
收藏
页码:245 / 255
页数:11
相关论文
共 27 条
[1]  
[Anonymous], IMPLEMENTATION SECUR
[2]  
[Anonymous], SURV RES HOST CLOUD
[3]  
[Anonymous], P IEEE S SECUR PRIV
[4]  
[Anonymous], BIOMEDICAL RES CHANG
[5]  
[Anonymous], 2012, DEFINITIVE GUIDE COM
[6]  
[Anonymous], THE HEAAN LIB
[7]   Scalable and Secure Logistic Regression via Homomorphic Encryption [J].
Aono, Yoshinori ;
Hayashi, Takuya ;
Le Trieu Phong ;
Wang, Lihua .
CODASPY'16: PROCEEDINGS OF THE SIXTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, 2016, :142-144
[8]   Private predictive analysis on encrypted medical data [J].
Bos, Joppe W. ;
Lauter, Kristin ;
Naehrig, Michael .
JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 50 :234-243
[9]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[10]   Homomorphic Encryption for Arithmetic of Approximate Numbers [J].
Cheon, Jung Hee ;
Kim, Andrey ;
Kim, Miran ;
Song, Yongsoo .
ADVANCES IN CRYPTOLOGY - ASIACRYPT 2017, PT I, 2017, 10624 :409-437