Privacy-Preserving Locally Weighted Linear Regression Over Encrypted Millions of Data

被引:5
|
作者
Dong, Xiaoxia [1 ,3 ]
Chen, Jie [2 ,3 ]
Zhang, Kai [4 ]
Qian, Haifeng [2 ,3 ]
机构
[1] East China Normal Univ, Dept Comp Sci & Technol, Shanghai 200062, Peoples R China
[2] East China Normal Univ, Sch Software Engn, Shanghai 200062, Peoples R China
[3] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 201804, Peoples R China
[4] Shanghai Univ Elect Power, Sch Comp Sci & Technol, Shanghai 201306, Peoples R China
基金
中国国家自然科学基金;
关键词
Locally weighted linear regression; privacy-preserving; paillier homomorphic encryption; stochastic gradient descent;
D O I
10.1109/ACCESS.2019.2962700
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emerging development of cloud computing makes a trend that the cloud becomes a outsourced agglomeration for storing big data that generally contains numerous information. To mine the rich value involved in big data, the machine learning methodology is widespread employed due to its ability to adapt to data changes. However, the data mining process may involve the privacy issues of the users, hence they are reluctant to share their information. This is the reason why the outsourced data need to be dealt with securely, where data encryption is considered to be the most straightforward method to keep the privacy of data, but machine learning on the data in ciphertext domain is more complicated than the plaintext, since the relationship structure between data is no longer maintained, in such a way that we focus on the machine learning over encrypted big data. In this work, we study locally weighted linear regression (LWLR), a widely used classic machine learning algorithm in real-world, such as predict and find the best-fit curve through numerous data points. To tackle the privacy concerns in utilizing the LWLR algorithm, we present a system for privacy-preserving locally weighted linear regression, where the system not only protects the privacy of users but also encrypts the best-fit curve. Therefore, we use Paillier homomorphic encryption as the building modular to encrypt data and then apply the stochastic gradient descent in encrypted domain. After given a security analysis, we study how to let Paillier encryption deal with real numbers and implement the system in Python language with a couple of experiments on real-world data sets to evaluate the effectiveness, and show that it outperforms the state-of-the-art and occurs negligible errors compared with performing locally weighted linear regression in the clear.
引用
收藏
页码:2247 / 2257
页数:11
相关论文
共 50 条
  • [31] Privacy-Preserving Tensor Decomposition Over Encrypted Data in a Federated Cloud Environment
    Feng, Jun
    Yang, Laurence T.
    Zhu, Qing
    Choo, Kim-Kwang Raymond
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2020, 17 (04) : 857 - 868
  • [32] Privacy-Preserving Krawtchouk Moment feature extraction over encrypted image data
    Yang, Tengfei
    Ma, Jianfeng
    Miao, Yinbin
    Liu, Ximeng
    Wang, Xuan
    Xiao, Bin
    Meng, Qian
    INFORMATION SCIENCES, 2020, 536 : 244 - 262
  • [33] Efficient and Privacy-Preserving Spatial Keyword Similarity Query Over Encrypted Data
    Zhang, Songnian
    Ray, Suprio
    Lu, Rongxing
    Guan, Yunguo
    Zheng, Yandong
    Shao, Jun
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (05) : 3770 - 3786
  • [34] Towards realistic privacy-preserving deep learning over encrypted medical data
    Cabrero-Holgueras, Jose
    Pastrana, Sergio
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2023, 10
  • [35] Privacy-Preserving Outsourced Similarity Test for Access Over Encrypted Data in the Cloud
    Yang, Dan
    Chen, Yu-Chi
    Ye, Shaozhen
    Tso, Raylin
    IEEE ACCESS, 2018, 6 : 63624 - 63634
  • [36] PRkNN: Efficient and Privacy-Preserving Reverse kNN Query Over Encrypted Data
    Zheng, Yandong
    Lu, Rongxing
    Zhang, Songnian
    Guan, Yunguo
    Wang, Fengwei
    Shao, Jun
    Zhu, Hui
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (05) : 4387 - 4402
  • [37] A Privacy-preserving Fuzzy Keyword Search Scheme over Encrypted Cloud Data
    Wang, Dongsheng
    Fu, Shaojing
    Xu, Ming
    2013 IEEE FIFTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), VOL 1, 2013, : 663 - 670
  • [38] Privacy-Preserving Reverse Nearest Neighbor Query Over Encrypted Spatial Data
    Li, Xiaoguo
    Xiang, Tao
    Guo, Shangwei
    Li, Hongwei
    Mu, Yi
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (05) : 2954 - 2968
  • [39] Toward practical privacy-preserving linear regression
    Xu, Wenju
    Wang, Baocang
    Liu, Jiasen
    Chen, Yange
    Duan, Pu
    Hong, Zhiyong
    INFORMATION SCIENCES, 2022, 596 (119-136) : 119 - 136
  • [40] Input and Output Privacy-Preserving Linear Regression
    Aono, Yoshinori
    Hayashi, Takuya
    Phong, Le Trieu
    Wang, Lihua
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (10) : 2339 - 2347