Privacy-Preserving Locally Weighted Linear Regression Over Encrypted Millions of Data

Cited by: 5
Authors
Dong, Xiaoxia [1 ,3 ]
Chen, Jie [2 ,3 ]
Zhang, Kai [4 ]
Qian, Haifeng [2 ,3 ]
Affiliations
[1] East China Normal Univ, Dept Comp Sci & Technol, Shanghai 200062, Peoples R China
[2] East China Normal Univ, Sch Software Engn, Shanghai 200062, Peoples R China
[3] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 201804, Peoples R China
[4] Shanghai Univ Elect Power, Sch Comp Sci & Technol, Shanghai 201306, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Locally weighted linear regression; privacy-preserving; Paillier homomorphic encryption; stochastic gradient descent;
DOI
10.1109/ACCESS.2019.2962700
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The rapid development of cloud computing has made the cloud an outsourced repository for big data, which generally contains a large amount of information. To mine the rich value in big data, machine learning is widely employed because of its ability to adapt to changes in the data. However, the data mining process may raise privacy issues for users, who are therefore reluctant to share their information. Outsourced data thus need to be handled securely, and encryption is considered the most straightforward way to protect data privacy. Machine learning in the ciphertext domain is more complicated than in plaintext, however, because the structural relationships among the data are no longer preserved; this motivates our focus on machine learning over encrypted big data. In this work, we study locally weighted linear regression (LWLR), a classic machine learning algorithm widely used in the real world, for example to make predictions and to find the best-fit curve through numerous data points. To address the privacy concerns in using the LWLR algorithm, we present a system for privacy-preserving locally weighted linear regression that not only protects the privacy of users but also encrypts the best-fit curve. To this end, we use Paillier homomorphic encryption as the building block to encrypt data and then apply stochastic gradient descent in the encrypted domain. After giving a security analysis, we study how to let Paillier encryption handle real numbers, implement the system in Python, and run experiments on real-world data sets to evaluate its effectiveness. The results show that it outperforms the state of the art and incurs negligible error compared with performing locally weighted linear regression in the clear.
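The abstract mentions two building blocks: Paillier's additively homomorphic encryption and a way to make it handle real numbers. The following is a minimal textbook sketch of both, not the authors' system. The fixed-point scale `SCALE`, the toy key size, and all function names are assumptions chosen for illustration. The primes are far too small for real security.

```python
import math
import secrets

SCALE = 10**6  # assumed fixed-point precision (six decimal digits)

def keygen(p=2**31 - 1, q=2**61 - 1):
    """Toy key pair from two known Mersenne primes (insecure key size)."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p-1, q-1)
    mu = pow(lam, -1, n)  # valid for the generator g = n + 1
    return n, (lam, mu)

def encrypt(n, m):
    """Enc(m) = (1 + m*n) * r^n mod n^2, using generator g = n + 1."""
    n2 = n * n
    while True:
        r = secrets.randbelow(n - 1) + 1  # random r in [1, n-1]
        if math.gcd(r, n) == 1:
            break
    return ((1 + (m % n) * n) * pow(r, n, n2)) % n2

def decrypt(n, priv, c):
    """Dec(c) = L(c^lam mod n^2) * mu mod n, with L(u) = (u - 1) // n."""
    lam, mu = priv
    u = pow(c, lam, n * n)
    return ((u - 1) // n) * mu % n

def encode(n, x, scale=SCALE):
    """Map a real number to Z_n as a scaled integer; negatives wrap mod n."""
    return round(x * scale) % n

def decode(n, v, scale=SCALE):
    """Inverse of encode: values above n/2 are read as negative."""
    if v > n // 2:
        v -= n
    return v / scale

def add(n, c1, c2):
    """Homomorphic addition: Enc(a) * Enc(b) mod n^2 decrypts to a + b."""
    return (c1 * c2) % (n * n)

def mul_plain(n, c, k):
    """Multiply an encrypted value by a plaintext integer k: Enc(a)^k."""
    return pow(c, k % n, n * n)

n, priv = keygen()
a = encrypt(n, encode(n, 1.5))
b = encrypt(n, encode(n, -2.25))
# Sum of two encrypted reals, computed without decrypting the inputs.
print(decode(n, decrypt(n, priv, add(n, a, b))))
# Product with a plaintext real: both factors carry SCALE, so decode
# the result with SCALE**2.
print(decode(n, decrypt(n, priv, mul_plain(n, a, encode(n, 0.5))), SCALE**2))
```

Because a gradient step for a linear model consists of sums and products with known coefficients, these two operations are the kind of primitive an encrypted-domain stochastic gradient descent relies on. How the paper actually splits the computation between parties is not stated in this record.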
Pages: 2247-2257
Number of pages: 11
Related papers
50 in total
  • [1] Privacy-Preserving Similarity Joins Over Encrypted Data
    Yuan, Xingliang
    Wang, Xinyu
    Wang, Cong
    Yu, Chenyun
    Nutanong, Sarana
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (11) : 2763 - 2775
  • [2] Outsourced privacy-preserving classification service over encrypted data
    Li, Tong
    Huang, Zhengan
    Li, Ping
    Liu, Zheli
    Jia, Chunfu
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2018, 106 : 100 - 110
  • [3] Efficient and Privacy-Preserving Eclipse Query over Encrypted Data
    Song, Weiyu
    Zhang, Yonggang
    Sun, Lili
    Zheng, Yandong
    Lu, Rongxing
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 1 - 6
  • [4] Privacy-preserving model evaluation for logistic and linear regression using homomorphically encrypted genotype data
    Hong, Seungwan
    Choi, Yoolim A.
    Joo, Daniel S.
    Gursoy, Gamze
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 156
  • [5] Privacy-Preserving Hierarchical Anonymization Framework over Encrypted Data
    Jia, Jing
    Saito, Kenta
    Nishi, Hiroaki
    IEEJ Transactions on Electronics, Information and Systems, 2024, 144 (10) : 1011 - 1019
  • [6] Privacy-preserving queries on encrypted data
    Yang, Zhiqiang
    Zhong, Sheng
    Wright, Rebecca N.
    Computer Security - ESORICS 2006, Proceedings, 2006, 4189 : 479 - 495
  • [7] Privacy-Preserving Ridge Regression on Hundreds of Millions of Records
    Nikolaenko, Valeria
    Weinsberg, Udi
    Ioannidis, Stratis
    Joye, Marc
    Boneh, Dan
    Taft, Nina
    2013 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2013, : 334 - 348
  • [8] Privacy-preserving Computation over Encrypted Vectors
    Hu, Rui
    Ding, Wenxiu
    Yan, Zheng
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [9] Enabling Comparable Search Over Encrypted Data for IoT with Privacy-Preserving
    Xu, Lei
    Xu, Chungen
    Liu, Zhongyi
    Wang, Yunling
    Wang, Jianfeng
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 60 (02): : 675 - 690
  • [10] Improvement on a privacy-preserving outsourced classification protocol over encrypted data
    Chai, Yanting
    Zhan, Yu
    Wang, Baocang
    Ping, Yuan
    Zhang, Zhili
    WIRELESS NETWORKS, 2020, 26 (06) : 4363 - 4374