Privacy-Preserving Locally Weighted Linear Regression Over Encrypted Millions of Data

Cited by: 5
Authors
Dong, Xiaoxia [1, 3]
Chen, Jie [2, 3]
Zhang, Kai [4]
Qian, Haifeng [2, 3]
Affiliations
[1] East China Normal Univ, Dept Comp Sci & Technol, Shanghai 200062, Peoples R China
[2] East China Normal Univ, Sch Software Engn, Shanghai 200062, Peoples R China
[3] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 201804, Peoples R China
[4] Shanghai Univ Elect Power, Sch Comp Sci & Technol, Shanghai 201306, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Locally weighted linear regression; privacy-preserving; Paillier homomorphic encryption; stochastic gradient descent
DOI
10.1109/ACCESS.2019.2962700
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
With the rise of cloud computing, the cloud has become an outsourced repository for big data, which generally contains a great deal of information. To mine the value in big data, machine learning is widely employed because of its ability to adapt to changes in the data. However, data mining may expose users' private information, so users are reluctant to share their data. Outsourced data therefore need to be handled securely. Encryption is considered the most straightforward way to preserve data privacy, but machine learning over ciphertext is more complicated than over plaintext, because the structural relationships among the data are no longer maintained. We therefore focus on machine learning over encrypted big data. In this work, we study locally weighted linear regression (LWLR), a classic machine learning algorithm widely used in real-world tasks such as prediction and finding the best-fit curve through numerous data points. To address the privacy concerns in using LWLR, we present a system for privacy-preserving locally weighted linear regression that not only protects the privacy of users but also encrypts the best-fit curve. To this end, we use Paillier homomorphic encryption as the building block to encrypt data and then apply stochastic gradient descent in the encrypted domain. After giving a security analysis, we study how to let Paillier encryption handle real numbers, implement the system in Python, and run experiments on real-world data sets to evaluate its effectiveness. The results show that it outperforms the state of the art and incurs negligible error compared with performing locally weighted linear regression in the clear.
Pages: 2247-2257
Page count: 11
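
The abstract mentions two building blocks: Paillier homomorphic encryption and a way to make it handle real numbers. The following is a minimal sketch of that idea, not the paper's protocol. It assumes a fixed-point encoding with a scaling factor `SCALE`, a toy key built from two small Mersenne primes (far too small to be secure), and plaintext weights applied to encrypted values. All function names are hypothetical and chosen for illustration. It requires Python 3.8 or later for modular inverses via `pow`.

```python
import math
import random

SCALE = 10**6  # assumed fixed-point precision: 6 decimal digits


def keygen(p=2**31 - 1, q=2**61 - 1):
    """Toy Paillier key from two known Mersenne primes (insecure sizes)."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p-1, q-1)
    mu = pow(lam, -1, n)  # valid because the generator is fixed to g = n + 1
    return n, (lam, mu)


def encode(x, n, scale=SCALE):
    """Map a real number to an integer mod n; negatives wrap around n."""
    return round(x * scale) % n


def decode(v, n, scale=SCALE):
    """Invert encode: values above n/2 are interpreted as negative."""
    if v > n // 2:
        v -= n
    return v / scale


def encrypt(n, m):
    """c = (1 + n)^m * r^n mod n^2, using (1 + n)^m = 1 + m*n mod n^2."""
    n2 = n * n
    while True:
        r = random.randrange(1, n)
        if math.gcd(r, n) == 1:
            break
    return ((1 + m * n) % n2) * pow(r, n, n2) % n2


def decrypt(n, priv, c):
    """m = L(c^lam mod n^2) * mu mod n, where L(u) = (u - 1) // n."""
    lam, mu = priv
    u = pow(c, lam, n * n)
    return ((u - 1) // n) * mu % n


def weighted_sum(n, ciphertexts, weights):
    """Homomorphically compute Enc(sum_i w_i * x_i) from Enc(x_i).

    Multiplying ciphertexts adds plaintexts; raising a ciphertext to an
    integer power multiplies its plaintext by that integer. Each weight is
    encoded at SCALE, so the result carries a scale of SCALE**2.
    """
    n2 = n * n
    acc = encrypt(n, 0)
    for c, w in zip(ciphertexts, weights):
        acc = acc * pow(c, encode(w, n), n2) % n2
    return acc


if __name__ == "__main__":
    n, priv = keygen()
    xs = [1.5, -2.0, 4.0]     # private values, encrypted before outsourcing
    ws = [0.5, 0.25, 0.25]    # plaintext weights (an assumption of this sketch)
    cts = [encrypt(n, encode(x, n)) for x in xs]
    total = weighted_sum(n, cts, ws)
    print(decode(decrypt(n, priv, total), n, SCALE**2))  # prints 1.25
```

Weighted sums of this kind are the sort of arithmetic a gradient step needs, which is why an additively homomorphic scheme can support gradient descent at all. Note that every plaintext-by-ciphertext multiplication raises the scale by another factor of `SCALE`, so an iterative method has to track the scale and keep intermediate values below `n / 2`.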