HyObscure: Hybrid Obscuring for Privacy-Preserving Data Publishing

被引:0
|
作者
Han, Xiao [1 ,2 ,3 ]
Yang, Yuncong [1 ]
Wu, Junjie [4 ,5 ]
Xiong, Hui [6 ,7 ,8 ]
机构
[1] Shanghai Univ Finance & Econ, Key Lab Interdisciplinary Res Computat & Econ, Minist Educ, Shanghai 200433, Peoples R China
[2] Shanghai Univ Finance & Econ, Sch Informat Management & Engn, Shanghai 200433, Peoples R China
[3] Shanghai Univ Finance & Econ, Dishui Lake Adv Finance Inst, Shanghai 200433, Peoples R China
[4] Beihang Univ, Key Lab Data Intelligence & Management, Minist Ind & Informat Technol, Beijing 100191, Peoples R China
[5] Beihang Univ, Sch Econ & Management, Beijing 100191, Peoples R China
[6] HKUST Guangzhou, Thrust Artificial Intelligence, Guangzhou 511458, Guangdong, Peoples R China
[7] HKUST, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[8] Guangzhou HKUST Fok Ying Tung Res Inst, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Data privacy; Publishing; Economics; Task analysis; Machine learning; Loss measurement; Social networking (online); Attribute inference attack; generalization; hybrid obscuring; obfuscation; privacy preserving data publishing; INFERENCE ATTACKS;
D O I
10.1109/TKDE.2023.3331568
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Minimizing privacy leakage while ensuring data utility is a critical problem in a privacy-preserving data publishing task, from which data holders can boost platform engagements or enlarge data values. Most prior research concerned only with either privacy-insensitive or exact private data and resorts to a single obscuring method to achieve a privacy-utility tradeoff, which is inadequate for real-life hybrid data especially when facing machine learning-based inference attacks. This work takes a pilot study on privacy-preserving data publishing when both widely adopted generalization and obfuscation operations are employed for privacy-heterogeneous data protection. Specifically, we first propose novel measures for privacy and utility values quantification and formulate the hybrid privacy-preserving data obscuring problem to account for the joint effect of generalization and obfuscation. We then design a novel protection mechanism called HyObscure, which decomposes the original problem into three sub-problems to cross-iteratively optimize the hybrid operations for maximum privacy protection under a certain data utility guarantee. The convergence of the iterative process and the privacy leakage bound of HyObscure are also provided in theory. Extensive experiments demonstrate that HyObscure significantly outperforms a variety of state-of-the-art baseline methods when facing various inference attacks in different scenarios.
引用
收藏
页码:3893 / 3905
页数:13
相关论文
共 50 条
  • [1] Privacy-Preserving Data Publishing
    Liu, Ruilin
    Wang, Hui
    2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, : 305 - 308
  • [2] Privacy-Preserving Data Publishing
    Chen, Bee-Chung
    Kifer, Daniel
    LeFevre, Kristen
    Machanavajjhala, Ashwin
    FOUNDATIONS AND TRENDS IN DATABASES, 2009, 2 (1-2): : 1 - 167
  • [3] Privacy-Preserving Characterization and Data Publishing
    Ren, Jian
    Li, Tongtong
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 549 - 553
  • [4] Privacy-preserving publishing for streaming data
    Huang, Xuezhen
    Liu, Jiqiang
    Han, Zhen
    Yang, Jun
    Journal of Computational Information Systems, 2015, 11 (05): : 1863 - 1877
  • [5] Privacy-Preserving Sequential Data Publishing
    Wang, Huili
    Ma, Wenping
    Zheng, Haibin
    Liang, Zhi
    Wu, Qianhong
    NETWORK AND SYSTEM SECURITY, NSS 2019, 2019, 11928 : 596 - 614
  • [6] Privacy-Preserving Big Data Publishing
    Zakerzadeh, Hessam
    Aggarwal, Charu C.
    Barker, Ken
    PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2015,
  • [7] Privacy-Preserving Publishing of Hierarchical Data
    Ozalp, Ismet
    Gursoy, Mehmet Emre
    Nergiz, Mehmet Ercan
    Saygin, Yucel
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2016, 19 (03)
  • [8] Privacy-Preserving Data Publishing: An Overview
    Wong, Raymond Chi-Wing
    Fu, Ada Wai-Chee
    Synthesis Lectures on Data Management, 2010, 2 (01): : 1 - 138
  • [9] Personalized Privacy-Preserving Trajectory Data Publishing
    Lu Qiwei
    Wang Caimei
    Xiong Yan
    Xia Huihua
    Huang Wenchao
    Gong Xudong
    CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (02) : 285 - 291
  • [10] Privacy-preserving data publishing for cluster analysis
    Fung, Benjamin C. M.
    Wang, Ke
    Wang, Lingyu
    Hung, Patrick C. K.
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (06) : 552 - 575