HyObscure: Hybrid Obscuring for Privacy-Preserving Data Publishing

Authors
Han, Xiao [1, 2, 3]
Yang, Yuncong [1]
Wu, Junjie [4, 5]
Xiong, Hui [6, 7, 8]
Affiliations
[1] Shanghai Univ Finance & Econ, Key Lab Interdisciplinary Res Computat & Econ, Minist Educ, Shanghai 200433, Peoples R China
[2] Shanghai Univ Finance & Econ, Sch Informat Management & Engn, Shanghai 200433, Peoples R China
[3] Shanghai Univ Finance & Econ, Dishui Lake Adv Finance Inst, Shanghai 200433, Peoples R China
[4] Beihang Univ, Key Lab Data Intelligence & Management, Minist Ind & Informat Technol, Beijing 100191, Peoples R China
[5] Beihang Univ, Sch Econ & Management, Beijing 100191, Peoples R China
[6] HKUST Guangzhou, Thrust Artificial Intelligence, Guangzhou 511458, Guangdong, Peoples R China
[7] HKUST, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[8] Guangzhou HKUST Fok Ying Tung Res Inst, Guangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Data privacy; Publishing; Economics; Task analysis; Machine learning; Loss measurement; Social networking (online); Attribute inference attack; generalization; hybrid obscuring; obfuscation; privacy preserving data publishing; INFERENCE ATTACKS;
DOI
10.1109/TKDE.2023.3331568
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Minimizing privacy leakage while ensuring data utility is a critical problem in privacy-preserving data publishing, through which data holders can boost platform engagement or increase the value of their data. Most prior research is concerned with either privacy-insensitive or exact private data only and resorts to a single obscuring method to achieve a privacy-utility tradeoff, which is inadequate for real-life hybrid data, especially when facing machine learning-based inference attacks. This work presents a pilot study on privacy-preserving data publishing in which the widely adopted generalization and obfuscation operations are both employed to protect privacy-heterogeneous data. Specifically, we first propose novel measures for quantifying privacy and utility and formulate the hybrid privacy-preserving data obscuring problem to account for the joint effect of generalization and obfuscation. We then design a novel protection mechanism called HyObscure, which decomposes the original problem into three sub-problems to cross-iteratively optimize the hybrid operations for maximum privacy protection under a given data utility guarantee. We also establish theoretically the convergence of the iterative process and a bound on the privacy leakage of HyObscure. Extensive experiments demonstrate that HyObscure significantly outperforms a variety of state-of-the-art baseline methods when facing various inference attacks in different scenarios.
Pages: 3893-3905
Number of pages: 13