HyObscure: Hybrid Obscuring for Privacy-Preserving Data Publishing

被引:0
|
作者
Han, Xiao [1 ,2 ,3 ]
Yang, Yuncong [1 ]
Wu, Junjie [4 ,5 ]
Xiong, Hui [6 ,7 ,8 ]
机构
[1] Shanghai Univ Finance & Econ, Key Lab Interdisciplinary Res Computat & Econ, Minist Educ, Shanghai 200433, Peoples R China
[2] Shanghai Univ Finance & Econ, Sch Informat Management & Engn, Shanghai 200433, Peoples R China
[3] Shanghai Univ Finance & Econ, Dishui Lake Adv Finance Inst, Shanghai 200433, Peoples R China
[4] Beihang Univ, Key Lab Data Intelligence & Management, Minist Ind & Informat Technol, Beijing 100191, Peoples R China
[5] Beihang Univ, Sch Econ & Management, Beijing 100191, Peoples R China
[6] HKUST Guangzhou, Thrust Artificial Intelligence, Guangzhou 511458, Guangdong, Peoples R China
[7] HKUST, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[8] Guangzhou HKUST Fok Ying Tung Res Inst, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Data privacy; Publishing; Economics; Task analysis; Machine learning; Loss measurement; Social networking (online); Attribute inference attack; generalization; hybrid obscuring; obfuscation; privacy preserving data publishing; INFERENCE ATTACKS;
D O I
10.1109/TKDE.2023.3331568
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Minimizing privacy leakage while ensuring data utility is a critical problem in a privacy-preserving data publishing task, from which data holders can boost platform engagements or enlarge data values. Most prior research concerned only with either privacy-insensitive or exact private data and resorts to a single obscuring method to achieve a privacy-utility tradeoff, which is inadequate for real-life hybrid data especially when facing machine learning-based inference attacks. This work takes a pilot study on privacy-preserving data publishing when both widely adopted generalization and obfuscation operations are employed for privacy-heterogeneous data protection. Specifically, we first propose novel measures for privacy and utility values quantification and formulate the hybrid privacy-preserving data obscuring problem to account for the joint effect of generalization and obfuscation. We then design a novel protection mechanism called HyObscure, which decomposes the original problem into three sub-problems to cross-iteratively optimize the hybrid operations for maximum privacy protection under a certain data utility guarantee. The convergence of the iterative process and the privacy leakage bound of HyObscure are also provided in theory. Extensive experiments demonstrate that HyObscure significantly outperforms a variety of state-of-the-art baseline methods when facing various inference attacks in different scenarios.
引用
收藏
页码:3893 / 3905
页数:13
相关论文
共 50 条
  • [21] STDP: Secure Privacy-Preserving Trajectory Data Publishing
    Eom, Chris Soo-Hyun
    Lee, Wookey
    Leung, Carson Kai-Sang
    IEEE 2018 INTERNATIONAL CONGRESS ON CYBERMATICS / 2018 IEEE CONFERENCES ON INTERNET OF THINGS, GREEN COMPUTING AND COMMUNICATIONS, CYBER, PHYSICAL AND SOCIAL COMPUTING, SMART DATA, BLOCKCHAIN, COMPUTER AND INFORMATION TECHNOLOGY, 2018, : 892 - 899
  • [22] Discretization: Privacy-preserving data publishing for causal discovery
    Ahn, Youngmin
    Park, Woongjoon
    Park, Gunwoong
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2025, 209
  • [23] Privacy-preserving trajectory data publishing by local suppression
    Chen, Rui
    Fung, Benjamin C. M.
    Mohammed, Noman
    Desai, Bipin C.
    Wang, Ke
    INFORMATION SCIENCES, 2013, 231 : 83 - 97
  • [24] A privacy-preserving data publishing algorithm for clustering application
    Chong, Zhihong
    Ni, Weiwei
    Liu, Tengteng
    Zhang, Yong
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (12): : 2083 - 2089
  • [25] Clustering-oriented privacy-preserving data publishing
    Ni, Weiwei
    Chong, Zhihong
    KNOWLEDGE-BASED SYSTEMS, 2012, 35 : 264 - 270
  • [26] Lightweight Privacy-Preserving Raw Data Publishing Scheme
    Chen, Jingxue
    Liu, Gao
    Liu, Yining
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (04) : 2170 - 2174
  • [27] Suppression techniques for privacy-preserving trajectory data publishing
    Lin, Chen-Yi
    KNOWLEDGE-BASED SYSTEMS, 2020, 206
  • [28] Privacy-Preserving Data Publishing Based On Utility Specification
    Tian, Hongwei
    Zhang, Weining
    2013 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM), 2013, : 114 - 121
  • [29] On Privacy-Preserving Publishing of Spontaneous ADE Reporting Data
    Lin, Wen-Yang
    Yang, Duan-Chun
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [30] Privacy-Preserving Data Publishing: A Survey of Recent Developments
    Fung, Benjamin C. M.
    Wang, Ke
    Chen, Rui
    Yu, Philip S.
    ACM COMPUTING SURVEYS, 2010, 42 (04)