HyObscure: Hybrid Obscuring for Privacy-Preserving Data Publishing

被引:0
|
作者
Han, Xiao [1 ,2 ,3 ]
Yang, Yuncong [1 ]
Wu, Junjie [4 ,5 ]
Xiong, Hui [6 ,7 ,8 ]
机构
[1] Shanghai Univ Finance & Econ, Key Lab Interdisciplinary Res Computat & Econ, Minist Educ, Shanghai 200433, Peoples R China
[2] Shanghai Univ Finance & Econ, Sch Informat Management & Engn, Shanghai 200433, Peoples R China
[3] Shanghai Univ Finance & Econ, Dishui Lake Adv Finance Inst, Shanghai 200433, Peoples R China
[4] Beihang Univ, Key Lab Data Intelligence & Management, Minist Ind & Informat Technol, Beijing 100191, Peoples R China
[5] Beihang Univ, Sch Econ & Management, Beijing 100191, Peoples R China
[6] HKUST Guangzhou, Thrust Artificial Intelligence, Guangzhou 511458, Guangdong, Peoples R China
[7] HKUST, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[8] Guangzhou HKUST Fok Ying Tung Res Inst, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Data privacy; Publishing; Economics; Task analysis; Machine learning; Loss measurement; Social networking (online); Attribute inference attack; generalization; hybrid obscuring; obfuscation; privacy preserving data publishing; INFERENCE ATTACKS;
D O I
10.1109/TKDE.2023.3331568
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Minimizing privacy leakage while ensuring data utility is a critical problem in a privacy-preserving data publishing task, from which data holders can boost platform engagements or enlarge data values. Most prior research concerned only with either privacy-insensitive or exact private data and resorts to a single obscuring method to achieve a privacy-utility tradeoff, which is inadequate for real-life hybrid data especially when facing machine learning-based inference attacks. This work takes a pilot study on privacy-preserving data publishing when both widely adopted generalization and obfuscation operations are employed for privacy-heterogeneous data protection. Specifically, we first propose novel measures for privacy and utility values quantification and formulate the hybrid privacy-preserving data obscuring problem to account for the joint effect of generalization and obfuscation. We then design a novel protection mechanism called HyObscure, which decomposes the original problem into three sub-problems to cross-iteratively optimize the hybrid operations for maximum privacy protection under a certain data utility guarantee. The convergence of the iterative process and the privacy leakage bound of HyObscure are also provided in theory. Extensive experiments demonstrate that HyObscure significantly outperforms a variety of state-of-the-art baseline methods when facing various inference attacks in different scenarios.
引用
收藏
页码:3893 / 3905
页数:13
相关论文
共 50 条
  • [31] Logical Foundations of Privacy-Preserving Publishing of Linked Data
    Grau, Bernardo Cuenca
    Kostylev, Egor V.
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 943 - 949
  • [32] Privacy, space, and time: a survey on privacy-preserving continuous data publishing
    Katsomallos, Manos
    Tzompanaki, Katerina
    Kotzinos, Dimitris
    JOURNAL OF SPATIAL INFORMATION SCIENCE, 2019, (19): : 57 - 103
  • [33] Privacy-preserving data utilization in hybrid clouds
    Li, Jingwei
    Li, Jin
    Chen, Xiaofeng
    Liu, Zheli
    Jia, Chunfu
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 30 : 98 - 106
  • [34] Hybrid Transformation in Privacy-Preserving Data Mining
    Putri, Awalia W.
    Hira, Laksmiwati
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2016,
  • [35] Privacy-Preserving Data Publishing for Multiple Numerical Sensitive Attributes
    Qinghai Liu
    Hong Shen
    Yingpeng Sang
    Tsinghua Science and Technology, 2015, 20 (03) : 246 - 254
  • [36] Quantifying the costs and benefits of privacy-preserving health data publishing
    Khokhar, Rashid Hussain
    Chen, Rui
    Fung, Benjamin C. M.
    Lui, Siu Man
    JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 50 : 107 - 121
  • [37] Pseudonym Exchange for Privacy-Preserving Publishing of Trajectory Data Set
    Mano, Ken
    Minami, Kazuhiro
    Maruyama, Hiroshi
    2014 IEEE 3RD GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2014, : 691 - 695
  • [38] Background knowledge attacks in privacy-preserving data publishing models
    Desai, Nidhi
    Das, Manik Lal
    Chaudhari, Payal
    Kumar, Naveen
    COMPUTERS & SECURITY, 2022, 122
  • [39] Privacy-Preserving Spatio-Temporal Patient Data Publishing
    Olawoyin, Anifat M.
    Leung, Carson K.
    Choudhury, Ratna
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2020, PT II, 2020, 12392 : 407 - 416
  • [40] Toward Scalable Anonymization for Privacy-Preserving Big Data Publishing
    Mehta, Brijesh B.
    Rao, Udai Pratap
    RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 2, 2018, 708 : 297 - 304