Sensitive Label Privacy Preservation with Anatomization for Data Publishing

被引:16
|
作者
Yao, Lin [1 ]
Chen, Zhenyu [2 ]
Wang, Xin [3 ]
Liu, Dong [2 ]
Wu, Guowei [2 ]
机构
[1] Dalian Univ Technol, DUT RU Int Sch Informat Sci & Engn, Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian 116081, Peoples R China
[2] Dalian Univ Technol, Sch Software, Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian 116081, Peoples R China
[3] SUNY Stony Brook, Dept Elect & Comp Engn, Stony Brook, NY 11794 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Privacy preservation; anatomization; sensitive label; ANONYMIZING CLASSIFICATION DATA; K-ANONYMITY; PRESERVING PRIVACY;
D O I
10.1109/TDSC.2019.2919833
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data in its original form, however, typically contain sensitive information about individuals. Directly publishing raw data will violate the privacy of people involed. Consequently, it becomes increasingly important to preserve the privacy of published data. An attacker is apt to identify an individual from the published tables, with attacks through the record linkage, attribute linkage, table linkage or probabilistic attack. Although algorithms based on generalization and suppression have been proposed to protect the sensitive attributes and resist these multiple types of attacks, they often suffer from large information loss by replacing specific values with more general ones. Alternatively, anatomization and permutation operations can de-link the relation between attributes without modifying them. In this paper, we propose a scheme Sensitive Label Privacy Preservation with Anatomization (SLPPA) to protect the privacy of published data. SLPPA includes two procedures, table division and group division. During the table division, we adopt entropy and mean-square contingency coefficient to partition attributes into separate tables to inject uncertainty for reconstructing the original table. During the group division, all the individuals in the original table are partitioned into non-overlapping groups so that the published data satisfies the pre-defined privacy requirements of our (alpha,beta,gamma,delta) model. Two comprehensive sets of real-world relationship data are applied to evaluate the performance of our anonymization approach. Simulations and privacy analysis show our scheme possesses better privacy while ensuring higher utility.
引用
收藏
页码:904 / 917
页数:14
相关论文
共 50 条
  • [21] Privacy in Data Publishing
    Gehrke, Johannes
    Kifer, Daniel
    Machanavajjhala, Ashwin
    26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 1213 - 1213
  • [22] A Generalization-Based Approach for Personalized Privacy Preservation in Trajectory Data Publishing
    Komishani, Elahe Ghasemi
    Abadi, Mahdi
    2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 1129 - 1135
  • [23] Publishing Time-Series Data under Preservation of Privacy and Distance Orders
    Moon, Yang-Sae
    Kim, Hea-Suk
    Kim, Sang-Pil
    Bertino, Elisa
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT 2, 2010, 6262 : 17 - +
  • [24] Privacy-Preserving Data Publishing for Multiple Numerical Sensitive Attributes
    Qinghai Liu
    Hong Shen
    Yingpeng Sang
    Tsinghua Science and Technology, 2015, 20 (03) : 246 - 254
  • [25] A privacy-preserving method for publishing data with multiple sensitive attributes
    Yi, Tong
    Shi, Minyong
    Shang, Wenqian
    Zhu, Haibin
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (01) : 222 - 238
  • [26] Generalization-based privacy preservation and discrimination prevention in data publishing and mining
    Sara Hajian
    Josep Domingo-Ferrer
    Oriol Farràs
    Data Mining and Knowledge Discovery, 2014, 28 : 1158 - 1188
  • [27] A Toll Data Publishing Method using Encryption and Differential Privacy Preservation Technology
    Shen, Lijun
    Su, Peng
    Lu, Xiaoyu
    Wang, Xiao
    Liu, Yifei
    Ouyang, Hai
    PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 1586 - 1594
  • [28] Generalization-based privacy preservation and discrimination prevention in data publishing and mining
    Hajian, Sara
    Domingo-Ferrer, Josep
    Farras, Oriol
    DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (5-6) : 1158 - 1188
  • [29] Privacy Preservation for Trajectory Data Publishing by Look-Up Table Generalization
    Harnsamut, Nattapon
    Natwichai, Juggapong
    Riyana, Surapon
    DATABASES THEORY AND APPLICATIONS, ADC 2018, 2018, 10837 : 15 - 27
  • [30] Privacy-Preserving Data Publishing for Multiple Numerical Sensitive Attributes
    Liu, Qinghai
    Shen, Hong
    Sang, Yingpeng
    TSINGHUA SCIENCE AND TECHNOLOGY, 2015, 20 (03) : 246 - 254