Sensitive Label Privacy Preservation with Anatomization for Data Publishing

被引:16
|
作者
Yao, Lin [1 ]
Chen, Zhenyu [2 ]
Wang, Xin [3 ]
Liu, Dong [2 ]
Wu, Guowei [2 ]
机构
[1] Dalian Univ Technol, DUT RU Int Sch Informat Sci & Engn, Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian 116081, Peoples R China
[2] Dalian Univ Technol, Sch Software, Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian 116081, Peoples R China
[3] SUNY Stony Brook, Dept Elect & Comp Engn, Stony Brook, NY 11794 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Privacy preservation; anatomization; sensitive label; ANONYMIZING CLASSIFICATION DATA; K-ANONYMITY; PRESERVING PRIVACY;
D O I
10.1109/TDSC.2019.2919833
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data in its original form, however, typically contain sensitive information about individuals. Directly publishing raw data will violate the privacy of people involed. Consequently, it becomes increasingly important to preserve the privacy of published data. An attacker is apt to identify an individual from the published tables, with attacks through the record linkage, attribute linkage, table linkage or probabilistic attack. Although algorithms based on generalization and suppression have been proposed to protect the sensitive attributes and resist these multiple types of attacks, they often suffer from large information loss by replacing specific values with more general ones. Alternatively, anatomization and permutation operations can de-link the relation between attributes without modifying them. In this paper, we propose a scheme Sensitive Label Privacy Preservation with Anatomization (SLPPA) to protect the privacy of published data. SLPPA includes two procedures, table division and group division. During the table division, we adopt entropy and mean-square contingency coefficient to partition attributes into separate tables to inject uncertainty for reconstructing the original table. During the group division, all the individuals in the original table are partitioned into non-overlapping groups so that the published data satisfies the pre-defined privacy requirements of our (alpha,beta,gamma,delta) model. Two comprehensive sets of real-world relationship data are applied to evaluate the performance of our anonymization approach. Simulations and privacy analysis show our scheme possesses better privacy while ensuring higher utility.
引用
收藏
页码:904 / 917
页数:14
相关论文
共 50 条
  • [1] Privacy preservation for attribute order sensitive workload in medical data publishing
    Gao, Ai-Qiang
    Diao, Lu-Hong
    Ruan Jian Xue Bao/Journal of Software, 2009, 20 (SUPPL. 1): : 314 - 320
  • [2] Privacy Preservation for Attribute Order Sensitive Workload in Medical Data Publishing
    Gao Ai-qiang
    Diao Lu-hong
    2009 IEEE INTERNATIONAL SYMPOSIUM ON IT IN MEDICINE & EDUCATION, VOLS 1 AND 2, PROCEEDINGS, 2009, : 1140 - +
  • [3] GAN-based Differential Privacy Trajectory Data Publishing with Sensitive Label
    Yao, Lin
    Zhang, Yu
    Zheng, Zhaolong
    Wu, Guowei
    2022 8TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS, BIGCOM, 2022, : 112 - 119
  • [4] Utility of Privacy Preservation for Health Data Publishing
    Wu, Lengdong
    He, Hua
    Zaiane, Osmar R.
    2013 IEEE 26TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2013, : 510 - 511
  • [5] A Data Publishing System Based on Privacy Preservation
    Wang, Zhihui
    Zhu, Yun
    Zhou, Xuchen
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 553 - 556
  • [6] Enhancing Privacy Preservation in Speech Data Publishing
    Zhang, Guanglin
    Ni, Sifan
    Zhao, Ping
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (08) : 7357 - 7367
  • [7] Preservation of Privacy in Publishing Social Network Data
    Wei, Qiong
    Lu, Yansheng
    PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 421 - 425
  • [8] An Enhanced Method for Privacy Preservation in Data Publishing
    Thomas, Christy
    Thomas, Diya
    2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
  • [9] PUBLISHING SENSITIVE TIME-SERIES DATA UNDER PRESERVATION OF PRIVACY AND DISTANCE ORDERS
    Choi, Mi-Jung
    Kim, Hea-Suk
    Moon, Yang-Sae
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (5B): : 3619 - 3638
  • [10] Sensitive attribute privacy preservation of trajectory data publishing based on l-diversity
    Lin Yao
    Zhenyu Chen
    Haibo Hu
    Guowei Wu
    Bin Wu
    Distributed and Parallel Databases, 2021, 39 : 785 - 811