Anonymization in the time of big data

Authors
Domingo-Ferrer J. [1]
Soria-Comas J. [1]
Affiliations
[1] Department of Computer Engineering and Mathematics, Universitat Rovira i Virgili, Av. Països Catalans 26, Tarragona, 43007, CA
Keywords
Big data; Curse of dimensionality; Data anonymization; K-anonymity; Multiple releases;
DOI
10.1007/978-3-319-45381-1_5
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
In this work we explore how viable anonymization is as a means of preventing disclosure in structured big data. For the sake of concreteness, we focus on k-anonymity, which is the best-known privacy model based on anonymization. We identify two main challenges in using k-anonymity in big data. First, confidential attributes can also be quasi-identifier attributes, which increases the number of quasi-identifier attributes and may lead to a large information loss when attaining k-anonymity. Second, in big data there is an unlimited number of data controllers, who may publish independent k-anonymous releases on overlapping populations of subjects; the k-anonymity guarantee no longer holds if an observer pools such independent releases. We propose solutions to deal with these two challenges. Our conclusion is that, with the proposed adjustments, k-anonymity is still useful in a context of big data. © Springer International Publishing Switzerland 2016.
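The second challenge in the abstract can be illustrated with a minimal sketch. It is not the authors' method or their proposed solution, only a toy demonstration of why pooling matters. The data, the attribute names (`age`, `zip`, `disease`), and the helper functions `is_k_anonymous` and `candidate_values` are all hypothetical. It assumes that two controllers publish the same four subjects, that each subject has the same confidential value in both releases, and that the observer knows the target's quasi-identifier values and that the target appears in both releases.

```python
from collections import Counter

QUASI_IDENTIFIERS = ("age", "zip")  # hypothetical attribute names


def is_k_anonymous(records, k, quasi_identifiers=QUASI_IDENTIFIERS):
    """True if every quasi-identifier combination occurs in at least k records."""
    counts = Counter(tuple(r[q] for q in quasi_identifiers) for r in records)
    return all(c >= k for c in counts.values())


def candidate_values(release, known, confidential="disease"):
    """Confidential values of the records compatible with the observer's
    knowledge; '*' stands for a suppressed value that matches anything."""
    return {
        r[confidential]
        for r in release
        if all(r[q] in ("*", known[q]) for q in known)
    }


# Toy data (assumed): controller A suppresses zip, controller B suppresses age.
release_a = [
    {"age": 34, "zip": "*", "disease": "flu"},
    {"age": 34, "zip": "*", "disease": "cancer"},
    {"age": 45, "zip": "*", "disease": "asthma"},
    {"age": 45, "zip": "*", "disease": "diabetes"},
]
release_b = [
    {"age": "*", "zip": "43001", "disease": "flu"},
    {"age": "*", "zip": "43001", "disease": "asthma"},
    {"age": "*", "zip": "43002", "disease": "cancer"},
    {"age": "*", "zip": "43002", "disease": "diabetes"},
]

# What the observer knows about the target subject.
target = {"age": 34, "zip": "43001"}

from_a = candidate_values(release_a, target)
from_b = candidate_values(release_b, target)
pooled = from_a & from_b  # intersect the candidates from both releases

print(is_k_anonymous(release_a, 2), is_k_anonymous(release_b, 2))
print(sorted(from_a), sorted(from_b), sorted(pooled))
```

Each release on its own is 2-anonymous and leaves two candidate confidential values for the target, but intersecting the two candidate sets leaves a single value, so the pooled releases disclose what neither release discloses alone.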
Pages: 57-68
Page count: 11