Anonymization in the time of big data

被引:0
|
作者
Domingo-Ferrer J. [1 ]
Soria-Comas J. [1 ]
机构
[1] Department of Computer Engineering and Mathematics, Universitat Rovira i Virgili, Av. Països Catalans 26, Tarragona, 43007, CA
关键词
Big data; Curse of dimensionality; Data anonymization; K-anonymity; Multiple releases;
D O I
10.1007/978-3-319-45381-15
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this work we explore how viable is anonymization to prevent disclosure in structured big data. For the sake of concreteness, we focus on k-anonymity, which is the best-known privacy model based on anonymization. We identify two main challenges to use k-anonymity in big data. First, confidential attributes can also be quasi-identifier attributes, which increases the number of quasi-identifier attributes and may lead to a large information loss to attain k-anonymity. Second, in big data there is an unlimited number of data controllers, who may publish independent k-anonymous releases on overlapping populations of subjects; the k-anonymity guarantee does not longer hold if an observer pools such independent releases. We propose solutions to deal with the above two challenges. Our conclusion is that, with the proposed adjustments, k-anonymity is still useful in a context of big data. © Springer International Publishing Switzerland 2016.
引用
收藏
页码:57 / 68
页数:11
相关论文
共 50 条
  • [31] Data Verification of Logical Pk-Anonymization with Big Data Application and Key Generation in Cloud Computing
    Kumar, Sindhe Phani
    Anandan, R.
    JOURNAL OF FUNCTION SPACES, 2022, 2022
  • [32] Semi-local Time sensitive Anonymization of Clinical Data
    Hammer, Freimut Gebhard Herbert
    Buglowski, Mateusz
    Stollenwerk, Andre
    SCIENTIFIC DATA, 2024, 11 (01)
  • [33] Motion sensor data anonymization by time-frequency filtering
    Debs, Noelie
    Jourdan, Theo
    Moukadem, Ali
    Boutet, Antoine
    Frindel, Carole
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 1707 - 1711
  • [34] Privacy Preserving Parallel Clustering Based Anonymization for Big Data Using MapReduce Framework
    Lawrance, Josephine Usha
    Jesudhasan, Jesu Vedha Nayahi
    APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (15) : 1587 - 1620
  • [35] A Scalable (α, k)-Anonymization Approach using MapReduce for Privacy Preserving Big Data Publishing
    Mehta, Brijesh B.
    Gupta, Ruchika
    Rao, Udai Pratap
    Muthiyan, Mukesh
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [36] Time Distortion Anonymization for the Publication of Mobility Data with High Utility
    Primault, Vincent
    Ben Mokhtar, Sonia
    Lauradoux, Cedric
    Brunie, Lionel
    2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 1, 2015, : 539 - 546
  • [37] A MapReduce Based Approach of Scalable Multidimensional Anonymization for Big Data Privacy Preservation on Cloud
    Zhang, Xuyun
    Yang, Chi
    Nepal, Surya
    Liu, Chang
    Dou, Wanchun
    Chen, Jinjun
    2013 IEEE THIRD INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING (CGC 2013), 2013, : 105 - 112
  • [38] Adaptive Utility-based Anonymization Model: Performance Evaluation on Big Data Sets
    Panackal, Jisha Jose
    Pillai, Anitha S.
    BIG DATA, CLOUD AND COMPUTING CHALLENGES, 2015, 50 : 347 - 352
  • [39] Anonymization of Statistical Data
    di Vimercati, Sabrina De Capitani
    Foresti, Sara
    Livraga, Giovanni
    Samarati, Pierangela
    IT-INFORMATION TECHNOLOGY, 2011, 53 (01): : 18 - 25
  • [40] Big data for big questions: it is time for data analysts to act
    Moscato, Pablo
    FUTURE SCIENCE OA, 2015, 1 (03):