Toward Scalable Anonymization for Privacy-Preserving Big Data Publishing

Cited by: 4
Authors
Mehta, Brijesh B. [1]
Rao, Udai Pratap [1]
Affiliation
[1] Sardar Vallabhbhai Natl Inst Technol, Surat, India
Keywords
Big data; Big data privacy; k-anonymity
DOI
10.1007/978-981-10-8636-6_31
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Big data is collected from different sources and processed with different tools, which leads to privacy issues. Privacy-preserving data publishing techniques such as k-anonymity, l-diversity, and t-closeness are used to de-identify data, but re-identification remains possible because the data is collected from multiple sources. With a large amount of data, less generalization or suppression is required to achieve the same level of privacy, which is known as the "large crowd effect," but anonymizing such a large dataset is itself a challenging task. MapReduce can handle a large amount of data, but it distributes the data into small chunks, so the advantage of large data cannot be realized. Therefore, scalability of privacy-preserving techniques has become a challenging area of research, and we explore it by proposing a scalable k-anonymity algorithm for MapReduce. Compared with an existing algorithm, our approach shows a significant improvement in running time.
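To make the k-anonymity property mentioned in the abstract concrete, the following is a minimal Python sketch of the standard definition: a table is k-anonymous when every combination of quasi-identifier values appears in at least k records. It is not the algorithm proposed in the paper and involves no MapReduce. The record fields (`age`, `zip`, `disease`), the helper names, and the fixed-width age generalization are all assumptions made for illustration.

```python
from collections import Counter


def is_k_anonymous(records, quasi_identifiers, k):
    """Check that each quasi-identifier combination occurs in at least k records."""
    groups = Counter(tuple(r[q] for q in quasi_identifiers) for r in records)
    return all(count >= k for count in groups.values())


def generalize_age(record, width):
    """Replace an exact age with a range of the given width (illustrative generalization)."""
    low = (record["age"] // width) * width
    return {**record, "age": f"{low}-{low + width - 1}"}


# Hypothetical toy table; "age" and "zip" are treated as quasi-identifiers.
records = [
    {"age": 23, "zip": "39500", "disease": "flu"},
    {"age": 27, "zip": "39500", "disease": "cold"},
    {"age": 34, "zip": "39500", "disease": "flu"},
    {"age": 38, "zip": "39500", "disease": "asthma"},
]
quasi_identifiers = ["age", "zip"]

# Exact ages make every record unique, so the raw table is not 2-anonymous.
raw_ok = is_k_anonymous(records, quasi_identifiers, 2)

# Generalizing ages into 10-year ranges yields two groups of two records each.
generalized = [generalize_age(r, 10) for r in records]
generalized_ok = is_k_anonymous(generalized, quasi_identifiers, 2)
```

In this toy table the raw records fail the check for k = 2, while the generalized records pass it. With many more records, narrower ranges could already satisfy the same k, which is the intuition behind the "large crowd effect" described above.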
Pages: 297-304
Number of pages: 8