OCDP: An enhanced perturbation approach for data privacy protection

Authors
Devi, S. Sathiya [2 ]
Jayasri, K. [1 ]
Affiliations
[1] Univ Coll Engn, Dept Comp Sci & Engn, BIT Campus, Tiruchirappalli, India
[2] Univ Coll Engn, Dept Comp Sci & Engn, BIT Campus, Tiruchirappalli, India
Keywords
Data privacy; Noise parameter (epsilon); k-means clustering; Particle Swarm Optimization (PSO); Mutual information (MI); Differential Privacy (DP)
DOI
10.1016/j.jisa.2025.104046
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the exponential growth of the internet and digital technology, the volume of personal data collected, stored and shared across platforms has increased significantly, which poses privacy risks including unauthorized access, misuse and exploitation. To mitigate these risks, effective privacy mechanisms are crucial. One such mechanism is Differential Privacy (DP), which aims to protect personal information by introducing noise into the data to obstruct individual identification. Although it effectively prevents breaches of personal information, a trade-off exists between privacy and accuracy. Additionally, DP often requires meticulous tuning of the noise parameter, which can be complex and resource intensive. To overcome these challenges, this paper proposes a method named Opti-Cluster Differential Privacy (OCDP). OCDP is designed to automatically determine the optimal amount of noise for a dataset. The dataset is first divided into non-overlapping clusters using k-means clustering. A hybrid approach combining DP with Particle Swarm Optimization (PSO) is then used to compute the optimal noise parameter (epsilon, ε) for each cluster. Based on this computed value, noise is added to each cluster, and the clusters are then merged to produce the final perturbed dataset. The experimental results demonstrate that the proposed OCDP method achieves high privacy while being computationally efficient. OCDP produces data with privacy percentages of 84 %, 88 %, 89 %, 85 %, 83 % and 77 % for the Heart Disease, GDM, Adult, Automobile, Thyroid Disease and Insurance datasets respectively, representing improvements of 13 % (with clustering) and 50 % (without clustering) over other methods. Moreover, OCDP's computational efficiency allows for faster processing times, making it a reliable solution for maintaining privacy in large datasets.
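The pipeline described in the abstract (cluster, choose a per-cluster epsilon, add noise, merge) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names (`kmeans`, `pso_epsilon`, `laplace_perturb`, `ocdp_sketch`), the epsilon bounds, and the PSO coefficients are all hypothetical. The objective minimized by the PSO is a placeholder trade-off (epsilon plus mean squared noise scale), since the abstract does not state the objective used in the paper. Sensitivity is assumed to be the per-column range within each cluster, which is data dependent and therefore does not by itself give a formal DP guarantee.

```python
import numpy as np

def kmeans(X, k, rng, iters=50):
    """Plain Lloyd's k-means; returns one cluster label per row."""
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels

def pso_epsilon(objective, bounds, rng, particles=10, iters=30,
                w=0.7, c1=1.5, c2=1.5):
    """One-dimensional particle swarm search for the epsilon minimizing `objective`."""
    lo, hi = bounds
    pos = rng.uniform(lo, hi, size=particles)
    vel = np.zeros(particles)
    best_pos = pos.copy()                              # personal bests
    best_val = np.array([objective(p) for p in pos])
    g = best_pos[best_val.argmin()]                    # global best
    for _ in range(iters):
        r1, r2 = rng.random(particles), rng.random(particles)
        vel = w * vel + c1 * r1 * (best_pos - pos) + c2 * r2 * (g - pos)
        pos = np.clip(pos + vel, lo, hi)
        vals = np.array([objective(p) for p in pos])
        improved = vals < best_val
        best_pos[improved] = pos[improved]
        best_val[improved] = vals[improved]
        g = best_pos[best_val.argmin()]
    return float(g)

def laplace_perturb(block, epsilon, rng):
    """Add Laplace noise with scale = sensitivity / epsilon to each column."""
    # Assumption: sensitivity is the column range inside the cluster.
    sensitivity = block.max(axis=0) - block.min(axis=0)
    scale = sensitivity / epsilon
    return block + rng.laplace(0.0, 1.0, size=block.shape) * scale

def ocdp_sketch(X, k=3, seed=0):
    """Cluster, pick an epsilon per cluster, perturb, and merge in place."""
    rng = np.random.default_rng(seed)
    labels = kmeans(X, k, rng)
    out = np.empty_like(X, dtype=float)
    epsilons = {}
    for j in np.unique(labels):
        mask = labels == j
        block = X[mask]
        sens = block.max(axis=0) - block.min(axis=0)
        # Placeholder objective: privacy cost (eps) plus squared noise scale.
        objective = lambda eps, s=sens: eps + np.mean((s / eps) ** 2)
        eps = pso_epsilon(objective, (0.1, 10.0), rng)
        epsilons[int(j)] = eps
        out[mask] = laplace_perturb(block, eps, rng)   # merge by row position
    return out, epsilons

if __name__ == "__main__":
    data = np.random.default_rng(1).normal(size=(60, 3))
    perturbed, eps_by_cluster = ocdp_sketch(data, k=3)
    print(perturbed.shape, eps_by_cluster)
```

Writing each perturbed cluster back through its boolean mask keeps the original row order, so the merged output aligns row by row with the input. Swapping in a different objective (for example, one based on mutual information, which appears in the keywords) only requires changing the `objective` lambda.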
Pages: 15