Federated Clouds: A New Metric for Measuring the Quality of Data Anonymization

被引:0
|
作者
Gaye, Youssoupha [1 ]
Mbaye, Maissa [1 ]
Diongue, Dame [1 ]
Dieng, Ousmane [2 ]
Adetiba, Emmanuel [3 ,4 ]
Badejo, Joke A. [3 ]
机构
[1] Gaston Berger Univ, Lab Anal Numer & Informat LANI, CEA MITIC, St Louis 234, PB, Senegal
[2] Univ Pittsburgh, Power Management & Real Time Syst Lab, Pittsburgh, PA 15260 USA
[3] Covenant Univ, Covenant Appl Informat & Commun African Ctr Excel, Ota, Ogun State, Nigeria
[4] Durban Univ Technol, Inst Syst Sci, HRA, ZA-1334 Durban, South Africa
来源
关键词
Data Anonymization Metric; Data Privacy; Federated Cloud Security; Federated Cloud; Cloud Computing; PRIVACY;
D O I
10.1007/978-3-031-62488-9_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated cloud has emerged as solution for cloud service providers to get scalability in serving the growing demand for cloud resources. In a federated cloud, a cloud member can provide service or request it from other cloud provider members in the federation. The federation enables its cloud provider members to be able to satisfy a service beyond the resources they owned by using the resources market in the federation. Data privacy is a major concern in federated clouds. As the privacy regulations and laws of the countries in the federation may vary, it is difficult to assess and confirm that they are in compliance. This makes protecting privacy even more challenging. Privacy management strategies primarily involve anonymization, cryptography, and data splitting. Anonymization is the traditional approach to preserving privacy, which aims at masking the link between the quasi-identifier and sensitive data. The most widely used anonymization techniques are k-anonymity, l-diversity and t-closeness. However, there is a lack of a formal metric to measure the quality of the anonymization process in terms of its ability to prevent re-identification. This paper examines the issue of assessing anonymization quality and introduces a new metric, Mmaq, for this purpose. It can be used to evaluate the anonymization of one or multiple attributes. The metric is a combination of the Shannon index, which measures diversity, and a stabilizer factor, which corrects the Shannon index for pathological cases. The initial results suggest that Mmaq can be used to classify attributes as identifier, quasi-identifier, and anonymous. Furthermore, it can be employed as a Cloud Privacy Policy anonymization compliance checker.
引用
收藏
页码:17 / 30
页数:14
相关论文
共 50 条
  • [21] Measuring Data Similarity for Efficient Federated Learning: A Feasibility Study
    Fama, Fernanda
    Kalalas, Charalampos
    Lagen, Sandra
    Dini, Paolo
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 394 - 400
  • [22] A big video data transcoding service for social media over federated clouds
    Panarello, Alfonso
    Celesti, Antonio
    Fazio, Maria
    Puliafito, Antonio
    Villari, Massimo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (13-14) : 9037 - 9061
  • [23] A new metric for categorical data
    Al-Harbi, SH
    McKeown, GP
    Rayward-Smith, VJ
    STATISTICAL DATA MINING AND KNOWLEDGE DISCOVERY, 2004, : 339 - 351
  • [24] A big video data transcoding service for social media over federated clouds
    Alfonso Panarello
    Antonio Celesti
    Maria Fazio
    Antonio Puliafito
    Massimo Villari
    Multimedia Tools and Applications, 2020, 79 : 9037 - 9061
  • [25] Data Quality Control Based on Metric Data Models
    Koeppen, Veit
    Lenz, Hans-J
    FRONTIERS IN STATISTICAL QUALITY CONTROL 9, 2010, : 263 - +
  • [26] SegGuard: Segmentation-Based Anonymization of Network Data in Clouds for Privacy-Preserving Security Auditing
    Oqaily, Momen
    Jarraya, Yosr
    Mohammady, Meisam
    Majumdar, Suryadipta
    Pourzandi, Makan
    Wang, Lingyu
    Debbabi, Mourad
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (05) : 2486 - 2505
  • [27] MEASURING DATA QUALITY IN VORTALS
    Caro, Angelica
    Angeles Moraga, Ma
    Moraga, Carmen
    Calero, Coral
    ICSOFT 2009: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL 2, 2009, : 217 - +
  • [28] Data Anonymization for Requirements Quality Analysis: a Reproducible Automatic Error Detection Task
    Kang, Juyeon
    Park, Jungyeul
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4432 - 4436
  • [29] Method of measuring NEQ as a quality control metric for digital mammography
    Bloomquist, Aili K.
    Mainprize, James G.
    Mawdsley, Gordon E.
    Yaffe, Martin J.
    MEDICAL PHYSICS, 2014, 41 (03)
  • [30] When Poor-Quality Data Meet Anonymization Models: Threats and Countermeasures
    Majeed, Abdul
    Hwang, Seong Oun
    IEEE ACCESS, 2025, 13 : 49457 - 49475