Comparison of machine learning models applied on anonymized data with different techniques

Cited by: 4
Authors
Diaz, Judith Sainz-Pardo [1]
Garcia, Alvaro Lopez [1]
Affiliations
[1] Inst Fis Cantabria IFCA, CSIC UC, Avda Castros S-N, Santander 39005, Spain
DOI
10.1109/CSR57506.2023.10224917
Chinese Library Classification
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Anonymization techniques based on obfuscating the quasi-identifiers by means of value generalization hierarchies are widely used to achieve preset levels of privacy. To prevent different types of attacks against database privacy it is necessary to apply several anonymization techniques beyond the classical k-anonymity or l-diversity. However, the application of these methods is directly connected to a reduction of their utility in prediction and decision making tasks. In this work we study four classical machine learning methods currently used for classification purposes in order to analyze the results as a function of the anonymization techniques applied and the parameters selected for each of them. The performance of these models is studied when varying the value of k for k-anonymity and additional tools such as l-diversity, t-closeness and d-disclosure privacy are also deployed on the well-known adult dataset.
Pages: 618 - 623
Page count: 6
Related papers
50 in total
  • [21] Review: machine learning techniques applied to cybersecurity
    Javier Martínez Torres
    Carla Iglesias Comesaña
    Paulino J. García-Nieto
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 2823 - 2836
  • [22] Anonymized Data: Generation, Models, Usage
    Cormode, Graham
    Srivastava, Divesh
    26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010 : 1211 - 1212
  • [23] Anonymized Data: Generation, Models, Usage
    Cormode, Graham
    Srivastava, Divesh
    ACM SIGMOD/PODS 2009 CONFERENCE, 2009 : 1015 - 1018
  • [24] Microeconometric models and anonymized micro data
    Ronning G.
    Allgemeines Statistisches Archiv, 2006, 90 (1) : 153 - 166
  • [25] Evaluating how different balancing data techniques impact on prediction of premature birth using machine learning models
    Silva, Anna Beatriz
    Rocha, Elisson da Silva
    Lorenzato, Joao Fausto
    Endo, Patricia Takako
    PLOS ONE, 2025, 20 (03)
  • [26] A Comparison of different learning models used in Data Mining for Medical Data
    Srimani, P. K.
    Koti, Manjula Sanjay
    2ND INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN SCIENCE AND TECHNOLOGY (ICM2ST-11), 2011, 1414
  • [27] VALIDITY OF AGROECOSYSTEM MODELS - A COMPARISON OF RESULTS OF DIFFERENT MODELS APPLIED TO THE SAME DATA SET
    DIEKKRUGER, B
    SONDGERATH, D
    KERSEBAUM, KC
    MCVOY, CW
    ECOLOGICAL MODELLING, 1995, 81 (1-3) : 3 - 29
  • [28] New Partially Linear Regression and Machine Learning Models Applied to Agronomic Data
    Rodrigues, Gabriela M.
    Ortega, Edwin M. M.
    Cordeiro, Gauss M.
    AXIOMS, 2023, 12 (11)
  • [30] Comparison of different machine learning models for mass appraisal of real estate
    Bilgilioglu, Suleyman Sefa
    Yilmaz, Haci Murat
    SURVEY REVIEW, 2023, 55 (388) : 32 - 43