Comparison of machine learning models applied on anonymized data with different techniques

被引:4
|
作者
Diaz, Judith Sainz-Pardo [1 ]
Garcia, Alvaro Lopez [1 ]
机构
[1] Inst Fis Cantabria IFCA, CSIC UC, Avda Castros S-N, Santander 39005, Spain
关键词
D O I
10.1109/CSR57506.2023.10224917
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anonymization techniques based on obfuscating the quasi-identifiers by means of value generalization hierarchies are widely used to achieve preset levels of privacy. To prevent different types of attacks against database privacy it is necessary to apply several anonymization techniques beyond the classical k-anonymity or l-diversity. However, the application of these methods is directly connected to a reduction of their utility in prediction and decision making tasks. In this work we study four classical machine learning methods currently used for classification purposes in order to analyze the results as a function of the anonymization techniques applied and the parameters selected for each of them. The performance of these models is studied when varying the value of k for k-anonymity and additional tools such as l-diversity, t-closeness and d-disclosure privacy are also deployed on the well-known adult dataset.
引用
收藏
页码:618 / 623
页数:6
相关论文
共 50 条
  • [1] A Comparison of Machine Learning Techniques Applied to Landsat-5 TM Spectral Data for Biomass Estimation
    Lopez-Serrano, Pablito M.
    Lopez-Sanchez, Carlos A.
    Alvarez-Gonzalez, Juan G.
    Garcia-Gutierrez, Jorge
    CANADIAN JOURNAL OF REMOTE SENSING, 2016, 42 (06) : 690 - 705
  • [2] Comparison of Different Machine Learning Models for diabetes detection
    Katarya, Rahul
    Jain, Sajal
    PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ADVANCES AND DEVELOPMENTS IN ELECTRICAL AND ELECTRONICS ENGINEERING (ICADEE), 2020, : 117 - 121
  • [3] MACHINE LEARNING TECHNIQUES APPLIED TO SENSOR DATA CORRECTION IN BUILDING TECHNOLOGIES
    Smith, Matt K.
    Castello, Charles C.
    New, Joshua R.
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1, 2013, : 305 - 308
  • [4] Comparison of different machine learning techniques in river flow prediction
    Akbulut, Ugur
    Cifci, Mehmet Akif
    Isler, Buket
    Aslan, Zafer
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2025, 40 (01): : 467 - 485
  • [5] CPT Data Interpretation Employing Different Machine Learning Techniques
    Rauter, Stefan
    Tschuchnigg, Franz
    GEOSCIENCES, 2021, 11 (07)
  • [6] A Comparison of Machine Learning Techniques for Classification in Bank Marketing Data
    Saengthongrattanachot, Waritpon
    Na-udom, Anamai
    Rungrattanaubol, Jaratsri
    THAI JOURNAL OF MATHEMATICS, 2022, : 157 - 168
  • [7] A Comparison of Resampling Techniques for Medical Data Using Machine Learning
    Alahmari, Fahad
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2020, 19 (01)
  • [8] Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data
    Yongbing Zhao
    Jinfeng Shao
    Yan W.Asmann
    Genomics,Proteomics & Bioinformatics, 2022, (05) : 899 - 911
  • [9] Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data
    Zhao, Yongbing
    Shao, Jinfeng
    Asmann, Yan W.
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2022, 20 (05) : 899 - 911
  • [10] A Comparison of Machine Learning Models with Data Augmentation Techniques for Skeleton-based Human Action Recognition
    Xin, Chu
    Kim, Seokhwan
    Park, Kyoung Shin
    14TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, BCB 2023, 2023,