A privacy protection technique for publishing data mining models and research data

Cited by: 1
Authors
Fu Y. [1 ]
Chen Z. [1 ]
Koru G. [1 ]
Gangopadhyay A. [1 ]
Affiliation
[1] Department of Information Systems, University of Maryland, Baltimore County, Baltimore, MD 21250
Keywords
Preserving data mining;
DOI
10.1145/1877725.1877732
Abstract
Data mining techniques have been widely used in many research disciplines such as medicine, life sciences, and social sciences to extract useful knowledge (such as mining models) from research data. Research data often needs to be published along with the data mining model for verification or reanalysis. However, the privacy of the published data needs to be protected because otherwise the published data is subject to misuse such as linking attacks. Therefore, employing various privacy protection methods becomes necessary. However, these methods only consider privacy protection and do not guarantee that the same mining models can be built from sanitized data. Thus the published models cannot be verified using the sanitized data. This article proposes a technique that not only protects privacy, but also guarantees that the same model, in the form of decision trees or regression trees, can be built from the sanitized data. We have also experimentally shown that other mining techniques can be used to reanalyze the sanitized data. This technique can be used to promote sharing of research data. © 2010 ACM.
Related papers
50 in total
  • [21] Privacy Preserving Data Mining Technique to Secure Distributed Client Data
    Dani, Virendra
    Kokate, Priyanka
    Kushwah, Surbhi
    Waghela, Swapnil
    HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 565 - 574
  • [22] Data mining privacy preserving: Research agenda
    Kreso, Inda
    Kapo, Amra
    Turulja, Lejla
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 11 (01)
  • [23] Recent research on privacy preserving data mining
    Gurevich, Alex
    Gudes, Ehud
    INFORMATION SYSTEMS SECURITY, PROCEEDINGS, 2006, 4332 : 377 - +
  • [24] Research on privacy protection method based on deep reinforcement learning algorithm in data mining
    Cai, Yan
    Xue, Rui
    International Journal of Computational Systems Engineering, 2024, 8 (3-4) : 210 - 219
  • [25] Blockchain Data Privacy Protection Mechanism for Enterprise Finance and Data Mining Algorithms
    Ma, Xuejun
    Zhang, Yongshan
    Engineering Intelligent Systems, 32 (05) : 435 - 443
  • [26] Research on Privacy Data Protection in Mobile Applications
    Chen, Haihong
    Gu, Yazhen
    Wang, Peng
    Dong, Jie
    Ren, Yanyan
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021 : 4912 - 4915
  • [27] Preserving Data Privacy in Speech Data Publishing
    孙佳鑫
    蒋进
    赵萍
    Journal of Donghua University (English Edition), 2020, 37 (04) : 293 - 297
  • [28] Correction to: Protection of data privacy from vulnerability using two-fish technique with Apriori algorithm in data mining
    D. Dhinakaran
    P. M. Joe Prathap
    The Journal of Supercomputing, 2022, 78 : 19754 - 19754
  • [29] The Formal Analysis on Negative Information Selections for Privacy Protection in Data Publishing
    Chen, Ping
    Hu, Jingjing
    Wu, Zhitao
    Xiong, Ruoting
    Ren, Wei
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2024, 2024
  • [30] Privacy Protection Method on Publishing Dynamic Set-Valued Data
    Zhang, Jian
    Yang, Yu
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INFORMATION MANAGEMENT AND NETWORK SECURITY, 2016, 47 : 262 - 265