Disambiguation data: Extracting information from anonymized sources

被引:0
|
作者
Dreiseitl, S [1 ]
Vinterbo, S
Ohno-Machado, L
机构
[1] Polytech Univ Upper Austria, Dept Software Engn Med, A-4232 Hagenberg, Austria
[2] Harvard Univ, Brigham & Womens Hosp, Sch Med, Decis Syst Grp, Boston, MA 02115 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Privacy protection is an important consideration when releasing medical databases to the research community. We show that while recent advances in anonymization algorithms provide increased levels of protection, it is still possible to calculate approximations to the original data set. In some cases, one can even uniquely reconstruct entries in a table before anonymization. In this paper, we demonstrate how knowledge of an anonymization algorithm based on ambiguating data cell entries can be used to undo the anonymization process. We investigate the effect of this algorithm and its reversal on data sets of varying sizes and distributions. It is shown that by using a computationally complex disambiguation process, information on individuals can be extracted from an anonymized data set.
引用
收藏
页码:144 / 148
页数:5
相关论文
共 50 条
  • [1] Disambiguation data: Extracting information from anonymized sources
    Dreiseitl, S
    Vinterbo, S
    Ohno-Machado, L
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2002, 9 (06) : S110 - S114
  • [2] A Framework for Extracting Information from Semi-Structured Web Data Sources
    Shaker, Malunoud
    Ibrahim, Hamidah
    Mustapha, Aida
    Abdullah, Lili Nurliyana
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 27 - 31
  • [3] Extracting information from drill data
    Yin, K.
    Liu, H.
    Yang, H.
    2000, shers (04)
  • [4] Information Leakage in Optimal Anonymized and Diversified Data
    Fang, Chengfang
    Chang, Ee-Chien
    INFORMATION HIDING, 2008, 5284 : 30 - 44
  • [5] On extracting relevant information from medical data
    Zvarova, J
    Studeny, M
    Preiss, J
    MEDICAL INFORMATICS EUROPE '96: HUMAN FACETS IN INFORMATION TECHNOLOGIES, 1996, 34 : 649 - 653
  • [6] Extracting data, information, and knowledge from an ELN
    Bird, Colin L.
    Coles, Simon J.
    Frey, Jeremy G.
    Whitby, Richard J.
    Day, Aileen E.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2014, 247
  • [7] Extracting meaningful information from financial data
    Rajkovic, M
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2000, 287 (3-4) : 383 - 395
  • [8] Extracting information from text data bases
    Albrecht, R
    Merkl, D
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS VIII, 1999, 172 : 283 - 286
  • [9] Flower: extracting information from pyrosequencing data
    Malde, Ketil
    BIOINFORMATICS, 2011, 27 (07) : 1041 - 1042
  • [10] Extracting morphologic information from field data
    Plant, Nathaniel G.
    Holman, Rob A.
    Proceedings of the Coastal Engineering Conference, 1998, 3 : 2773 - 2784