Disambiguation data: Extracting information from anonymized sources

被引:0
|
作者
Dreiseitl, S [1 ]
Vinterbo, S
Ohno-Machado, L
机构
[1] Polytech Univ Upper Austria, Dept Software Engn Med, A-4232 Hagenberg, Austria
[2] Harvard Univ, Brigham & Womens Hosp, Sch Med, Decis Syst Grp, Boston, MA 02115 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Privacy protection is an important consideration when releasing medical databases to the research community. We show that while recent advances in anonymization algorithms provide increased levels of protection, it is still possible to calculate approximations to the original data set. In some cases, one can even uniquely reconstruct entries in a table before anonymization. In this paper, we demonstrate how knowledge of an anonymization algorithm based on ambiguating data cell entries can be used to undo the anonymization process. We investigate the effect of this algorithm and its reversal on data sets of varying sizes and distributions. It is shown that by using a computationally complex disambiguation process, information on individuals can be extracted from an anonymized data set.
引用
收藏
页码:144 / 148
页数:5
相关论文
共 50 条
  • [21] Sequential data search for extracting information from texts
    Charnois, Thierry
    Plantevit, Marc
    Rigotti, Christophe
    Cremilleux, Bruno
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2009, 50 (03): : 59 - 87
  • [22] Extracting ship stopping information from AIS data
    Yan, Zhaojin
    Cheng, Liang
    He, Rong
    Yang, Hui
    OCEAN ENGINEERING, 2022, 250
  • [23] Extracting Semantic Information from Visual Data: A Survey
    Liu, Qiang
    Li, Ruihao
    Hu, Huosheng
    Gu, Dongbing
    ROBOTICS, 2016, 5 (01)
  • [24] On the risk of extracting relevant information from random data
    Dominguez, Luis Garcia
    JOURNAL OF NEURAL ENGINEERING, 2009, 6 (05)
  • [25] EXTRACTING INFORMATION FROM APPARENT RANDOMNESS IN CARDIOVASCULAR DATA
    HUANG, NK
    HALBERG, F
    PROCEEDINGS OF THE ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, PTS 1-4, 1988, : 1824 - 1824
  • [26] Cable analysis - Extracting information from measured data
    Mlinarsky, F
    COMMUNICATION CABLES AND RELATED TECHNOLOGIES EC'99, 1999, : 102 - 108
  • [27] Extracting information from heterogeneous information sources using ontologically specified target views
    Biskup, J
    Embley, DW
    INFORMATION SYSTEMS, 2003, 28 (03) : 169 - 212
  • [28] A method for extracting subspace of deterministic sources from EEG data
    Ivannikov, Andriy
    Kaerkkaeinen, Tommi
    Ristaniemi, Tapani
    Lyytinen, Heikki
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 1361 - +
  • [29] DISNET: a framework for extracting phenotypic disease information from public sources
    Lagunes-Garcia, Gerardo
    Rodriguez-Gonzalez, Alejandro
    Prieto-Santamaria, Lucia
    Garcia del Valle, Eduardo P.
    Zanin, Massimiliano
    Menasalvas-Ruiz, Ernestina
    PEERJ, 2020, 8
  • [30] Extracting noise contaminated information in multiple sources
    Kasum, Obrad
    Dolicanin, Edin
    Jovanovic, Aleksandar
    Perovic, Aleksandar
    IEEE 13TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SYSTEMS AND INFORMATICS (SISY), 2015, : 117 - 121