Re-identification potential of structured health data

被引:0
|
作者
Drechsler, Joerg [1 ,2 ,3 ,5 ]
Pauly, Hannah [4 ]
机构
[1] Inst Arbeitsmarkt & Berufsforschung IAB, Nurnberg, Germany
[2] Univ Mannheim, Mannheim, Germany
[3] Univ Maryland, Joint Program Survey Methodol JPSM, College Pk, MD USA
[4] Bundesinst Arzneimittel & Medizinprodukte BfArM, Forschungsdatenzentrum Gesundheit, Bonn, Germany
[5] Inst Arbeitsmarkt & Berufsforschung IAB, Regensburger Str 104, D-90478 Nurnberg, Germany
关键词
Reidentification risk; Anonymization; Synthetic data; Electronic health records; Data privacy; MICRODATA;
D O I
10.1007/s00103-023-03820-2
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Broad access to health data offers great potential for science and research. However, health data often contains sensitive information that must be protected in a special way. In this context, the article deals with the re-identification potential of health data. After defining the relevant terms, we discuss factors that influence the re-identification potential. We summarize international privacy standards for health data and highlight the importance of background knowledge. Given that the reidentification potential is often underestimated in practice, we present strategies for mitigation based on the Five Safes concept. We also discuss classical data protection strategies as well as methods for generating synthetic health data. The article concludes with a brief discussion and outlook on the planned Health Data Lab at the Federal Institute for Drugs and Medical Devices.
引用
收藏
页码:164 / 170
页数:7
相关论文
共 50 条
  • [41] An Improved Differential Privacy Algorithm to Protect Re-identification of Data
    Zaman, A. N. K.
    Obimbo, Charlie
    Dara, Rozita A.
    2017 IEEE CANADA INTERNATIONAL HUMANITARIAN TECHNOLOGY CONFERENCE (IHTC), 2017,
  • [42] Self-supervised data augmentation for person re-identification
    Chen, Feng
    Wang, Nian
    Tang, Jun
    Liang, Dong
    Feng, Hao
    NEUROCOMPUTING, 2020, 415 : 48 - 59
  • [43] Towards the formalization of re-identification for some data masking methods
    Torra, Vicenc
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2012, 248 : 47 - 55
  • [44] Fifty Shades of Personal Data - Partial Re-identification and GDPR
    Willemson, Jan
    PRIVACY TECHNOLOGIES AND POLICY, APF 2022, 2022, 13279 : 88 - 96
  • [45] Object Re-Identification with Synthetic Training Data in Industrial Environments
    Duemmel, Johannes
    Gao, Xue
    2021 27TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE (M2VIP), 2021,
  • [46] CamStyle: A Novel Data Augmentation Method for Person Re-Identification
    Zhong, Zhun
    Zheng, Liang
    Zheng, Zhedong
    Li, Shaozi
    Yang, Yi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (03) : 1176 - 1190
  • [47] Generate and Purify: Efficient Person Data Generation for Re-Identification
    Lu, Jianjie
    Zhang, Weidong
    Yin, Haibing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 558 - 566
  • [48] Person Re-identification with pose variation aware data augmentation
    Zhang, Lei
    Jiang, Na
    Diao, Qishuai
    Zhou, Zhong
    Wu, Wei
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (14): : 11817 - 11830
  • [49] Domain Adaptation for Person Re-identification on New Unlabeled Data
    Pereira, Tiago de C. G.
    de Campos, Teofilo E.
    VISAPP: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4: VISAPP, 2020, : 695 - 703
  • [50] A Systematic Review of Re-Identification Attacks on Health Data (vol 6, e28071, 2011)
    El Emam, Khaled
    Jonker, Elizabeth
    Arbuckle, Luk
    Malin, Bradley
    PLOS ONE, 2015, 10 (04):