An enhanced privacy-preserving record linkage approach for multiple databases

Citations: 3
Authors
Han, Shumin [1]
Shen, Derong [1]
Nie, Tiezheng [1]
Kou, Yue [1]
Yu, Ge [1]
Affiliations
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110169, Liaoning, Peoples R China
Source
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2022 / Vol. 25 / Issue 05
Funding
National Natural Science Foundation of China;
Keywords
Record linkage; Privacy; Bloom filter; Multi-LUs; Blocking;
DOI
10.1007/s10586-022-03590-7
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
For research purposes, organizations often need to share and link records that belong to the same individual while protecting privacy, a task referred to as privacy-preserving record linkage (PPRL). Various approaches have been developed to tackle this problem; however, it remains challenging because of the massive amount of data, multiple data sources, and 'dirty' data. This paper therefore proposes an enhanced approximate multi-party PPRL (MP-PPRL) approach to improve privacy, scalability, and linkage quality. For privacy, the Bloom filter (BF) is so far a better and more efficient masking technique than the alternatives, so records are encoded into BFs. However, BFs may be compromised through frequency-based attacks. To enhance privacy, a distributed protocol that introduces multiple linkage units (Multi-LUs) is proposed to resist such attacks. For scalability, we develop a blocking technique, called BF-SNN, based on the sorted nearest neighborhood (SNN) approach for clustering similar BFs across multiple databases, which dramatically reduces complexity. For linkage quality, a personalized threshold that varies with the level of 'dirty' data is introduced, which provides more accurate error tolerance for 'dirty' data and consequently improves linkage quality. An analysis and an empirical study on large real-world datasets show the benefit of the proposed approach.
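As a rough illustration of the Bloom filter encoding mentioned in the abstract, the following minimal Python sketch encodes a string into a BF through its q-grams and compares two BFs with the Dice coefficient, a construction commonly used in the PPRL literature. It is not the paper's implementation: the function names, the filter length of 256 bits, the 4 hash functions, the SHA-256/SHA-1 double hashing, and the fixed threshold of 0.8 are all assumptions chosen for the example. The Multi-LU protocol, BF-SNN blocking, and the personalized threshold are not reproduced here.

```python
import hashlib


def qgrams(value, q=2):
    """Split a padded, lower-cased string into its set of q-grams."""
    padded = "_" * (q - 1) + value.lower() + "_" * (q - 1)
    return {padded[i:i + q] for i in range(len(padded) - q + 1)}


def encode_bf(value, bf_len=256, num_hashes=4, q=2):
    """Return the set of 1-bit positions of the Bloom filter for `value`.

    Assumed scheme: double hashing, position_i = (h1 + i * h2) mod bf_len.
    """
    bits = set()
    for gram in qgrams(value, q):
        data = gram.encode("utf-8")
        h1 = int(hashlib.sha256(data).hexdigest(), 16)
        h2 = int(hashlib.sha1(data).hexdigest(), 16)
        for i in range(num_hashes):
            bits.add((h1 + i * h2) % bf_len)
    return bits


def dice(bf_a, bf_b):
    """Dice coefficient of two Bloom filters given as sets of 1-bit positions."""
    if not bf_a and not bf_b:
        return 1.0
    return 2 * len(bf_a & bf_b) / (len(bf_a) + len(bf_b))


def is_match(bf_a, bf_b, threshold=0.8):
    """Classify a pair as a match using a fixed (hypothetical) threshold."""
    return dice(bf_a, bf_b) >= threshold
```

A typical use would be to call `encode_bf` on each party's attribute values and pass only the resulting bit positions to the comparison step, so that `dice(encode_bf("smith"), encode_bf("smyth"))` tolerates the typo without either plaintext value being exchanged.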
Pages: 3641 - 3652
Number of pages: 12
Related papers
50 in total
  • [21] A Graph Matching Attack on Privacy-Preserving Record Linkage
    Vidanage, Anushka
    Christen, Peter
    Ranbaduge, Thilina
    Schnell, Rainer
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020: 1485 - 1494
  • [22] Privacy-preserving record linkage using Bloom filters
    Schnell, Rainer
    Bachteler, Tobias
    Reiher, Joerg
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2009, 9
  • [23] ScaDS Research on Scalable Privacy-preserving Record Linkage
    Franke, Martin
    Gladbach, Marcel
    Sehili, Ziad
    Rohde, Florens
    Rahm, Erhard
    Datenbank-Spektrum, 2019, 19 (01): 31 - 40
  • [24] A Tutorial on Blocking Methods for Privacy-Preserving Record Linkage
    Karapiperis, Dimitrios
    Verykios, Vassilios S.
    Katsiri, Eleftheria
    Delis, Alex
    ALGORITHMIC ASPECTS OF CLOUD COMPUTING, ALGOCLOUD 2015, 2016, 9511 : 3 - 15
  • [25] Encoding of Numerical Data for Privacy-Preserving Record Linkage
    Demelius, Lea
    Kreiner, Karl
    Hayn, Dieter
    Nitzlnader, Michael
    Schreier, Guenter
    DHEALTH 2020 - BIOMEDICAL INFORMATICS FOR HEALTH AND CARE, 2020, 271 : 23 - 30
  • [26] Blind Attribute Pairing for Privacy-Preserving Record Linkage
    da Nobrega, Thiago Pereira
    Pires, Carlos Eduardo S.
    Araujo, Tiago Brasileiro
    Mestre, Demetrio Gomes
    33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018: 557 - 564
  • [27] A Vulnerability Assessment Framework for Privacy-preserving Record Linkage
    Vidanage, Anushka
    Christen, Peter
    Ranbaduge, Thilina
    Schnell, Rainer
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2023, 26 (03)
  • [28] Privacy-preserving record linkage using Bloom filters
    Rainer Schnell
    Tobias Bachteler
    Jörg Reiher
    BMC Medical Informatics and Decision Making, 9
  • [29] A Privacy Attack on Multiple Dynamic Match-key based Privacy-Preserving Record Linkage
    Vidanage, A.
    Ranbaduge, T.
    Christen, P.
    Randall, S.
    INTERNATIONAL JOURNAL OF POPULATION DATA SCIENCE (IJPDS), 2020, 5 (01)
  • [30] Fairness-Aware Privacy-Preserving Record Linkage
    Vatsalan, Dinusha
    Yu, Joyce
    Henecka, Wilko
    Thorne, Brian
    DATA PRIVACY MANAGEMENT, CRYPTOCURRENCIES AND BLOCKCHAIN TECHNOLOGY, ESORICS 2020, DPM 2020, CBT 2020, 2020, 12484 : 3 - 18