A Distributed Near-Optimal LSH-based Framework for Privacy-Preserving Record Linkage

Cited by: 10
Authors
Karapiperis, Dimitrios [1]
Verykios, Vassilios S. [1]
Affiliations
[1] Hellen Open Univ, Sch Sci & Technol, Patras, Greece
Keywords
Locality-Sensitive Hashing; Bloom filter; Map/Reduce
DOI
10.2298/CSIS140215040K
CLC number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we present a framework which relies on the Map/Reduce paradigm in order to distribute computations among underutilized commodity hardware resources uniformly, without imposing an extra overhead on the existing infrastructure. The volume of the distance computations required for record comparison is largely reduced by utilizing the so-called Locality-Sensitive Hashing technique, which is optimally tuned in order to avoid highly redundant computations. Experimental results illustrate the effectiveness of our distributed framework in finding the matched record pairs in voluminous data sets.
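
The abstract's central idea, using Locality-Sensitive Hashing so that distances are computed only for records that land in the same bucket, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes records are already encoded as equal-length bit vectors (for example Bloom filters), uses plain bit-sampling LSH for Hamming distance, and runs on a single machine. The function names and the parameters `num_tables`, `bits_per_key`, and `threshold` are hypothetical, and the paper's optimal tuning and Map/Reduce distribution are not reproduced here.

```python
import random
from collections import defaultdict
from itertools import combinations


def hamming(a, b):
    """Number of positions at which two equal-length bit vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def lsh_candidate_pairs(vectors, num_tables, bits_per_key, seed=0):
    """Bit-sampling LSH: records sharing a sampled-bit key in any table
    become candidate pairs. All names here are illustrative assumptions."""
    rng = random.Random(seed)
    length = len(next(iter(vectors.values())))
    candidates = set()
    for _ in range(num_tables):
        # Each table samples its own random subset of bit positions.
        positions = rng.sample(range(length), bits_per_key)
        buckets = defaultdict(list)
        for rec_id, vec in vectors.items():
            key = tuple(vec[p] for p in positions)
            buckets[key].append(rec_id)
        # Only records that collide in a bucket are paired up.
        for ids in buckets.values():
            candidates.update(combinations(sorted(ids), 2))
    return candidates


def link(vectors, threshold, num_tables=10, bits_per_key=8):
    """Compute Hamming distances for candidate pairs only and keep
    the pairs whose distance is at most `threshold`."""
    pairs = lsh_candidate_pairs(vectors, num_tables, bits_per_key)
    return {(a, b) for a, b in pairs
            if hamming(vectors[a], vectors[b]) <= threshold}


# Toy data: "a" and "b" are identical, "c" is their bitwise complement.
records = {"a": [1, 0] * 16, "b": [1, 0] * 16, "c": [0, 1] * 16}
matches = link(records, threshold=2)
```

More tables raise the chance that similar records collide at least once, while longer keys make buckets smaller; balancing the two is the kind of tuning the abstract refers to. In a Map/Reduce setting, one plausible mapping (an assumption, not confirmed by the abstract) is to emit the bucket key from the map phase and compare records within each key in the reduce phase.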
Pages: 745 - 763
Page count: 19
Related papers
50 in total
  • [31] Securing Bloom Filters for Privacy-preserving Record Linkage
    Ranbaduge, Thilina
    Schnell, Rainer
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2185 - 2188
  • [32] Blockchain-based Privacy-Preserving Record Linkage: enhancing data privacy in an untrusted environment
    Nobrega, Thiago
    Pires, Carlos Eduardo S.
    Nascimento, Dimas Cassimiro
    INFORMATION SYSTEMS, 2021, 102 (102)
  • [33] Explanation and answers to critiques on: Blockchain-based Privacy-Preserving Record Linkage
    Nobrega, Thiago
    Pires, Carlos Eduardo S.
    Nascimento, Dimas Cassimiro
    INFORMATION SYSTEMS, 2022, 108
  • [34] Precise and Fast Cryptanalysis for Bloom Filter Based Privacy-Preserving Record Linkage
    Christen, Peter
    Ranbaduge, Thilina
    Vatsalan, Dinusha
    Schnell, Rainer
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (11) : 2164 - 2177
  • [35] A Privacy Attack on Multiple Dynamic Match-key based Privacy-Preserving Record Linkage
    Vidanage, A.
    Ranbaduge, T.
    Christen, P.
    Randall, S.
    INTERNATIONAL JOURNAL OF POPULATION DATA SCIENCE (IJPDS), 2020, 5 (01)
  • [36] Privacy-Preserving Record Linkage via Bilinear Pairing Approach
    Lin, Chih-Hsun
    Yu, Chia-Mu
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018
  • [37] Accurate privacy-preserving record linkage for databases with missing values
    Vaiwsri, Sirintra
    Ranbaduge, Thilina
    Christen, Peter
    Schnell, Rainer
    INFORMATION SYSTEMS, 2022, 106
  • [38] Secure Approximate String Matching for Privacy-Preserving Record Linkage
    Essex, Aleksander
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (10) : 2623 - 2632
  • [39] An enhanced privacy-preserving record linkage approach for multiple databases
    Han, Shumin
    Shen, Derong
    Nie, Tiezheng
    Kou, Yue
    Yu, Ge
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (05) : 3641 - 3652
  • [40] Privacy-preserving record linkage on large real world datasets
    Randall, Sean M.
    Ferrante, Anna M.
    Boyd, James H.
    Bauer, Jacqueline K.
    Semmens, James B.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 50 : 205 - 212