A Distributed Near-Optimal LSH-based Framework for Privacy-Preserving Record Linkage

Cited by: 10
Authors
Karapiperis, Dimitrios [1]
Verykios, Vassilios S. [1]
Affiliations
[1] Hellen Open Univ, Sch Sci & Technol, Patras, Greece
Keywords
Locality-Sensitive Hashing; Bloom filter; Map/Reduce
DOI
10.2298/CSIS140215040K
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we present a framework that relies on the Map/Reduce paradigm to distribute computations uniformly among underutilized commodity hardware resources, without imposing extra overhead on the existing infrastructure. The volume of distance computations required for record comparison is greatly reduced by using the Locality-Sensitive Hashing technique, which is optimally tuned in order to avoid highly redundant computations. Experimental results illustrate the effectiveness of our distributed framework in finding the matched record pairs in voluminous data sets.
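The general idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes records are already encoded as Bloom filter bit vectors, uses bit-sampling (Hamming) LSH as the hashing scheme, and simulates the Map/Reduce phases in a single process. All names (`make_hash_tables`, `map_phase`, `reduce_phase`, `link`) and the parameters `key_len`, `num_tables`, and `threshold` are hypothetical, and the tuning of those parameters that the paper addresses is not shown.

```python
import random
from collections import defaultdict
from itertools import combinations


def make_hash_tables(num_bits, key_len, num_tables, seed=0):
    """Pick key_len random bit positions for each of num_tables hash tables."""
    rng = random.Random(seed)
    return [rng.sample(range(num_bits), key_len) for _ in range(num_tables)]


def map_phase(record_id, bits, tables):
    """Emit one ((table index, blocking key), record id) pair per hash table."""
    for t, positions in enumerate(tables):
        key = "".join(str(bits[p]) for p in positions)
        yield (t, key), record_id


def reduce_phase(pairs):
    """Group record ids by blocking key and form candidate pairs per bucket."""
    buckets = defaultdict(list)
    for key, record_id in pairs:
        buckets[key].append(record_id)
    candidates = set()
    for ids in buckets.values():
        # The set removes pairs that collide in more than one table.
        candidates.update(combinations(sorted(ids), 2))
    return candidates


def hamming(x, y):
    """Number of positions at which two bit vectors differ."""
    return sum(a != b for a, b in zip(x, y))


def link(records, tables, threshold):
    """Compare only candidate pairs; keep those within the Hamming threshold."""
    emitted = [kv for rid, bits in records.items()
               for kv in map_phase(rid, bits, tables)]
    return {(a, b) for a, b in reduce_phase(emitted)
            if hamming(records[a], records[b]) <= threshold}
```

Only records that share a blocking key in at least one table are compared, which is where the reduction in distance computations comes from. For simplicity the sketch pairs every record in a bucket with every other; a linkage setting would typically pair only records that come from different data sets.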
Pages: 745-763
Number of pages: 19
Related papers
50 in total
  • [22] Modern Privacy-Preserving Record Linkage Techniques: An Overview
    Gkoulalas-Divanis, Aris
    Vatsalan, Dinusha
    Karapiperis, Dimitrios
    Kantarcioglu, Murat
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 4966 - 4987
  • [23] A Graph Matching Attack on Privacy-Preserving Record Linkage
    Vidanage, Anushka
    Christen, Peter
    Ranbaduge, Thilina
    Schnell, Rainer
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1485 - 1494
  • [24] Privacy-preserving record linkage using Bloom filters
    Schnell, Rainer
    Bachteler, Tobias
    Reiher, Joerg
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2009, 9
  • [25] ScaDS Research on Scalable Privacy-preserving Record Linkage
    Franke, Martin
    Gladbach, Marcel
    Sehili, Ziad
    Rohde, Florens
    Rahm, Erhard
    Datenbank-Spektrum, 2019, 19 (01) : 31 - 40
  • [26] A Tutorial on Blocking Methods for Privacy-Preserving Record Linkage
    Karapiperis, Dimitrios
    Verykios, Vassilios S.
    Katsiri, Eleftheria
    Delis, Alex
    ALGORITHMIC ASPECTS OF CLOUD COMPUTING, ALGOCLOUD 2015, 2016, 9511 : 3 - 15
  • [27] Encoding of Numerical Data for Privacy-Preserving Record Linkage
    Demelius, Lea
    Kreiner, Karl
    Hayn, Dieter
    Nitzlnader, Michael
    Schreier, Guenter
    DHEALTH 2020 - BIOMEDICAL INFORMATICS FOR HEALTH AND CARE, 2020, 271 : 23 - 30
  • [28] Blind Attribute Pairing for Privacy-Preserving Record Linkage
    da Nobrega, Thiago Pereira
    Pires, Carlos Eduardo S.
    Araujo, Tiago Brasileiro
    Mestre, Demetrio Gomes
    33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, : 557 - 564
  • [29] Privacy-preserving record linkage using Bloom filters
    Rainer Schnell
    Tobias Bachteler
    Jörg Reiher
    BMC Medical Informatics and Decision Making, 9
  • [30] Fairness-Aware Privacy-Preserving Record Linkage
    Vatsalan, Dinusha
    Yu, Joyce
    Henecka, Wilko
    Thorne, Brian
    DATA PRIVACY MANAGEMENT, CRYPTOCURRENCIES AND BLOCKCHAIN TECHNOLOGY, ESORICS 2020, DPM 2020, CBT 2020, 2020, 12484 : 3 - 18