A Distributed Near-Optimal LSH-based Framework for Privacy-Preserving Record Linkage

被引:10
|
作者
Karapiperis, Dimitrios [1 ]
Verykios, Vassilios S. [1 ]
机构
[1] Hellen Open Univ, Sch Sci & Technol, Patras, Greece
关键词
Locality-Sensitive Hashing; Bloom filter; Map/Reduce;
D O I
10.2298/CSIS140215040K
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a framework which relies on the Map/Redtice paradigm in order to distribute computations among underutilized commodity hardware resources uniformly, without imposing an extra overhead on the existing infrastructure. The volume of the distance computations, required for records comparison, is largely reduced by utilizing the so-called Locality-Sensitive Hashing technique, which is optimally tuned in order to avoid highly redundant computations. Experimental results illustrate the effectiveness of our distributed framework in finding the matched record pairs in voluminous data sets.
引用
收藏
页码:745 / 763
页数:19
相关论文
共 50 条
  • [41] Privacy-Preserving String Comparisons in Record Linkage Systems: A Review
    Trepetin, Stanley
    INFORMATION SECURITY JOURNAL, 2008, 17 (5-6): : 253 - 266
  • [42] An Overview of Big Data Issues in Privacy-Preserving Record Linkage
    Vatsalan, Dinusha
    Karapiperis, Dimitrios
    Gkoulalas-Divanis, Aris
    ALGORITHMIC ASPECTS OF CLOUD COMPUTING (ALGOCLOUD 2018), 2019, 11409 : 118 - 136
  • [43] Differential Cryptanalysis of Bloom Filters for Privacy-Preserving Record Linkage
    Yin, Weifeng
    Yuan, Lifeng
    Ren, Yizhi
    Meng, Weizhi
    Wang, Dong
    Wang, Qiuhua
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 6665 - 6678
  • [44] Optimization of the Mainzelliste software for fast privacy-preserving record linkage
    Florens Rohde
    Martin Franke
    Ziad Sehili
    Martin Lablans
    Erhard Rahm
    Journal of Translational Medicine, 19
  • [45] An enhanced privacy-preserving record linkage approach for multiple databases
    Shumin Han
    Derong Shen
    Tiezheng Nie
    Yue Kou
    Ge Yu
    Cluster Computing, 2022, 25 : 3641 - 3652
  • [46] Privacy-Preserving Access Control in Electronic Health Record Linkage
    Lu, Yang
    Sinnott, Richard O.
    Verspoor, Kain
    Parampalli, Udaya
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (IEEE TRUSTCOM) / 12TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (IEEE BIGDATASE), 2018, : 1079 - 1090
  • [47] Optimization of the Mainzelliste software for fast privacy-preserving record linkage
    Rohde, Florens
    Franke, Martin
    Sehili, Ziad
    Lablans, Martin
    Rahm, Erhard
    JOURNAL OF TRANSLATIONAL MEDICINE, 2021, 19 (01)
  • [48] Efficient Cryptanalysis of Bloom Filters for Privacy-Preserving Record Linkage
    Christen, Peter
    Ranbaduge, Thilina
    Vatsalan, Dinusha
    Schnell, Rainer
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT I, 2017, 10234 : 628 - 640
  • [49] Hyper-Parameter Optimization for Privacy-Preserving Record Linkage
    Yu, Joyce
    Nabaglo, Jakub
    Vatsalan, Dinusha
    Henecka, Wilko
    Thorne, Brian
    ECML PKDD 2020 WORKSHOPS, 2020, 1323 : 281 - 296
  • [50] A Privacy-Preserving Distributed Optimal Scheduling for Interconnected Microgrids
    Liu, Nian
    Wang, Cheng
    Cheng, Minyang
    Wang, Jie
    ENERGIES, 2016, 9 (12)