Secure Approximate String Matching for Privacy-Preserving Record Linkage

被引:8
|
作者
Essex, Aleksander [1 ]
机构
[1] Western Univ, Dept Elect & Comp Engn, London, ON N6A 5B9, Canada
关键词
Homomorphic encryption; secure computation; approximate string matching; privacy-preserving records linkage; EFFICIENT;
D O I
10.1109/TIFS.2019.2903651
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Real-world applications of record linkage often require matching to be robust in spite of small variations in string fields. For example, two health care providers should be able to detect a patient in common, even if one record contains a typo or transcription error. In the privacy-preserving setting, however, the problem of approximate string matching has been cast as a trade-off between security and practicality, and the literature has mainly focused on Bloom filter encodings, an approach which can leak significant information about the underlying records. We present a novel public-key construction for secure two-party evaluation of threshold functions in restricted domains based on embeddings found in the message spaces of additively homomorphic encryption schemes. We use this to construct an efficient two-party protocol for privately computing the threshold Dice coefficient. Relative to the approach of Bloom filter encodings, our proposal offers formal security guarantees and greater matching accuracy. We implement the protocol and demonstrate the feasibility of this approach in linking mediumsized patient databases with tens of thousands of records.
引用
收藏
页码:2623 / 2632
页数:10
相关论文
共 50 条
  • [2] Encryption-based sub-string matching for privacy-preserving record linkage
    Vaiwsri, Sirintra
    Ranbaduge, Thilina
    Christen, Peter
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2024, 81
  • [3] A Graph Matching Attack on Privacy-Preserving Record Linkage
    Vidanage, Anushka
    Christen, Peter
    Ranbaduge, Thilina
    Schnell, Rainer
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1485 - 1494
  • [4] Privacy-Preserving String Comparisons in Record Linkage Systems: A Review
    Trepetin, Stanley
    INFORMATION SECURITY JOURNAL, 2008, 17 (5-6): : 253 - 266
  • [5] Privacy-preserving record linkage
    Verykios, Vassilios S.
    Christen, Peter
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 3 (05) : 321 - 332
  • [6] Privacy-Preserving Record Linkage
    Hall, Rob
    Fienberg, Stephen E.
    PRIVACY IN STATISTICAL DATABASES, 2010, 6344 : 269 - +
  • [7] Privacy-Preserving Record Linkage with Spark
    Valkering, Onno
    Belloum, Adam
    2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 440 - 448
  • [8] Privacy-Preserving Temporal Record Linkage
    Ranbaduge, Thilina
    Christen, Peter
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 377 - 386
  • [9] On the effectiveness of graph matching attacks against privacy-preserving record linkage
    Heng, Youzhe
    Armknecht, Frederik
    Chen, Yanling
    Schnell, Rainer
    PLOS ONE, 2022, 17 (09):
  • [10] Approximate Two-Party Privacy-Preserving String Matching with Linear Complexity
    Beck, Martin
    Kerschbaum, Florian
    2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, : 31 - 37