CampER: An Effective Framework for Privacy-Aware Deep Entity Resolution

被引:2
|
作者
Guo, Yuxiang [1 ]
Chen, Lu [1 ]
Zhou, Zhengjie [2 ]
Zheng, Baihua [3 ]
Fang, Ziquan [1 ]
Zhang, Zhikun [4 ]
Mao, Yuren [2 ]
Gao, Yunjun [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Zhejiang Univ, Ningbo, Peoples R China
[3] Singapore Management Univ, Singapore, Singapore
[4] Stanford Univ, Palo Alto, CA 94304 USA
关键词
entity resolution; representation learning; similarity measurement; LINKAGE;
D O I
10.1145/3580305.3599266
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Entity Resolution (ER) is a fundamental problem in data preparation. Standard deep ER methods have achieved state-of-the-art effectiveness, assuming that relations from different organizations are centrally stored. However, due to privacy concerns, it can be difficult to centralize data in practice, rendering standard deep ER solutions inapplicable. Despite efforts to develop rule-based privacy-preserving ER methods, they often neglect subtle matching mechanisms and have poor effectiveness as a result. To bridge effectiveness and privacy, in this paper, we propose CampER, an effective framework for privacy-aware deep entity resolution. Specifically, we first design a training pair self-generation strategy to overcome the absence of manually labeled data in privacy-aware scenarios. Based on the self-constructed training pairs, we present a collaborative fine-tuning approach to learn the match-aware and uni-space individual tuple embeddings for accurate matching decisions. During the matching decision-making process, we first introduce a cryptographically secure approach to determine matches. Furthermore, we propose an order-preserving perturbation strategy to significantly accelerate the matching computation while guaranteeing the consistency of ER results. Extensive experiments on eight widely-used benchmark datasets demonstrate that CampER not only is comparable with the state-of-the-art standard deep ER solutions in effectiveness, but also preserves privacy.
引用
收藏
页码:626 / 637
页数:12
相关论文
共 50 条
  • [1] A privacy-aware framework for targeted advertising
    Wang, Wei
    Yang, Linlin
    Chen, Yanjiao
    Zhang, Qian
    COMPUTER NETWORKS, 2015, 79 : 17 - 29
  • [2] A Privacy-Aware Conceptual Framework for Coordination
    Elahi, Haroon
    Wang, Guojun
    Zhang, Wei
    2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 190 - 197
  • [3] Towards A Framework for Privacy-Aware Mobile Crowdsourcing
    Wang, Yang
    Huang, Yun
    Louis, Claudia
    2013 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM), 2013, : 454 - 459
  • [4] Framework for Privacy-Aware Web Service Logging
    Chanakitkarnchok, Chaithat
    Senivongse, Twittie
    2018 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2018, : 467 - 474
  • [5] A Semantic Framework for Privacy-Aware Access Control
    Lioudakis, Georgios V.
    Dellas, Nikolaos L.
    Koutsoloukas, Eleftherios A.
    Kapitsaki, Georgia M.
    Kaklamani, Dimitra I.
    Venieris, Iakovos S.
    2008 INTERNATIONAL MULTICONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (IMCSIT), VOLS 1 AND 2, 2008, : 757 - 764
  • [6] A Privacy-aware Framework for Online Advertisement Targeting
    Yang, Linlin
    Wang, Wei
    Chen, Yanjiao
    Zhang, Qian
    2013 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2013, : 3145 - 3150
  • [7] A CONCEPTUAL PRIVACY FRAMEWORK FOR PRIVACY-AWARE IOT HEALTH APPLICATIONS
    Thinakaran, Kavenesh
    Dhillon, Jaspaljeet Singh
    Gunasekaran, Saraswathy Shamini
    Chen, Lim Fung
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON COMPUTING AND INFORMATICS: EMBRACING ECO-FRIENDLY COMPUTING, 2017, : 175 - 183
  • [8] A privacy-aware deep learning framework for health recommendation system on analysis of big data
    T. Mahesh Selvi
    V. Kavitha
    The Visual Computer, 2022, 38 : 385 - 403
  • [9] Privacy-aware Incentive Mechanism Framework for Mobile Crowdsensing
    Zhu, Shaojun
    Tao, Dan
    2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2019,
  • [10] Privacy-Aware Blind Cloud Framework for Advanced Healthcare
    Sarkar, Subhadeep
    Chatterjee, Subarna
    Misra, Sudip
    Kudupudi, Rajesh
    IEEE COMMUNICATIONS LETTERS, 2017, 21 (11) : 2492 - 2495