QA-Matcher: Unsupervised Entity Matching Using a Question Answering Model

被引:0
|
作者
Hayashi, Shogo [1 ,3 ]
Dong, Yuyang [2 ]
Oyamada, Masafumi [2 ]
机构
[1] BizReach Inc, Tokyo, Japan
[2] NEC Corp Ltd, Tokyo, Japan
[3] NEC Corp Ltd, Tokyo, Japan
关键词
entity matching; question answering;
D O I
10.1007/978-3-031-33383-5_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Entity matching (EM) is a fundamental task in data integration, which involves identifying records that refer to the same real-world entity. Unsupervised EM is often preferred in real-world applications, as labeling data is often a labor-intensive process. However, existing unsupervised methods may not always perform well because the assumptions for these methods may not hold for tasks in different domains. In this paper, we propose QA-Matcher, an unsupervised EM model that is domain-agnostic and doesn't require any particular assumptions. Our idea is to frame EM as question answering (QA) by utilizing a trained QA model. Specifically, we generate a question that asks which record has the characteristics of a particular record and a passage that describes other records. We then use the trained QA model to predict the record pair that corresponds to the question-answer as a match. QA-Matcher leverages the power of a QA model to represent the semantics of various types of entities, allowing it to identify identical entities in a QA-like fashion. In extensive experiments on 16 real-world datasets, we demonstrate that QA-Matcher outperforms unsupervised EM methods and is competitive with supervised methods.
引用
收藏
页码:174 / 185
页数:12
相关论文
共 50 条
  • [1] Developing Question Answering (QA) systems using the patterns
    Moise, Maria
    Gheorghe, Ciprian
    Zingale, Marilena
    WSEAS Transactions on Computers, 2010, 9 (07): : 726 - 737
  • [2] HSM-QA: Question Answering System Based on Hierarchical Semantic Matching
    Zhang, Jinlu
    He, Jing
    Zhou, Yiyi
    Sun, Xiaoshuai
    Yu, Xiao
    IEEE ACCESS, 2023, 11 : 77826 - 77839
  • [3] Unsupervised Joint Entity Linking over Question Answering Pair with Global Knowledge
    Liu, Cao
    He, Shizhu
    Yang, Hang
    Liu, Kang
    Zhao, Jun
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2017, 2017, 10565 : 273 - 286
  • [4] Arabic Narrative Question Answering (QA) Using Transformer Models
    Ateeq, Mohammad A.
    Tiun, Sabrina
    Abdelhaq, Hamed
    Rahhal, Nawras
    IEEE ACCESS, 2024, 12 : 2760 - 2777
  • [5] Bilingual Question Answering Using CINDI_QA at QA@CLEF 2007
    Haddad, Chedid
    Desai, Bipin C.
    ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 308 - 315
  • [6] Improving question answering using named entity recognition
    Toral, A
    Noguera, E
    Llopis, F
    Muñoz, R
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2005, 3513 : 181 - 191
  • [7] Developing an Entity Linking Model for Geographic Knowledge Base Question Answering
    Yang, TaeJoo
    Jeong, Evelyn Hyeji
    Yang, Jonghyeon
    Yu, Kiyun
    2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024, 2024, : 385 - 386
  • [8] ISD-QA: Iterative Distillation of Commonsense Knowledge from General Language Models for Unsupervised Question Answering
    Ramamurthy, Priyadharsini
    Aakur, Sathyanarayanan N.
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1229 - 1235
  • [9] Question answering using sentence parsing and semantic network matching
    Hartrumpf, S
    MULTILINGUAL INFORMATION ACCESS FOR TEXT, SPEECH AND IMAGES, 2005, 3491 : 512 - 521
  • [10] QA4QG: USING QUESTION ANSWERING TO CONSTRAIN MULTI-HOP QUESTION GENERATION
    Su, Dan
    Xu, Peng
    Fung, Pascale
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8232 - 8236