Domain-Independent Entity Coreference for Linking Ontology Instances

被引:13
|
作者
Song, Dezhao [1 ]
Heflin, Jeff [1 ]
机构
[1] Lehigh Univ, Dept Comp Sci & Engn, 19 Mem Dr West, Bethlehem, PA 18015 USA
来源
关键词
Algorithms; Experimentation; Theory; Entity coreference; semantic web; ontology; domain-independence; discriminability;
D O I
10.1145/2435221.2435223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of entity coreference is to determine if different mentions (e.g., person names, place names, database records, ontology instances, etc.) refer to the same real word object. Entity coreference algorithms can be used to detect duplicate database records and to determine if two Semantic Web instances represent the same underlying real word entity. The key issues in developing an entity coreference algorithm include how to locate context information and how to utilize the context appropriately. In this article, we present a novel entity coreference algorithm for ontology instances. For scalability reasons, we select a neighborhood of each instance from an RDF graph. To determine the similarity between two instances, our algorithm computes the similarity between comparable property values in the neighborhood graphs. The similarity of distinct URIs and blank nodes is computed by comparing their outgoing links. In an attempt to reduce the impact of distant nodes on the final similarity measure, we explore a distance-based discounting approach. To provide the best possible domain-independent matches, we propose an approach to compute the discriminability of triples in order to assign weights to the context information. We evaluated our algorithm using different instance categories from five datasets. Our experiments show that the best results are achieved by including both our discounting and triple discrimination approaches.
引用
收藏
页数:29
相关论文
共 50 条
  • [11] Domain-independent Design Theory
    Korn, J.
    Journal of Engineering Design, 7 (03):
  • [12] DOMAIN-INDEPENDENT FORMULAS AND DATABASES
    TOPOR, RW
    THEORETICAL COMPUTER SCIENCE, 1987, 52 (03) : 281 - 306
  • [13] Towards Consistent Document-level Entity Linking: Joint Models for Entity Linking and Coreference Resolution
    Zaporojets, Klim
    Deleu, Johannes
    Jiang, Yiwei
    Demeester, Thomas
    Develder, Chris
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 778 - 784
  • [14] Linking Heterogeneous Data in the Semantic Web Using Scalable and Domain-Independent Candidate Selection
    Song, Dezhao
    Luo, Yi
    Heflin, Jeff
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (01) : 143 - 156
  • [15] Semantic Labeling: A Domain-Independent Approach
    Minh Pham
    Alse, Suresh
    Knoblock, Craig A.
    Szekely, Pedro
    SEMANTIC WEB - ISWC 2016, PT I, 2016, 9981 : 446 - 462
  • [16] A tool for domain-independent model mutation
    Gomez-Abajo, Pablo
    Guerra, Esther
    de Lara, Juan
    Merayo, Mercedes G.
    SCIENCE OF COMPUTER PROGRAMMING, 2018, 163 : 85 - 92
  • [17] A Domain-Independent Algorithm for Plan Adaptation
    Hanks, Steve
    Weld, Daniel S.
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1994, 2 : 319 - 360
  • [18] Examining the canvas as a domain-independent artifact
    Antunes, Pedro
    Tate, Mary
    INFORMATION SYSTEMS AND E-BUSINESS MANAGEMENT, 2022, 20 (03) : 495 - 514
  • [19] Examining the canvas as a domain-independent artifact
    Pedro Antunes
    Mary Tate
    Information Systems and e-Business Management, 2022, 20 : 495 - 514
  • [20] On the predictability of domain-independent temporal planners
    Cenamor, Isabel
    Vallati, Mauro
    Chrpa, Lukas
    COMPUTATIONAL INTELLIGENCE, 2019, 35 (04) : 745 - 773