Exploiting Linked Data for Open and Configurable Named Entity Extraction

被引:8
|
作者
Fafalios, Pavlos [1 ]
Baritakis, Manolis
Tzitzikas, Yannis
机构
[1] Fdn Res & Technol Hellas, Inst Comp Sci, Iraklion, Greece
关键词
Named entity extraction; named entity recognition; semantic annotation; linked data; entity mining; entity linking; WEB;
D O I
10.1142/S0218213015400126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named Entity Extraction (NEE) is the process of identifying entities in texts and, very commonly, linking them to related (Web) resources. This task is useful in several applications, e.g. for question answering, annotating documents, post-processing of search results, etc. However, existing NEE tools lack an open or easy configuration although this is very important for building domain-specific applications. For example, supporting a new category of entities, or specifying how to link the detected entities with online resources, is either impossible or very laborious. In this paper, we show how we can exploit semantic information (Linked Data) at real-time for configuring (handily) a NEE system and we propose a generic model for configuring such services. To explicitly define the semantics of the proposed model, we introduce an RDF/S vocabulary, called "Open NEE Configuration Model", which allows a NEE service to describe (and publish as Linked Data) its entity mining capabilities, but also to be dynamically configured. To allow relating the output of a NEE process with an applied configuration, we propose an extension of the Open Annotation Data Model which also enables an application to run advanced queries over the annotated data. As a proof of concept, we present X-Link, a fully-configurable NEE framework that realizes this approach. Contrary to the existing tools, X-Link allows the user to easily define the categories of entities that are interesting for the application at hand by exploiting one or more semantic Knowledge Bases. The user is also able to update a category and specify how to semantically link and enrich the identified entities. This enhanced configurability allows X-Link to be easily configured for different contexts for building domain-specific applications. To test the approach, we conducted a task-based evaluation with users that demonstrates its usability, and a case study that demonstrates its feasibility.
引用
收藏
页数:42
相关论文
共 50 条
  • [1] Exploiting Linked Open Data to Uncover Entity Types
    Gao, Jie
    Mazumdar, Suvodeep
    SEMANTIC WEB EVALUATION CHALLENGES, 2015, 548 : 51 - 62
  • [2] Arabic Named Entity Disambiguation Using Linked Open Data
    Al-Qawasmeh, Omar
    AL-Smadi, Mohammad
    Fraihat, Nisreen
    2016 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2016, : 333 - 338
  • [3] Enhancing the Open-Domain Classification of Named Entity Using Linked Open Data
    Ni, Yuan
    Zhang, Lei
    Qiu, Zhaoming
    Wang, Chen
    SEMANTIC WEB-ISWC 2010, PT I, 2010, 6496 : 566 - 581
  • [4] Configuring Named Entity Extraction through Real-Time Exploitation of Linked Data
    Fafalios, Pavlos
    Baritakis, Manolis
    Tzitzikas, Yannis
    4TH INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, MINING AND SEMANTICS, 2014,
  • [5] Chinese Data Extraction and Named Entity Recognition
    Yang, Tingwei
    Jiang, Daguang
    Shi, Shenghui
    Than, Siyan
    Zhuo, Lin
    Yin, Yukang
    Liang, Zheng
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (IEEE ICBDA 2020), 2020, : 105 - 109
  • [6] Exploiting anonymous entity mentions for named entity linking
    Feng Hou
    Ruili Wang
    See-Kiong Ng
    Michael Witbrock
    Fangyi Zhu
    Xiaoyun Jia
    Knowledge and Information Systems, 2023, 65 : 1221 - 1242
  • [7] A Named Entity Extraction System for Historical Financial Data
    Swaileh, Wassim
    Paquet, Thierry
    Adam, Sebastien
    Camacho, Andres Rojas
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 324 - 340
  • [8] Exploiting anonymous entity mentions for named entity linking
    Hou, Feng
    Wang, Ruili
    Ng, See-Kiong
    Witbrock, Michael
    Zhu, Fangyi
    Jia, Xiaoyun
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (03) : 1221 - 1242
  • [9] Improved named entity translation and bilingual named entity extraction
    Huang, F
    Vogel, S
    FOURTH IEEE INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, PROCEEDINGS, 2002, : 253 - 258
  • [10] One Class per Named Entity: Exploiting Unlabeled Text for Named Entity Recognition
    Wong, Yingchuan
    Ng, Hwee Tou
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1763 - 1768