A Purely Entity-Based Semantic Search Approach for Document Retrieval

Cited by: 2
Authors
Sidi, Mohamed Lemine [1]
Gunal, Serkan [1]
Affiliations
[1] Eskisehir Tech Univ, Dept Comp Engn, TR-26555 Eskisehir, Turkiye
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 18
Keywords
information retrieval; document retrieval; knowledge graphs; entity-based search; entity linking; WORD;
DOI
10.3390/app131810285
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Over the past decade, knowledge bases (KB) have been increasingly utilized to complete and enrich the representation of queries and documents in order to improve the document retrieval task. Although many approaches have used KB for such purposes, the problem of how to effectively leverage entity-based representation still needs to be resolved. This paper proposes a Purely Entity-based Semantic Search Approach for Information Retrieval (PESS4IR) as a novel solution. The approach includes (i) its own entity linking method and (ii) an inverted indexing method, and for document retrieval and ranking, (iii) an appropriate ranking method is designed to take advantage of all the strengths of the approach. We report the findings on the performance of our approach, which is tested by queries annotated by two known entity linking tools, REL and DBpedia-Spotlight. The experiments are performed on the standard TREC 2004 Robust and MSMARCO collections. By using the REL method on the Robust collection, for the queries whose terms are all annotated and whose average annotation scores are greater than or equal to 0.75, our approach achieves the maximum nDCG@5 score (1.00). Also, it is shown that using PESS4IR alongside another document retrieval method would improve performance, unless that method alone achieves the maximum nDCG@5 score for those highly annotated queries.
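The nDCG@5 figure quoted in the abstract can be made concrete with a small worked example. The following is a minimal sketch, not the authors' evaluation code: it assumes the linear-gain form of DCG (relevance divided by log2(rank + 1)), and the function names `dcg_at_k` and `ndcg_at_k` and the relevance grades are hypothetical, chosen only for illustration. It shows that a score of 1.00 means the top five results are ordered as well as any ordering of the judged documents could be.

```python
import math


def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the first k relevance grades.

    Assumption: linear gain, i.e. rel / log2(rank + 1) with ranks starting at 1.
    """
    return sum(
        rel / math.log2(rank + 1)
        for rank, rel in enumerate(relevances[:k], start=1)
    )


def ndcg_at_k(ranked_relevances, judged_relevances, k=5):
    """nDCG@k: DCG of the returned ranking divided by the ideal DCG.

    ranked_relevances: grades of the returned documents, in ranked order.
    judged_relevances: grades of all judged documents for the query,
                       used to build the ideal (best possible) ranking.
    """
    ideal = sorted(judged_relevances, reverse=True)
    ideal_dcg = dcg_at_k(ideal, k)
    if ideal_dcg == 0:
        # No relevant documents were judged for this query.
        return 0.0
    return dcg_at_k(ranked_relevances, k) / ideal_dcg


# Hypothetical relevance judgments for one query (2 = highly relevant).
judged = [2, 1, 1, 0, 0, 0]

perfect_run = [2, 1, 1, 0, 0]   # best documents ranked first
weaker_run = [0, 1, 2, 0, 0]    # same documents, worse order

perfect_score = ndcg_at_k(perfect_run, judged)
weaker_score = ndcg_at_k(weaker_run, judged)
```

Here `perfect_score` reaches the maximum because the returned order matches the ideal order, while `weaker_score` falls below it because the most relevant document appears only at rank 3.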
Pages: 28
Related papers
50 in total
  • [41] Semantic Entity Search Diversification
    Ruotsalo, Tuukka
    Frosterus, Matias
    2013 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2013), 2013, : 32 - 39
  • [42] Semantic based entity retrieval and disambiguation system for Twitter streams
    Kumar, Narayanasamy Senthil
    Dinakaran, Muruganantham
    KNOWLEDGE MANAGEMENT & E-LEARNING-AN INTERNATIONAL JOURNAL, 2019, 11 (02) : 262 - 280
  • [43] Entity-based noun phrase coreference resolution
    Yang, XF
    Su, J
    Yang, LP
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 218 - 221
  • [44] Entity-based Neural Local Coherence Modeling
    Jeon, Sungho
    Strube, Michael
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 7787 - 7805
  • [45] Controlled entity-based access control technique
    Yang, Ximin
    Xie, Changsheng
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2007, 35 (08) : 56 - 59
  • [46] Bringing Precision to Office Document Search by Semantic Relationship Approach
    Chatvichienchai, Somchai
    Tanaka, Katsumi
    ADVANCES IN INFORMATION TECHNOLOGY, PROCEEDINGS, 2009, 55 : 48 - +
  • [47] An Approach of Entity Alignment Based on Semantic Features
    Wan, Jing
    Li, Lin
    Wang, Shaohua
    Wang, Xiaofang
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION, CYBERNETICS AND COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2017, : 170 - 174
  • [48] Legal Information Retrieval System with Entity-Based Query Expansion: Case study in Traffic Accident Litigation
    Catacora, Joel
    Casali, Ana
    Deco, Claudia
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2022, 22 (02) : 151 - 163
  • [49] ANNE: Improving Source Code Search using Entity Retrieval Approach
    Vinayakarao, Venkatesh
    Sarma, Anita
    Purandare, Rahul
    Jain, Shuktika
    Jain, Saumya
    WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 211 - 220
  • [50] Encapsulation and entity-based approach of interconnection between sensor platform and middleware of pervasive computing
    Lim, Shinyoung
    Helal, Abdelsalam
    UBIQUITOUS COMPUTING SYSTEMS, PROCEEDINGS, 2006, 4239 : 500 - 515