A Purely Entity-Based Semantic Search Approach for Document Retrieval

被引:2
|
作者
Sidi, Mohamed Lemine [1 ]
Gunal, Serkan [1 ]
机构
[1] Eskisehir Tech Univ, Dept Comp Engn, TR-26555 Eskisehir, Turkiye
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 18期
关键词
information retrieval; document retrieval; knowledge graphs; entity-based search; entity linking; WORD;
D O I
10.3390/app131810285
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Over the past decade, knowledge bases (KB) have been increasingly utilized to complete and enrich the representation of queries and documents in order to improve the document retrieval task. Although many approaches have used KB for such purposes, the problem of how to effectively leverage entity-based representation still needs to be resolved. This paper proposes a Purely Entity-based Semantic Search Approach for Information Retrieval (PESS4IR) as a novel solution. The approach includes (i) its own entity linking method and (ii) an inverted indexing method, and for document retrieval and ranking, (iii) an appropriate ranking method is designed to take advantage of all the strengths of the approach. We report the findings on the performance of our approach, which is tested by queries annotated by two known entity linking tools, REL and DBpedia-Spotlight. The experiments are performed on the standard TREC 2004 Robust and MSMARCO collections. By using the REL method on the Robust collection, for the queries whose terms are all annotated and whose average annotation scores are greater than or equal to 0.75, our approach achieves the maximum nDCG@5 score (1.00). Also, it is shown that using PESS4IR alongside another document retrieval method would improve performance, unless that method alone achieves the maximum nDCG@5 score for those highly annotated queries.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Entity-Based Relevance Feedback for Document Retrieval
    Sheetrit, Eilon
    Raiber, Fiana
    Kurland, Oren
    PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 177 - 187
  • [2] Document Retrieval Using Entity-Based Language Models
    Raviv, Hadas
    Kurland, Oren
    Carmel, David
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 65 - 74
  • [3] Entity-Based Retrieval
    Raviv, Hadas
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1277 - 1277
  • [4] TweetSpector: Entity-based retrieval of Tweets
    Yerva, Surender Reddy
    Miklos, Zoltan
    Grosan, Flavia
    Tandrau, Alexandru
    Aberer, Karl
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1016 - 1016
  • [5] Entity-Based Language Model Smoothing Approach for Smart Search
    Zhao, Feng
    Tian, Zeliang
    Jin, Hai
    IEEE ACCESS, 2018, 6 : 9991 - 10002
  • [6] Fuzzy Named Entity-Based Document Clustering
    Cao, Tru H.
    Do, Hai T.
    Hong, Dung T.
    Quan, Tho T.
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 2030 - 2036
  • [7] Entity Linking and Retrieval for Semantic Search
    Meij, Edgar
    Balog, Krisztian
    Odijk, Daan
    WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 683 - 683
  • [8] Entity-based keyword search in web documents
    Sartori E.
    Velegrakis Y.
    Guerra F.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, 9630 : 21 - 49
  • [9] Document-level Entity-based Extraction as Template Generation
    Huang, Kung-Hsiang
    Tang, Sam
    Peng, Nanyun
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5257 - 5269
  • [10] Entity-Based Classification of Web Page in Search Engine
    Liu, Yicen
    Liu, Mingrong
    Xiang, Liang
    Yang, Qing
    Digital Libraries: Universal and Ubiquitous Access to Information, Proceedings, 2008, 5362 : 410 - 411