Named Entity Based Ranking with Term Proximity for XML Retrieval

被引:1
|
作者
Roko, Abubakar [1 ]
Doraisamy, Shyamala [2 ]
Azman, Azreen [1 ]
Jantan, Azrul Hazri [2 ]
机构
[1] Univ Putra Malaysia, Serdang, Malaysia
[2] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Dept Multimedia, Serdang, Malaysia
关键词
BM25F; Keyword Query; Named Entity Category; Ranking Function;
D O I
10.4018/IJIRR.2018040104
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, an indexing scheme that includes the named entity category for each indexed term is proposed. Based on this, two methods are proposed, one to infer the semantics of an XML element based on its data content, called the confidence value of the element, and the second method computes the proximity scores of the query terms. The confidence value of an element is obtained based on the probability of a named entity category in the data content of the underlying XML element. The proximity score of the query terms measures the proximity and ordering of the query term within an XML element. The article then shows how a ranking function uses the confidence value of an XML element and proximity score to mitigate the impact of higher frequency terms and compute the relevance between a keyword query and an XML fragment. Finally, a keyword search system is introduced and experiments show that the proposed system outperforms existing approaches in terms of search quality and achieve a higher efficiency.
引用
收藏
页码:57 / 77
页数:21
相关论文
共 50 条
  • [11] NAME - A Rich XML Format for Named Entity and Relation Tagging
    Clausner, Christian
    Pletschacher, Stefan
    Antonacopoulos, Apostolos
    PROCEEDINGS OF THE 2023 INTERNATIONAL WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING, HIP 2023, 2023, : 91 - 96
  • [12] Structured Named Entity Retrieval in Audio Broadcast News
    Zidouni, Azeddine
    Quafafou, Mohamed
    Glotin, Herve
    CBMI: 2009 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, 2009, : 126 - +
  • [13] ComRank: Metasearch and automatic ranking of XML retrieval system
    Woodley, A
    Geva, S
    2005 INTERNATIONAL CONFERENCE ON CYBERWORLDS, PROCEEDINGS, 2005, : 147 - 154
  • [14] Component ranking and automatic query refinement for XML retrieval
    Mass, Y
    Mandelbrod, M
    ADVANCES IN XML INFORMATION RETRIEVAL, 2005, 3493 : 73 - 84
  • [15] Re-ranking for Joint Named-Entity Recognition and Linking
    Sil, Avirup
    Yates, Alexander
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2369 - 2374
  • [16] Discovering Significant Persons, Locations and Organizations through Named Entity Ranking
    Su, Xing
    Mo, Songhai
    Wang, Hui
    Zhang, Xin
    2012 FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY (MINES 2012), 2012, : 328 - 331
  • [17] Ranking algorithms for named-entity extraction: Boosting and the voted perceptron
    Collins, M
    40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 489 - 496
  • [18] Named Entity Recognition for Improving Retrieval and Translation of Chinese Documents
    Srihari, Rohini K.
    Peterson, Erik
    DIGITAL LIBRARIES: UNIVERSAL AND UBIQUITOUS ACCESS TO INFORMATION, PROCEEDINGS, 2008, 5362 : 404 - +
  • [19] The impact of named entity normalization on information retrieval for question answering
    Khalid, Mahboob Alam
    Jijkoun, Valentin
    de Rijke, Maarten
    ADVANCES IN INFORMATION RETRIEVAL, 2008, 4956 : 705 - 710
  • [20] Named Entity Recognition Approach for Malay Crime News Retrieval
    Saad, Saidah
    Mansor, Mohamed Kamil
    GEMA ONLINE JOURNAL OF LANGUAGE STUDIES, 2018, 18 (04): : 216 - 235