Named Entity Based Ranking with Term Proximity for XML Retrieval

被引:1
|
作者
Roko, Abubakar [1 ]
Doraisamy, Shyamala [2 ]
Azman, Azreen [1 ]
Jantan, Azrul Hazri [2 ]
机构
[1] Univ Putra Malaysia, Serdang, Malaysia
[2] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Dept Multimedia, Serdang, Malaysia
关键词
BM25F; Keyword Query; Named Entity Category; Ranking Function;
D O I
10.4018/IJIRR.2018040104
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, an indexing scheme that includes the named entity category for each indexed term is proposed. Based on this, two methods are proposed, one to infer the semantics of an XML element based on its data content, called the confidence value of the element, and the second method computes the proximity scores of the query terms. The confidence value of an element is obtained based on the probability of a named entity category in the data content of the underlying XML element. The proximity score of the query terms measures the proximity and ordering of the query term within an XML element. The article then shows how a ranking function uses the confidence value of an XML element and proximity score to mitigate the impact of higher frequency terms and compute the relevance between a keyword query and an XML fragment. Finally, a keyword search system is introduced and experiments show that the proposed system outperforms existing approaches in terms of search quality and achieve a higher efficiency.
引用
收藏
页码:57 / 77
页数:21
相关论文
共 50 条
  • [1] Term Relevance Feedback for Contextual Named Entity Retrieval
    Sarwar, Sheikh Muhammad
    Foley, John
    Allan, James
    CHIIR'18: PROCEEDINGS OF THE 2018 CONFERENCE ON HUMAN INFORMATION INTERACTION & RETRIEVAL, 2018, : 301 - 304
  • [2] Graph Ranking for Collective Named Entity Disambiguation
    Alhelbawy, Ayman
    Gaizauskas, Rob
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 75 - 80
  • [3] Named Entity Based Document Similarity with SVM-Based Re-ranking for Entity Linking
    Alhelbawy, Ayman
    Gaizauskas, Rob
    ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS, 2012, 322 : 379 - 388
  • [4] NESM: a Named Entity based Proximity Measure for Multilingual News Clustering
    Montalvo, Soto
    Fresno, Victor
    Martinez, Raquel
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (48): : 81 - 88
  • [5] Configurable ranking for XML vague retrieval
    School of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, China
    不详
    J. Comput. Inf. Syst., 2009, 2 (683-692):
  • [6] Unsupervised Ranking of Knowledge Bases for Named Entity Recognition
    Mrabet, Yassine
    Kilicoglu, Halil
    Demner-Fushman, Dina
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1248 - 1255
  • [7] A ranking scheme for XML information retrieval based on benefit and reading effort
    Shimizu, Toshiyuki
    Yoshikawa, Masatoshi
    ASIAN DIGITAL LIBRARIES: LOOKING BACK 10 YEARS AND FORGING NEW FRONTIERS, PROCEEDINGS, 2007, 4822 : 230 - 240
  • [8] Ranking and Fusion Approaches for XML Book Retrieval
    Larson, Ray R.
    FOCUSED RETRIEVAL AND EVALUATION, 2010, 6203 : 179 - 189
  • [9] A framework for XML web services retrieval with ranking
    Lee, Kyong-Ha
    Lee, Mi-young
    Hwang, Yun-Young
    Lee, Kyu-Chul
    MUE: 2007 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2007, : 773 - +
  • [10] A neural-based re-ranking model for Chinese named entity recognition
    Guo J.
    Han Y.
    Ke Y.
    International Journal of Reasoning-based Intelligent Systems, 2019, 11 (03): : 265 - 272