A semantic approach to post-retrieval query performance prediction

被引:9
|
作者
Jafarzadeh, Parastoo [1 ]
Ensan, Faezeh [1 ]
机构
[1] Ryerson Univ, Toronto, ON, Canada
关键词
Query performance prediction; Semantic-enabled prediction; Post-retrieval prediction; Semantic information retrieval; SIMILARITY; DOCUMENTS; MODELS; WEB;
D O I
10.1016/j.ipm.2021.102746
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The importance of query performance prediction has been widely acknowledged in the literature, especially for query expansion, refinement, and interpolating different retrieval approaches. This paper proposes a novel semantics-based query performance prediction approach based on estimating semantic similarities between queries and documents. We introduce three post-retrieval predictors, namely (1) semantic distinction, (2) semantic query drift, and (3) semantic cohesion based on (1) the semantic similarity of a query to the top-ranked documents compared to the whole collection, (2) the estimation of non-query related aspects of the retrieved documents using semantic measures, and (3) the semantic cohesion of the retrieved documents. We assume that queries and documents are modeled as sets of entities from a knowledge graph, e.g., DBPedia concepts, instead of bags of words. With this assumption, semantic similarities between two texts are measured based on the relatedness between entities, which are learned from the contextual information represented in the knowledge graph. We empirically illustrate these predictors' effectiveness, especially when term-based measures fail to quantify query performance prediction hypotheses correctly. We report our findings on the proposed predictors' performance and their interpolation on three standard collections, namely ClueWeb09-B, ClueWeb12-B, and Robust04. We show that the proposed predictors are effective across different datasets in terms of Pearson and Kendall correlation coefficients between the predicted performance and the average precision measured by relevance judgments.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Post-retrieval extinction as reconsolidation interference: methodological issues or boundary conditions?
    Auber, Alessia
    Tedesco, Vincenzo
    Jones, Carolyn E.
    Monfils, Marie-H.
    Chiamulera, Christian
    PSYCHOPHARMACOLOGY, 2013, 226 (04) : 631 - 647
  • [42] An Abrupt Transformation of Phobic Behavior After a Post-Retrieval Amnesic Agent
    Soeter, Marieke
    Kindt, Merel
    BIOLOGICAL PSYCHIATRY, 2015, 78 (12) : 880 - 886
  • [43] A Machine Learning Approach to SPARQL Query Performance Prediction
    Hasan, Rakebul
    Gandon, Fabien
    2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2014, : 266 - 273
  • [44] A Neural Networks Approach to SPARQL Query Performance Prediction
    Amat, Daniel Arturo Casal
    Buil-Aranda, Carlos
    Valle-Vidal, Carlos
    2021 XLVII LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2021), 2021,
  • [45] A contrastive neural disentanglement approach for query performance prediction
    Salamat, Sara
    Arabzadeh, Negar
    Seyedsalehi, Shirin
    Bigdeli, Amin
    Zihayat, Morteza
    Bagheri, Ebrahim
    MACHINE LEARNING, 2025, 114 (04)
  • [47] EFFICIENT QUERY KEYWORD INTERPRETATION FOR SEMANTIC INFORMATION RETRIEVAL
    Setia, Sonia
    Verma, Jyoti
    Duhan, Neelam
    IIOAB JOURNAL, 2020, 11 (02) : 64 - 68
  • [48] Semantic thesaurus for automatic expanded query in information retrieval
    Gonzalez, M
    de Lima, VLS
    EIGHTH SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2001, : 68 - 75
  • [49] SQORE: A framework for semantic query based ontology retrieval
    Anutariya, Chutiporn
    Ungrangsi, Rachanee
    Wuwongse, Vilas
    ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 924 - +
  • [50] Semantic thesaurus for automatic expanded query in information retrieval
    Gonzalez, Marco
    De Lima, Vera Lúcia Strube
    Proceedings - 8th Symposium on String Processing and Information Retrieval, SPIRE 2001, 2001, : 68 - 75