Query Performance Prediction for Neural IR: Are We There Yet?

被引:6
|
作者
Faggioli, Guglielmo [1 ]
Formal, Thibault [2 ,3 ]
Marchesin, Stefano [1 ]
Clinchant, Stephane [2 ]
Ferro, Nicola [1 ]
Piwowarski, Benjamin [3 ,4 ]
机构
[1] Univ Padua, Padua, Italy
[2] Naver Labs Europe, Meylan, France
[3] Sorbonne Univ, ISIR, Paris, France
[4] CNRS, Paris, France
基金
欧盟地平线“2020”;
关键词
DIVERGENCE;
D O I
10.1007/978-3-031-28244-7_15
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Evaluation in Information Retrieval (IR) relies on post-hoc empirical procedures, which are time-consuming and expensive operations. To alleviate this, Query Performance Prediction (QPP) models have been developed to estimate the performance of a system without the need for human-made relevance judgements. Such models, usually relying on lexical features from queries and corpora, have been applied to traditional sparse IR methods - with various degrees of success. With the advent of neural IR and large Pre-trained Language Models, the retrieval paradigm has significantly shifted towards more semantic signals. In this work, we study and analyze to what extent current QPP models can predict the performance of such systems. Our experiments consider seven traditional bag-of-words and seven BERT-based IR approaches, as well as nineteen state-of-the-art QPPs evaluated on two collections, Deep Learning '19 and Robust '04. Our findings show that QPPs perform statistically significantly worse on neural IR systems. In settings where semantic signals are prominent (e.g., passage retrieval), their performance on neural models drops by as much as 10% compared to bagof-words approaches. On top of that, in lexical-oriented scenarios, QPPs fail to predict performance for neural IR systems on those queries where they differ from traditional approaches the most.
引用
收藏
页码:232 / 248
页数:17
相关论文
共 50 条
  • [1] SIGIR 2012 Tutorial: Query Performance Prediction for IR
    Carmel, David
    Kurland, Oren
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1196 - 1197
  • [2] A Neural Networks Approach to SPARQL Query Performance Prediction
    Amat, Daniel Arturo Casal
    Buil-Aranda, Carlos
    Valle-Vidal, Carlos
    2021 XLVII LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2021), 2021,
  • [3] A contrastive neural disentanglement approach for query performance prediction
    Salamat, Sara
    Arabzadeh, Negar
    Seyedsalehi, Shirin
    Bigdeli, Amin
    Zihayat, Morteza
    Bagheri, Ebrahim
    MACHINE LEARNING, 2025, 114 (04)
  • [4] Risk Prediction Are We There Yet?
    Jensen, Jesper K.
    CIRCULATION, 2016, 134 (19) : 1441 - 1443
  • [5] Query performance prediction
    He, Ben
    Ounis, Iadh
    INFORMATION SYSTEMS, 2006, 31 (07) : 585 - 594
  • [6] Towards Query Performance Prediction for Neural Information Retrieval: Challenges and Opportunities
    Faggioli, Guglielmo
    Formal, Thibault
    Lupart, Simon
    Marchesin, Stefano
    Clinchant, Stephane
    Ferro, Nicola
    Piwowarski, Benjamin
    PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 51 - 63
  • [7] Unsupervised Query Performance Prediction for Neural Models with Pairwise Rank Preferences
    Singh, Ashutosh
    Ganguly, Debasis
    Datta, Suchana
    Macdonald, Craig
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2486 - 2490
  • [8] Dementia Risk Prediction: Are We There Yet?
    Kamat, Sanjeev M.
    Kamat, Anjali S.
    Grossberg, George T.
    CLINICS IN GERIATRIC MEDICINE, 2010, 26 (01) : 113 - +
  • [9] Crystal structure prediction: are we there yet?
    Cruz-Cabeza, Aurora J.
    ACTA CRYSTALLOGRAPHICA SECTION B-STRUCTURAL SCIENCE CRYSTAL ENGINEERING AND MATERIALS, 2016, 72 : 437 - 438
  • [10] Genetic Risk Prediction - Are We There Yet?
    Kraft, Peter
    Hunter, David J.
    NEW ENGLAND JOURNAL OF MEDICINE, 2009, 360 (17): : 1701 - 1703