Query Performance Prediction for Neural IR: Are We There Yet?

被引:6
|
作者
Faggioli, Guglielmo [1 ]
Formal, Thibault [2 ,3 ]
Marchesin, Stefano [1 ]
Clinchant, Stephane [2 ]
Ferro, Nicola [1 ]
Piwowarski, Benjamin [3 ,4 ]
机构
[1] Univ Padua, Padua, Italy
[2] Naver Labs Europe, Meylan, France
[3] Sorbonne Univ, ISIR, Paris, France
[4] CNRS, Paris, France
基金
欧盟地平线“2020”;
关键词
DIVERGENCE;
D O I
10.1007/978-3-031-28244-7_15
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Evaluation in Information Retrieval (IR) relies on post-hoc empirical procedures, which are time-consuming and expensive operations. To alleviate this, Query Performance Prediction (QPP) models have been developed to estimate the performance of a system without the need for human-made relevance judgements. Such models, usually relying on lexical features from queries and corpora, have been applied to traditional sparse IR methods - with various degrees of success. With the advent of neural IR and large Pre-trained Language Models, the retrieval paradigm has significantly shifted towards more semantic signals. In this work, we study and analyze to what extent current QPP models can predict the performance of such systems. Our experiments consider seven traditional bag-of-words and seven BERT-based IR approaches, as well as nineteen state-of-the-art QPPs evaluated on two collections, Deep Learning '19 and Robust '04. Our findings show that QPPs perform statistically significantly worse on neural IR systems. In settings where semantic signals are prominent (e.g., passage retrieval), their performance on neural models drops by as much as 10% compared to bagof-words approaches. On top of that, in lexical-oriented scenarios, QPPs fail to predict performance for neural IR systems on those queries where they differ from traditional approaches the most.
引用
收藏
页码:232 / 248
页数:17
相关论文
共 50 条
  • [41] Query Variation Performance Prediction for Systematic Reviews
    Scells, Harrisen
    Azzopardi, Leif
    Zuccon, Guido
    Koopman, Bevan
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 1089 - 1092
  • [42] Query Performance Prediction Using Reference Lists
    Shtok, Anna
    Kurland, Oren
    Carmel, David
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2016, 34 (04)
  • [43] Information Needs, Queries, and Query Performance Prediction
    Zendel, Oleg
    Shtok, Anna
    Rabier, Fiana
    Kurland, Oren
    Culpepper, J. Shane
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 395 - 404
  • [44] Query Performance Prediction using Passage Information
    Roitman, Haggai
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 893 - 896
  • [45] Keyphrase extraction through query performance prediction
    Ercan, Gonenc
    Cicekli, Ilyas
    JOURNAL OF INFORMATION SCIENCE, 2012, 38 (05) : 476 - 488
  • [46] Explainable Just-In-Time Bug Prediction: Are We There Yet?
    Aleithan, Reem
    2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2021), 2021, : 129 - 131
  • [47] Epigenomics and Transcriptomics in the Prediction and Diagnosis of Childhood Asthma: Are We There Yet?
    Forno, Erick
    Celedon, Juan C.
    FRONTIERS IN PEDIATRICS, 2019, 7
  • [48] Diagnosing Appendicitis on the Basis of Clinical Prediction Rules: Are We There Yet?
    Bolia, Rishi
    INDIAN JOURNAL OF PEDIATRICS, 2023, 90 (12): : 1173 - 1174
  • [49] An Analysis of Variations in the Effectiveness of Query Performance Prediction
    Ganguly, Debasis
    Datta, Suchana
    Mitra, Mandar
    Greene, Derek
    ADVANCES IN INFORMATION RETRIEVAL, PT I, 2022, 13185 : 215 - 229
  • [50] Genetics and Breast Cancer Risk Prediction-Are We There Yet?
    Cook, Nancy R.
    Paynter, Nina P.
    JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2010, 102 (21): : 1605 - 1606