Query Variability and Experimental Consistency: A Concerning Case Study

Authors
Rashidi, Lida [1, 2]
Zobel, Justin [1 ]
Moffat, Alistair [1 ]
Affiliations
[1] Univ Melbourne, Melbourne, Vic, Australia
[2] RMIT Univ, Melbourne, Vic, Australia
Source
PROCEEDINGS OF THE 2024 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2024 | 2024
Funding
Australian Research Council
Keywords
Evaluation; significance testing;
DOI
10.1145/3664190.3672519
Chinese Library Classification number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
In offline experimentation, the effectiveness of a search engine is evaluated using a document collection, a set of queries against that collection, a set of relevance judgments connecting the documents and the queries, and an effectiveness metric. This measurement pipeline is used as a surrogate for user satisfaction - the extent to which the system provides useful information to the users that are issuing the queries. But queries are responses to information needs, or topics, and there can be a wide variety of ways in which any given information need can be expressed as a query. That one-to-many relationship suggests that, in an IR experiment, use of any single query to represent a topic may be insufficient. In this case study, we demonstrate that this practice is indeed a weakness, by showing that the TREC 2013 and 2014 Web track queries, which are regarded as being indicative of specific information needs, are not necessarily representative of crowd-generated queries for the same underlying needs, and can give rise to inconsistent system relativities when compared to user-generated queries. From this instance we must thus note an element of concern: that current test collection design strategies can lead to effectiveness results that are at odds with those experienced by typical non-expert users.
Pages: 35-41
Number of pages: 7