Query Variability and Experimental Consistency: A Concerning Case Study

被引:0
|
作者
Rashidi, Lida [1 ,2 ]
Zobel, Justin [1 ]
Moffat, Alistair [1 ]
机构
[1] Univ Melbourne, Melbourne, Vic, Australia
[2] RMIT Univ, Melbourne, Vic, Australia
基金
澳大利亚研究理事会;
关键词
Evaluation; significance testing;
D O I
10.1145/3664190.3672519
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In offline experimentation, the effectiveness of a search engine is evaluated using a document collection, a set of queries against that collection, a set of relevance judgments connecting the documents and the queries, and an effectiveness metric. This measurement pipeline is used as a surrogate for user satisfaction - the extent to which the system provides useful information to the users that are issuing the queries. But queries are responses to information needs, or topics, and there can be a wide variety of ways in which any given information need can be expressed as a query. That one-to-many relationship suggests that, in an IR experiment, use of any single query to represent a topic may be insufficient. In this case study, we demonstrate that this practice is indeed a weakness, by showing that the TREC 2013 and 2014 Web track queries, which are regarded as being indicative of specific information needs, are not necessarily representative of crowd-generated queries for the same underlying needs, and can give rise to inconsistent system relativities when compared to user-generated queries. From this instance we must thus note an element of concern: that current test collection design strategies can lead to effectiveness results that are at odds with those experienced by typical non-expert users.
引用
收藏
页码:35 / 41
页数:7
相关论文
共 50 条
  • [1] REPLY TO QUERY CONCERNING SCIATIC-NERVE BLOCK STUDY
    BAILEY, SL
    LITTLE, WL
    REGIONAL ANESTHESIA, 1995, 20 (01) : 81 - 82
  • [2] The consistency of fairness rules: An experimental study
    Ubeda, Paloma
    JOURNAL OF ECONOMIC PSYCHOLOGY, 2014, 41 : 88 - 100
  • [3] Experimental Study on a Medium Consistency Pump
    Ma, X. D.
    Li, Z. F.
    Yu, H.
    Wu, D. Z.
    Wang, L. Q.
    JOURNAL OF FLUIDS ENGINEERING-TRANSACTIONS OF THE ASME, 2013, 135 (10):
  • [4] The consistency dimension, compactness and query learning
    Balcázar, JL
    COMPUTER SCIENCE LOGIC, PROCEEDINGS, 1999, 1683 : 2 - 13
  • [5] CONSISTENCY OF SPATIAL DATABASE QUERY RESULTS
    MAINGUENAUD, M
    COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 1994, 18 (05) : 333 - 342
  • [6] Experimental results concerning variability of several quantitative characters in hop
    Cernea, S
    Muntean, LS
    Salontai, A
    Morar, G
    Duda, M
    Vârban, D
    Muste, S
    Vârban, R
    Muntean, S
    Bulletin of the University of Agricultural Sciences and Veterinary Medicine, Vol 57, 2002, 57 : 107 - 114
  • [7] CONSISTENCY RESULTS CONCERNING SUPERCOMPACTNESS
    MENAS, TK
    TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1976, 223 (OCT) : 61 - 91
  • [8] INVESTIGATIONS CONCERNING THE CONSISTENCY OF YOGURT
    GALESLOOT, TE
    NEDERLANDS MELK-EN ZUIVELTIJDSCHRIFT, 1958, 12 (02): : 130 - 165
  • [9] Concerning Clairvoyance - A critical experimental Study
    Henning, Hans
    ZEITSCHRIFT FUR PSYCHOLOGIE UND PHYSIOLOGIE DER SINNESORGANE, 1918, 79 : 297 - 297
  • [10] QUERY CONCERNING 2 MISNOMERS AND MISCONCEPTIONS
    GRAY, W
    KUSANO, S
    SOCIAL SCIENCE, 1976, 51 (03): : 170 - 171