An information-theoretic approach to automatic query expansion

被引:195
|
作者
Carpineto, C
De Mori, R
Romano, G
Bigi, B
机构
[1] Fdn Ugo Bordoni, I-00142 Rome, Italy
[2] Univ Avignon, Lab Informat, F-84911 Avignon 9, France
关键词
information retrieval; automatic query expansion; pseudorelevance feedback; information theory;
D O I
10.1145/366836.366860
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Techniques for automatic query expansion from top retrieved documents have shown promise for improving retrieval effectiveness on large collections; however, they often rely on an empirical ground, and there is a shortage of cross-system comparisons. Using ideas from Information Theory, we present a computationally simple and theoretically justified method for assigning scores to candidate expansion terms. Such scores are used to select and weight expansion terms within Rocchio's framework for query reweighting. We compare ranking with information-theoretic query expansion versus ranking with other query expansion techniques, showing that the former achieves better retrieval effectiveness on several performance measures. We also discuss the effect on retrieval effectiveness of the main parameters involved in automatic query expansion, such as data sparseness, query difficulty, number of selected documents, and number of selected terms, pointing out interesting relationships.
引用
收藏
页码:1 / 27
页数:27
相关论文
共 50 条
  • [1] Information-Theoretic Approach to the Problem of Automatic Image Recognition
    Savchenko A.V.
    Journal of Mathematical Sciences, 2022, 267 (1) : 99 - 107
  • [2] Relevance feedback using weight propagation compared with information-theoretic query expansion
    Yamout, Fadi
    Oakes, Michael
    Tait, John
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 258 - +
  • [3] An Information-Theoretic Privacy Criterion for Query Forgery in Information Retrieval
    Rebollo-Monedero, David
    Parra-Arnau, Javier
    Forne, Jordi
    SECURITY TECHNOLOGY, 2011, 259 : 146 - 154
  • [4] An Information-theoretic Approach to Distribution Shifts
    Federici, Marco
    Tomioka, Ryota
    Forre, Patrick
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] An information-theoretic approach to steganography and watermarking
    Mittelholzer, T
    INFORMATION HIDING, PROCEEDINGS, 2000, 1768 : 1 - 16
  • [6] Information-theoretic approach to steganographic systems
    Ryabko, Boris
    Ryabko, Daniil
    2007 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS, VOLS 1-7, 2007, : 2461 - +
  • [7] An information-theoretic approach to band selection
    Ahlberg, J
    Renhorn, I
    Targets and Backgrounds XI: Characterization and Representation, 2005, 5811 : 15 - 23
  • [8] Information-Theoretic Approach to Bidirectional Scaling
    Boso, Francesca
    Tartakovsky, Daniel M.
    WATER RESOURCES RESEARCH, 2018, 54 (07) : 4916 - 4928
  • [9] An Information-theoretic Approach to Hardness Amplification
    Maurer, Ueli
    2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 948 - 952
  • [10] An information-theoretic approach to interactions in images
    Boccignone, G
    Ferraro, M
    SPATIAL VISION, 1999, 12 (03): : 345 - 362