An information-theoretic approach to automatic query expansion

Cited by: 195
Authors
Carpineto, C
De Mori, R
Romano, G
Bigi, B
Affiliations
[1] Fdn Ugo Bordoni, I-00142 Rome, Italy
[2] Univ Avignon, Lab Informat, F-84911 Avignon 9, France
Keywords
information retrieval; automatic query expansion; pseudorelevance feedback; information theory
DOI
10.1145/366836.366860
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Techniques for automatic query expansion from top retrieved documents have shown promise for improving retrieval effectiveness on large collections; however, they often rely on an empirical ground, and there is a shortage of cross-system comparisons. Using ideas from Information Theory, we present a computationally simple and theoretically justified method for assigning scores to candidate expansion terms. Such scores are used to select and weight expansion terms within Rocchio's framework for query reweighting. We compare ranking with information-theoretic query expansion versus ranking with other query expansion techniques, showing that the former achieves better retrieval effectiveness on several performance measures. We also discuss the effect on retrieval effectiveness of the main parameters involved in automatic query expansion, such as data sparseness, query difficulty, number of selected documents, and number of selected terms, pointing out interesting relationships.
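The abstract does not give the scoring formula, so the following Python sketch only illustrates one way such a pipeline could look. It assumes that a candidate term's score is its contribution to the Kullback-Leibler divergence between the term distribution of the top retrieved (pseudo-relevant) documents and that of the whole collection, and that the scores are max-normalized before being added to the original query weights in a simplified Rocchio-style update. The function names, the `alpha` and `beta` parameters, and the normalization are assumptions made for illustration, not details taken from this record.

```python
import math
from collections import Counter


def kld_term_scores(feedback_docs, collection_docs):
    """Score terms by p_R(t) * log(p_R(t) / p_C(t)).

    p_R is the term distribution in the pseudo-relevant documents and
    p_C the term distribution in the whole collection (both assumed here
    to be simple maximum-likelihood estimates over tokenized documents).
    """
    fb_counts = Counter(t for doc in feedback_docs for t in doc)
    coll_counts = Counter(t for doc in collection_docs for t in doc)
    fb_total = sum(fb_counts.values())
    coll_total = sum(coll_counts.values())

    scores = {}
    for term, count in fb_counts.items():
        coll_count = coll_counts.get(term, 0)
        if coll_count == 0:
            # Feedback documents are assumed to belong to the collection;
            # terms that violate this are skipped rather than smoothed.
            continue
        p_r = count / fb_total
        p_c = coll_count / coll_total
        scores[term] = p_r * math.log(p_r / p_c)
    return scores


def expand_query(query_weights, scores, num_terms=10, alpha=1.0, beta=1.0):
    """Simplified Rocchio-style reweighting (hypothetical variant).

    Keeps the num_terms best-scoring terms with a positive score and adds
    beta * (score / max score) to alpha * (original query weight).
    """
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    selected = [(t, s) for t, s in ranked[:num_terms] if s > 0]
    expanded = {t: alpha * w for t, w in query_weights.items()}
    if not selected:
        return expanded
    max_score = selected[0][1]
    for term, score in selected:
        expanded[term] = expanded.get(term, 0.0) + beta * score / max_score
    return expanded


# Toy data: four tokenized documents, two of them treated as top retrieved.
collection = [
    ["jaguar", "car", "speed"],
    ["jaguar", "cat", "jungle"],
    ["stock", "market", "car"],
    ["cat", "food", "pet"],
]
feedback = [collection[0], collection[2]]

scores = kld_term_scores(feedback, collection)
expanded = expand_query({"car": 1.0}, scores, num_terms=2)
```

In this toy run, a term that is as frequent in the feedback documents as in the collection ("jaguar") scores zero and is not selected, while terms concentrated in the feedback documents receive positive weights.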
Pages: 1-27
Page count: 27