An information-theoretic approach to automatic query expansion

被引:195
|
作者
Carpineto, C
De Mori, R
Romano, G
Bigi, B
机构
[1] Fdn Ugo Bordoni, I-00142 Rome, Italy
[2] Univ Avignon, Lab Informat, F-84911 Avignon 9, France
关键词
information retrieval; automatic query expansion; pseudorelevance feedback; information theory;
D O I
10.1145/366836.366860
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Techniques for automatic query expansion from top retrieved documents have shown promise for improving retrieval effectiveness on large collections; however, they often rely on an empirical ground, and there is a shortage of cross-system comparisons. Using ideas from Information Theory, we present a computationally simple and theoretically justified method for assigning scores to candidate expansion terms. Such scores are used to select and weight expansion terms within Rocchio's framework for query reweighting. We compare ranking with information-theoretic query expansion versus ranking with other query expansion techniques, showing that the former achieves better retrieval effectiveness on several performance measures. We also discuss the effect on retrieval effectiveness of the main parameters involved in automatic query expansion, such as data sparseness, query difficulty, number of selected documents, and number of selected terms, pointing out interesting relationships.
引用
收藏
页码:1 / 27
页数:27
相关论文
共 50 条
  • [31] Information-theoretic approach to atomic spin nonclassicality
    Dai, Hao
    Luo, Shunlong
    PHYSICAL REVIEW A, 2019, 100 (06)
  • [32] Information-theoretic approach to quantifying currency risk
    Fiedor, Pawel
    Holda, Artur
    JOURNAL OF RISK FINANCE, 2016, 17 (01) : 93 - 109
  • [33] An information-theoretic approach to microseismic source location
    Prange, Michael D.
    Bose, Sandip
    Kodio, Ousmane
    Djikpesse, Hugues A.
    GEOPHYSICAL JOURNAL INTERNATIONAL, 2015, 201 (01) : 193 - 206
  • [34] An Information-Theoretic Approach to Joint Sensing and Communication
    Ahmadipour, Mehrasa
    Kobayashi, Mari
    Wigger, Michele
    Caire, Giuseppe
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2024, 70 (02) : 1124 - 1146
  • [35] OBJECTIONS TO AN INFORMATION-THEORETIC APPROACH TO SYNCHRONICITY - REPLY
    BRAUDE, SE
    JOURNAL OF THE AMERICAN SOCIETY FOR PSYCHICAL RESEARCH, 1979, 73 (03): : 325 - 330
  • [36] An information-theoretic approach to combining object models
    Kruppa, H
    Schiele, B
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2002, 39 (3-4) : 195 - 203
  • [37] Information-theoretic approach to Fourier transform spectrometry
    Barducci, Alessandro
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA B-OPTICAL PHYSICS, 2011, 28 (04) : 637 - 648
  • [38] An information-theoretic approach to stochastic materials modeling
    Zabaras, Nicholas
    Sankaran, Sethuraman
    COMPUTING IN SCIENCE & ENGINEERING, 2007, 9 (02) : 30 - 39
  • [39] An information-theoretic approach for detecting communities in networks
    Yongli Li
    Chong Wu
    Zizheng Wang
    Quality & Quantity, 2015, 49 : 1719 - 1733
  • [40] Information-theoretic clustering: A representative and evolutionary approach
    Araujo, Daniel
    Doria Neto, Adriao
    Martins, Allan
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (10) : 4190 - 4205