Towards more effective techniques for automatic query expansion

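Authors
Carpineto, C [1]
Romano, G [1]
Affiliation
[1] Fdn Ugo Bordoni, I-00142 Rome, Italy
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract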
Techniques for automatic query expansion from top retrieved documents have recently shown promise for improving retrieval effectiveness on large collections, but there is still a lack of systematic evaluation and comparative studies. In this paper we focus on term-scoring methods based on the differences between the distribution of terms in (pseudo-)relevant documents and the distribution of terms in all documents, seen as a complement or an alternative to more conventional techniques. We show that when such distributional methods are used to select expansion terms within Rocchio's classical reweighting scheme, the overall performance is not likely to improve. However, we also show that when the same distributional methods are used to both select and weight expansion terms, the retrieval effectiveness may improve considerably. We then argue, based on their variation in performance on individual queries, that the sets of ranked terms suggested by individual distributional methods can be combined to further improve mean performance, by analogy with ensembling classifiers, and we present experimental evidence supporting this view. Taken together, our experiments show that with automatic query expansion it is possible to achieve performance gains as high as 21.34% over the non-expanded query (for non-interpolated average precision). We also discuss the effect that the main parameters involved in automatic query expansion, such as query difficulty, number of selected documents, and number of selected terms, have on retrieval effectiveness.
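The idea of using one distributional score both to select and to weight expansion terms can be illustrated with a minimal sketch. This record does not say which scoring functions the paper evaluates, so the Kullback-Leibler term contribution used here is an assumption chosen for illustration, as are the function names, the `beta` mixing parameter, and the normalization by the top score. None of it should be read as the authors' implementation.

```python
import math
from collections import Counter


def term_distribution(docs):
    """Relative term frequencies over a list of tokenized documents."""
    counts = Counter(tok for doc in docs for tok in doc)
    total = sum(counts.values())
    return {term: count / total for term, count in counts.items()}


def kld_scores(pseudo_relevant, collection):
    """Score each term t by p_R(t) * log(p_R(t) / p_C(t)).

    p_R is the term distribution in the pseudo-relevant documents and
    p_C the distribution in the whole collection (illustrative choice).
    """
    p_rel = term_distribution(pseudo_relevant)
    p_col = term_distribution(collection)
    return {
        term: p * math.log(p / p_col[term])
        for term, p in p_rel.items()
        if term in p_col
    }


def expand_query(query_terms, pseudo_relevant, collection, n_terms=2, beta=0.5):
    """Select the top-scoring terms and weight them by the same score.

    Original query terms start at weight 1.0; each selected term adds
    beta * (score / best score). beta is a hypothetical parameter.
    """
    scores = kld_scores(pseudo_relevant, collection)
    top = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:n_terms]
    best = top[0][1] if top and top[0][1] > 0 else 1.0
    weights = {term: 1.0 for term in query_terms}
    for term, score in top:
        if score > 0:
            weights[term] = weights.get(term, 0.0) + beta * score / best
    return weights


# Toy collection: the first two documents play the role of top retrieved ones.
collection = [
    ["query", "expansion", "retrieval"],
    ["query", "expansion", "feedback"],
    ["cooking", "pasta", "recipe"],
    ["pasta", "sauce", "recipe"],
]
weights = expand_query(["query"], collection[:2], collection, n_terms=2)
```

In this toy run the terms that are over-represented in the top documents relative to the collection receive the extra weight, while terms absent from the top documents are never considered. A selection-only variant would keep the same ranking but assign weights by a separate scheme such as Rocchio's.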
Pages: 126-141
Page count: 16