Towards more effective techniques for automatic query expansion

Cited by: 0
Authors
Carpineto, C [1]
Romano, G [1]
Affiliations
[1] Fdn Ugo Bordoni, I-00142 Rome, Italy
Source
RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS | 1999 / Vol. 1696
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Techniques for automatic query expansion from top retrieved documents have recently shown promise for improving retrieval effectiveness on large collections, but there is still a lack of systematic evaluation and comparative studies. In this paper we focus on term-scoring methods based on the differences between the distribution of terms in (pseudo-)relevant documents and the distribution of terms in all documents, seen as a complement or an alternative to more conventional techniques. We show that when such distributional methods are used to select expansion terms within Rocchio's classical reweighting scheme, the overall performance is not likely to improve. However, we also show that when the same distributional methods are used to both select and weight expansion terms, the retrieval effectiveness may considerably improve. We then argue, based on their variation in performance on individual queries, that the sets of ranked terms suggested by individual distributional methods can be combined to further improve mean performance, by analogy with ensembling classifiers, and present experimental evidence supporting this view. Taken together, our experiments show that with automatic query expansion it is possible to achieve performance gains as high as 21.34% over the non-expanded query (for non-interpolated average precision). We also discuss the effect that the main parameters involved in automatic query expansion, such as query difficulty, number of selected documents, and number of selected terms, have on retrieval effectiveness.
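As a rough illustration of the distributional idea described in the abstract, the sketch below scores each term by how much more frequent it is in the top retrieved (pseudo-relevant) documents than in the whole collection, then appends the best-scoring terms to the query. The scoring function used here, p_R(t) * log(p_R(t) / p_C(t)), is an assumption chosen for illustration and is not necessarily one of the methods evaluated in the paper. The function names, the toy collection, and the `n_terms` parameter are likewise hypothetical.

```python
import math
from collections import Counter


def score_terms(pseudo_relevant_docs, collection_docs):
    """Score terms by p_R(t) * log(p_R(t) / p_C(t)).

    p_R is the term distribution in the pseudo-relevant documents and
    p_C is the term distribution in the whole collection. This scoring
    function is an illustrative assumption, not the paper's method.
    """
    rel_counts = Counter(t for doc in pseudo_relevant_docs for t in doc)
    col_counts = Counter(t for doc in collection_docs for t in doc)
    rel_total = sum(rel_counts.values())
    col_total = sum(col_counts.values())

    scores = {}
    for term, count in rel_counts.items():
        if col_counts[term] == 0:
            # Term missing from the collection statistics: skip it
            # rather than divide by zero.
            continue
        p_rel = count / rel_total
        p_col = col_counts[term] / col_total
        scores[term] = p_rel * math.log(p_rel / p_col)
    return scores


def expand_query(query_terms, scores, n_terms=2):
    """Append the n_terms best-scoring terms not already in the query."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    new_terms = [t for t in ranked if t not in query_terms][:n_terms]
    return list(query_terms) + new_terms


# Toy collection of tokenized documents (hypothetical data).
collection = [
    ["query", "expansion", "retrieval"],
    ["query", "expansion", "feedback"],
    ["database", "index", "query"],
    ["database", "storage", "index"],
]
# Pretend the first two documents were the top-ranked results.
top_docs = collection[:2]

scores = score_terms(top_docs, collection)
expanded = expand_query(["query"], scores, n_terms=2)
print(expanded)
```

In this toy run, "expansion" outscores "query" because it is concentrated in the top documents, while "query" is spread across the collection. The same scores could also be used as term weights rather than only for selection, which is the distinction the abstract draws.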
Pages: 126 - 141
Number of pages: 16
Related papers
50 in total
  • [31] A New Hybrid Document Clustering for PRF-Based Automatic Query Expansion Approach for Effective IR
    Gupta, Yogesh
    Saini, Ashish
    INTERNATIONAL JOURNAL OF E-COLLABORATION, 2020, 16 (03) : 73 - 95
  • [32] Context Recognition: Towards Automatic Query Generation
    Alirezaie, Marjan
    Pecora, Federico
    Loutfi, Amy
    AMBIENT INTELLIGENCE, AMI 2015, 2015, 9425 : 205 - 218
  • [33] Query exhaustivity, relevance feedback and search success in automatic and interactive query expansion
    Vakkari, P
    Jones, S
    MacFarlane, A
    Sormunen, E
    JOURNAL OF DOCUMENTATION, 2004, 60 (02) : 109 - 127
  • [34] Subjective and objective evaluation of interactive and automatic query expansion
    Shapira, B
    Taieb-Maimon, M
    Nemeth, Y
    ONLINE INFORMATION REVIEW, 2005, 29 (04) : 374 - 390
  • [35] Efficient Association Rules Selecting for Automatic Query Expansion
    Bouziri, Ahlem
    Latiri, Chiraz
    Gaussier, Eric
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II, 2018, 10762 : 563 - 574
  • [36] AN EVALUATION OF AUTOMATIC QUERY EXPANSION IN AN ONLINE LIBRARY CATALOG
    HANCOCKBEAULIEU, M
    WALKER, S
    JOURNAL OF DOCUMENTATION, 1992, 48 (04) : 406 - 421
  • [37] Automatic Term Mismatch Diagnosis for Selective Query Expansion
    Zhao, Le
    Callan, Jamie
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 515 - 524
  • [38] An information-theoretic approach to automatic query expansion
    Carpineto, C
    De Mori, R
    Romano, G
    Bigi, B
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2001, 19 (01) : 1 - 27
  • [39] Specific academic area based automatic query expansion
    Yuan, Yuan
    Zhang, Yong
    Xing, Chunxiao
    2007 2ND INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND APPLICATIONS, VOLS 1 AND 2, 2007, : 612 - 617
  • [40] Towards a "More Declarative" XML Query Language
    Li, Xuhui
    Liu, Mengchi
    Zhang, Yongfa
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT 2, 2010, 6262 : 375 - +