Construction of query concepts based on feature clustering of documents

被引:0
|
作者
Youjin Chang
Minkoo Kim
Vijay V. Raghavan
机构
[1] Ajou University,Graduate School of Information and Communication
[2] Ajou University,Department of Information and Computer Engineering
[3] University of Louisiana,The Center for Advanced Computer Studies
来源
Information Retrieval | 2006年 / 9卷
关键词
concept-based information retrieval; query reformulation; query concepts;
D O I
暂无
中图分类号
学科分类号
摘要
In Information Retrieval, since it is hard to identify users’ information needs, many approaches have been tried to solve this problem by expanding initial queries and reweighting the terms in the expanded queries using users’ relevance judgments. Although relevance feedback is most effective when relevance information about retrieved documents is provided by users, it is not always available. Another solution is to use correlated terms for query expansion. The main problem with this approach is how to construct the term-term correlations that can be used effectively to improve retrieval performance. In this study, we try to construct query concepts that denote users’ information needs from a document space, rather than to reformulate initial queries using the term correlations and/or users’ relevance feedback. To form query concepts, we extract features from each document, and then cluster the features into primitive concepts that are then used to form query concepts. Experiments are performed on the Associated Press (AP) dataset taken from the TREC collection. The experimental evaluation shows that our proposed framework called QCM (Query Concept Method) outperforms baseline probabilistic retrieval model on TREC retrieval.
引用
收藏
页码:231 / 248
页数:17
相关论文
共 50 条
  • [21] An Approach to Feature Space Construction from Clustering Feature Tree
    Dudarin, Pavel
    Samokhvalov, Mikhail
    Yarushkina, Nadezhda
    ARTIFICIAL INTELLIGENCE (RCAI 2018), 2018, 934 : 176 - 189
  • [22] Comparison of Feature Extraction Methods for Brazilian Legal Documents Clustering
    Lima, Joao Pedro
    Costa, Jose Alfredo
    Araujo, Diogenes Carlos
    2021 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2021,
  • [23] Learning to rank based on query clustering
    Hua, Gui-Chun
    Zhang, Min
    Liu, Yi-Qun
    Ma, Shao-Ping
    Ru, Li-Yun
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2012, 25 (01): : 118 - 123
  • [24] Constraint-based query clustering
    Ruiz, Carlos
    Menasalvas, Ernestina
    Spiliopoulou, Myra
    ADVANCES IN INTELLIGENT WEB MASTERING, 2007, 43 : 304 - +
  • [25] Query Disambiguation Based on Clustering Techniques
    Kotoula, Panagiota
    Makris, Christos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018, 2018, 520 : 133 - 145
  • [26] An Evolutionary Algorithm for Feature Selective Double Clustering of Text Documents
    Nourashrafeddin, S. N.
    Milios, Evangelos
    Arnold, Dirk V.
    2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 446 - 453
  • [27] EXTRACTION OF CHARACTERS FROM FORM DOCUMENTS BY FEATURE POINT CLUSTERING
    FAN, KC
    LU, JM
    WANG, LS
    LIAO, HY
    PATTERN RECOGNITION LETTERS, 1995, 16 (09) : 963 - 970
  • [28] Correlation Clustering Based on Genetic Algorithm for Documents Clustering
    Zhang, Zhenya
    Cheng, Hongmei
    Chen, Wanli
    Zhang, Shuguang
    Fang, Qiansheng
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 3193 - +
  • [29] Clustering Aggregation Based on Genetic Algorithm for Documents Clustering
    Zhang, Zhenya
    Cheng, Hongmei
    Zhang, Shuguang
    Chen, Wanli
    Fang, Qiansheng
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 3156 - +
  • [30] Feature Weighting in Finding Feedback Documents for Query Expansion in Biomedical Document Retrieval
    Sankhavara J.
    SN Computer Science, 2020, 1 (2)