Construction of query concepts based on feature clustering of documents

被引:0
|
作者
Youjin Chang
Minkoo Kim
Vijay V. Raghavan
机构
[1] Ajou University,Graduate School of Information and Communication
[2] Ajou University,Department of Information and Computer Engineering
[3] University of Louisiana,The Center for Advanced Computer Studies
来源
Information Retrieval | 2006年 / 9卷
关键词
concept-based information retrieval; query reformulation; query concepts;
D O I
暂无
中图分类号
学科分类号
摘要
In Information Retrieval, since it is hard to identify users’ information needs, many approaches have been tried to solve this problem by expanding initial queries and reweighting the terms in the expanded queries using users’ relevance judgments. Although relevance feedback is most effective when relevance information about retrieved documents is provided by users, it is not always available. Another solution is to use correlated terms for query expansion. The main problem with this approach is how to construct the term-term correlations that can be used effectively to improve retrieval performance. In this study, we try to construct query concepts that denote users’ information needs from a document space, rather than to reformulate initial queries using the term correlations and/or users’ relevance feedback. To form query concepts, we extract features from each document, and then cluster the features into primitive concepts that are then used to form query concepts. Experiments are performed on the Associated Press (AP) dataset taken from the TREC collection. The experimental evaluation shows that our proposed framework called QCM (Query Concept Method) outperforms baseline probabilistic retrieval model on TREC retrieval.
引用
收藏
页码:231 / 248
页数:17
相关论文
共 50 条
  • [1] Construction of query concepts based on feature clustering of documents
    Chang, Youjin
    Kim, Minkoo
    Raghavan, Vijay V.
    INFORMATION RETRIEVAL, 2006, 9 (03): : 231 - 248
  • [2] Approximate XML query algorithm based on clustering of XML documents
    School of Electronics and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
    Jisuanji Gongcheng, 2006, 15 (52-54):
  • [3] Improve query performance by clustering XML documents
    Wang, L
    Cheung, DW
    Mamoulis, N
    Yiu, SM
    INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 6, POST-CONFERENCE ISSUE, PROCEEDINGS, 2004, : 329 - 334
  • [4] Contextual Query based on Segmentation and Clustering of Selected Documents for Acquiring Web Documents for Supporting Knowledge Management
    Prates, Joao C.
    Siqueira, Sean S. M.
    AMCIS 2011 PROCEEDINGS, 2011,
  • [5] Feature- and query-based table of contents generation for XML documents
    Szlavik, Zoltan
    Tombros, Anastasios
    Lalmas, Mounia
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 456 - +
  • [6] Automatic clustering of construction project documents based on textual similarity
    Al Qady, Mohammed
    Kandil, Amr
    AUTOMATION IN CONSTRUCTION, 2014, 42 : 36 - 49
  • [7] Diversifying Query Suggestions Based on Query Documents
    Kim, Youngho
    Croft, W. Bruce
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 891 - 894
  • [8] Construction of query concepts in a document space based on data mining techniques
    Chang, Y
    Kim, M
    Ounis, I
    FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2004, 3055 : 137 - 149
  • [9] A New Multimedia Documents Clustering Approach based on Feature Patterns Similarity
    Pushpalatha, K.
    Ananthanarayana, V. S.
    2017 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2017, : 296 - 299
  • [10] Exploring Concepts' Semantic Relations for Clustering-Based Query Senses Disambiguation
    Chen, Yan
    Zhang, Yan-Qing
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2009, 5589 : 674 - 681