Construction of query concepts based on feature clustering of documents

Cited by: 0
Authors
Youjin Chang
Minkoo Kim
Vijay V. Raghavan
Affiliations
[1] Ajou University, Graduate School of Information and Communication
[2] Ajou University, Department of Information and Computer Engineering
[3] University of Louisiana, The Center for Advanced Computer Studies
Source
Information Retrieval | 2006 / Vol. 9
Keywords
concept-based information retrieval; query reformulation; query concepts
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
In information retrieval, users' information needs are hard to identify. Many approaches address this problem by expanding the initial query and reweighting the terms of the expanded query using users' relevance judgments. Relevance feedback is most effective when users supply relevance information about the retrieved documents, but that information is not always available. An alternative is to expand queries with correlated terms; the main difficulty with this approach is constructing term-term correlations that can be used effectively to improve retrieval performance. In this study, rather than reformulating initial queries using term correlations and/or users' relevance feedback, we construct query concepts that denote users' information needs from a document space. To form query concepts, we extract features from each document and cluster those features into primitive concepts, which are then used to form query concepts. Experiments are performed on the Associated Press (AP) dataset taken from the TREC collection. The experimental evaluation shows that the proposed framework, called QCM (Query Concept Method), outperforms a baseline probabilistic retrieval model on TREC retrieval.
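The pipeline named in the abstract (extract features per document, cluster features into primitive concepts, form query concepts) can be illustrated with a minimal sketch. The abstract does not specify how QCM defines features, clusters them, or combines primitive concepts, so everything below is an assumption made for illustration: features are taken to be each document's top TF-IDF terms, clustering is single-link grouping of features that co-occur in enough documents, and a query concept is the union of the primitive concepts that share a term with the query. The function names `extract_features`, `cluster_features`, and `query_concept` are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def extract_features(docs, top_k=3):
    """Per document, keep the top_k terms by TF-IDF (an assumed feature definition)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    features = []
    for tokens in tokenized:
        tf = Counter(tokens)
        scores = {t: tf[t] * math.log(n_docs / df[t]) for t in tf}
        # Sort by descending score, breaking ties alphabetically for determinism.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        features.append(set(ranked[:top_k]))
    return features


def cluster_features(features, min_cooccur=2):
    """Group features that co-occur in at least min_cooccur documents.

    This is single-link clustering via union-find, chosen here for brevity;
    it is not claimed to be the clustering used in the paper.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    pair_counts = Counter()
    for feats in features:
        for f in feats:
            find(f)  # register every feature, even isolated ones
        pair_counts.update(combinations(sorted(feats), 2))

    for (a, b), count in pair_counts.items():
        if count >= min_cooccur:
            parent[find(a)] = find(b)

    clusters = {}
    for f in list(parent):
        clusters.setdefault(find(f), set()).add(f)
    return list(clusters.values())


def query_concept(query, primitive_concepts):
    """Union of the query terms and every primitive concept sharing a term with them."""
    terms = set(query.lower().split())
    concept = set(terms)
    for pc in primitive_concepts:
        if pc & terms:
            concept |= pc
    return concept


if __name__ == "__main__":
    doc_features = [
        {"stock", "market"},
        {"stock", "market", "trade"},
        {"rain", "storm"},
        {"rain", "storm"},
    ]
    concepts = cluster_features(doc_features)
    print(sorted(query_concept("stock prices", concepts)))
```

In this toy setup the query "stock prices" is enriched with "market" because the two features co-occur in two documents, while "trade" stays in its own primitive concept. A real system would replace each step with the paper's actual definitions and would retrieve against the enriched concept rather than the raw query.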
Pages: 231-248
Page count: 17