Construction of query concepts based on feature clustering of documents

被引:0
|
作者
Youjin Chang
Minkoo Kim
Vijay V. Raghavan
机构
[1] Ajou University,Graduate School of Information and Communication
[2] Ajou University,Department of Information and Computer Engineering
[3] University of Louisiana,The Center for Advanced Computer Studies
来源
Information Retrieval | 2006年 / 9卷
关键词
concept-based information retrieval; query reformulation; query concepts;
D O I
暂无
中图分类号
学科分类号
摘要
In Information Retrieval, since it is hard to identify users’ information needs, many approaches have been tried to solve this problem by expanding initial queries and reweighting the terms in the expanded queries using users’ relevance judgments. Although relevance feedback is most effective when relevance information about retrieved documents is provided by users, it is not always available. Another solution is to use correlated terms for query expansion. The main problem with this approach is how to construct the term-term correlations that can be used effectively to improve retrieval performance. In this study, we try to construct query concepts that denote users’ information needs from a document space, rather than to reformulate initial queries using the term correlations and/or users’ relevance feedback. To form query concepts, we extract features from each document, and then cluster the features into primitive concepts that are then used to form query concepts. Experiments are performed on the Associated Press (AP) dataset taken from the TREC collection. The experimental evaluation shows that our proposed framework called QCM (Query Concept Method) outperforms baseline probabilistic retrieval model on TREC retrieval.
引用
收藏
页码:231 / 248
页数:17
相关论文
共 50 条
  • [11] SQL query construction from database concepts
    Gorskis, Henrihs
    2018 59TH INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT SCIENCE OF RIGA TECHNICAL UNIVERSITY (ITMS), 2018,
  • [12] Autoencoder-based feature construction for IoT attacks clustering
    Haseeb, Junaid
    Mansoori, Masood
    Hirose, Yuichi
    Al-Sahaf, Harith
    Welch, Ian
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 127 : 487 - 502
  • [13] Approximate query algorithm based on eight-neighbor grid clustering for heterogeneous XML documents
    Heng, Xingchen
    Luo, Junjie
    Guo, Junwen
    Qin, Zheng
    Shao, Liping
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2007, 41 (08): : 907 - 911
  • [14] FEATURE CLUSTERING FOR PSO-BASED FEATURE CONSTRUCTION ON HIGH-DIMENSIONAL DATA
    Swesi, Idheba Mohamad Ali Omer
    Abu Bakar, Azuraliza
    JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2019, 18 (04): : 439 - 472
  • [15] Finding target and constraint concepts for XML query construction
    Gan, Keng Hoon
    Phang, Keat Keong
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2015, 11 (04) : 468 - 490
  • [16] A feature point clustering approach to the recognition of form documents
    Fan, KC
    Lu, JM
    Chen, GD
    PATTERN RECOGNITION, 1998, 31 (09) : 1205 - 1220
  • [17] Multi-view Construction for Clustering Based on Feature set Partitioning
    Chang, Xiaojing
    Yang, Yan
    Wang, Hongjun
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [18] Feature Selection and Clustering of Documents Using Random Feature Set Generation Technique
    Christy, A.
    Gandhi, G. Meera
    ADVANCES IN DATA SCIENCE AND MANAGEMENT, 2020, 37 : 67 - 79
  • [19] Using Feature Clustering for GP-Based Feature Construction on High-Dimensional Data
    Binh Tran
    Xue, Bing
    Zhang, Mengjie
    GENETIC PROGRAMMING, EUROGP 2017, 2017, 10196 : 210 - 226
  • [20] Structural query expansion based on weighted query term for XML documents
    School of Information and Technology, Jiangxi University of Finance and Economics, Nanchang 330013, China
    不详
    Ruan Jian Xue Bao/Journal of Software, 2008, 19 (10): : 2611 - 2619