Construction of query concepts based on feature clustering of documents

被引：0

作者：

Youjin Chang

Minkoo Kim

Vijay V. Raghavan

机构：

[1] Ajou University,Graduate School of Information and Communication

[2] Ajou University,Department of Information and Computer Engineering

[3] University of Louisiana,The Center for Advanced Computer Studies

来源：

Information Retrieval | 2006年 / 9卷

关键词：

concept-based information retrieval; query reformulation; query concepts;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In Information Retrieval, since it is hard to identify users’ information needs, many approaches have been tried to solve this problem by expanding initial queries and reweighting the terms in the expanded queries using users’ relevance judgments. Although relevance feedback is most effective when relevance information about retrieved documents is provided by users, it is not always available. Another solution is to use correlated terms for query expansion. The main problem with this approach is how to construct the term-term correlations that can be used effectively to improve retrieval performance. In this study, we try to construct query concepts that denote users’ information needs from a document space, rather than to reformulate initial queries using the term correlations and/or users’ relevance feedback. To form query concepts, we extract features from each document, and then cluster the features into primitive concepts that are then used to form query concepts. Experiments are performed on the Associated Press (AP) dataset taken from the TREC collection. The experimental evaluation shows that our proposed framework called QCM (Query Concept Method) outperforms baseline probabilistic retrieval model on TREC retrieval.

引用

页码：231 / 248

页数：17

共 50 条

[41] Continuous query scheduler based on operators clustering
Soliman, M. Sami
Tan Guan-zheng
JOURNAL OF CENTRAL SOUTH UNIVERSITY OF TECHNOLOGY, 2011, 18 (03): : 782 - 790
[42] Feature-based spatial query language
Song, Jingang
Zhang, Dalu
Jisuanji Gongcheng/Computer Engineering, 2000, 26 (01): : 49 - 50
[43] Semantic annotation of documents based on wikipedia concepts
Brank, Janez
Leban, Gregor
Grobelnik, Marko
Informatica (Slovenia), 2018, 42 (01): : 23 - 32
[44] Semantic Annotation of Documents Based on Wikipedia Concepts
Brank, Janez
Leban, Gregor
Grobelnik, Marko
INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2018, 42 (01): : 23 - 32
[45] A study on searching for similar documents based on multiple concepts and distribution of concepts
Weng, SS
Lin, YJ
EXPERT SYSTEMS WITH APPLICATIONS, 2003, 25 (03) : 355 - 368
[46] XML Documents Clustering based on Representative Path
Kim, Woosaeng
PROCEEDINGS OF THE 13TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS, 2009, : 108 - +
[47] Clustering XML Documents for Web Based Learning
Periakaruppan, Ramanathan
Nadarajan, Rethinaswamy
ADVANCES IN WEB-BASED LEARNING, 2015, 8390 : 234 - 243
[48] Clustering web documents based on knowledge granularity
Huang, FL
Zhang, SC
FRONTIERS OF WWW RESEARCH AND DEVELOPMENT - APWEB 2006, PROCEEDINGS, 2006, 3841 : 85 - 96
[49] Clustering XML Documents based on Data Type
Zhou, Chong
Lu, Yansheng
2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, VOLS 1 AND 2, PROCEEDINGS, 2008, : 685 - 690
[50] Clustering XML documents based on structural similarity
Xing, Guangming
Xia, Zhonghang
Guo, Jinhua
ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 905 - +

← 1 2 3 4 5 →