Query expansion based on clustering and personalized information retrieval

被引:0
|
作者
Hamid Khalifi
Walid Cherif
Abderrahim El Qadi
Youssef Ghanou
机构
[1] Moulay Ismail University,TIM Team, High School of Technology
[2] National Institute of Statistics and Applied Economics,SI2M Laboratory
[3] Mohammed V University,High School of Technology
来源
关键词
Information retrieval; Personalized information retrieval; Automatic query completion; Clustering; Performance evaluation; Support vector machines;
D O I
暂无
中图分类号
学科分类号
摘要
Information retrieval systems are used to describe a variety of processes involving the delivery of information to people who need it. Although several mathematical approaches have been studied in order to formalize the main components of an information retrieval system: queries representation, information items representations and the retrieval process, such systems still face many difficulties to extract relevant information for users especially when the processed data are texts. This is due to the complex nature of text databases. Generally, an information retrieval system reformulates queries according to associations among information items before matching them to dataset items. In this sense, semantic relationships or machine learning techniques can be applied to refine the returned results. This paper presents a formal model to organize data, and a new search algorithm to browse it. It incorporates a natural language preprocessing stage, a statistical representation of short documents and queries and a machine learning model to select relevant results. We propose later in this paper two further optimizations that proved quite interesting and returned significantly satisfying results on two datasets in a reasonable computation time. The first optimization concerns queries expansions, while the second one concerns dataset restructuration. Thus, we formally evaluate the impact of each optimization by computing the performance of the information retrieval system with and without it; the highest reached recall and precision were 96.2% and 99.2%, respectively.
引用
收藏
页码:241 / 251
页数:10
相关论文
共 50 条
  • [21] Query Expansion for Effective Geographic Information Retrieval
    Pu, Qiang
    He, Daqing
    Li, Qi
    EVALUATING SYSTEMS FOR MULTILINGUAL AND MULTIMODAL INFORMATION ACCESS, 2009, 5706 : 843 - +
  • [22] Query expansion for intelligent information retrieval on Internet
    Lim, JH
    Seung, HW
    Hwang, J
    Kim, YC
    Kim, HN
    1997 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 656 - 662
  • [24] An information retrieval system based on automatic query expansion and Hopfield network
    Sheng, XW
    Jiang, MH
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 1624 - 1627
  • [25] Design and implementation of ontology-based query expansion for information retrieval
    Fang Wu
    Guoshi Wu
    Xangling Fu
    RESEARCH AND PRACTICAL ISSUES OF ENTERPRISE INFORMATION SYSTEMS II, VOL 1, 2008, 254 : 293 - +
  • [26] An information retrieval system based on automatic query expansion and hopfield network
    Wang, Lin
    Jiang, Minghu
    Sheng, Xiaowei
    Lu, Yinghua
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 1519 - 1524
  • [27] An improved VSM based information retrieval system and fuzzy query expansion
    Wu, JN
    Tanioka, H
    Wang, SZ
    Pan, DH
    Yamamoto, K
    Wang, ZT
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 1, PROCEEDINGS, 2005, 3613 : 537 - 546
  • [28] Query Expansion based on Word Embeddings and Ontologies for Efficient Information Retrieval
    Rastogi, Namrata
    Verma, Parul
    Kumar, Pankaj
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 367 - 373
  • [29] Design and implementation of ontology-based query expansion for information retrieval
    School of Software Engineering, Beijing University of Posts and Telecommunications, Beijing
    100879, China
    不详
    061001, China
    IFIP Advances in Information and Communication Technology, 2007, (293-298)
  • [30] RESEARCH ON THE WEB INFORMATION RETRIEVAL MODEL BASED ON METADATA AND QUERY EXPANSION
    Hu, Changxia
    Liu, Xiaoxing
    Jin, Weiying
    2009 IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT, PROCEEDINGS, 2009, : 384 - +