Using proximity and tag weights for focused retrieval in structured documents

Cited by: 2
Authors
Beigbeder, Michel [1]
Gery, Mathias [2]
Largeron, Christine [2]
Affiliations
[1] Ecole Natl Super Mines, F-42023 St Etienne, France
[2] Univ Lyon, St Etienne, France
Keywords
Focused information retrieval; Structured information retrieval; Proximity; XML; Tags; Term proximity
DOI
10.1007/s10115-014-0767-6
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Focused information retrieval is concerned with the retrieval of small units of information. In this context, both the structure of the documents and the proximity among query terms have been found useful for improving retrieval effectiveness. In this article, we propose an approach that combines the proximity of the terms with the tags that mark these terms. Our approach is based on a Fetch and Browse method in which the fetch step is performed with BM25 and the browse step with a structure-enhanced proximity model. In this way, the ranking of a document depends not only on the presence of the query terms within the document but also on the tags that mark these terms. Thus, a document tends to be ranked as highly relevant when query terms are close together and are emphasized by tags. The evaluation of this model on a large XML-structured collection provided by the INEX 2010 XML IR evaluation campaign shows that the use of term proximity and structure improves the retrieval effectiveness of BM25 in the context of focused information retrieval.
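The two-step Fetch and Browse idea described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the model's formulas, so everything specific here is an assumption made for illustration: the function names (`bm25_scores`, `proximity_score`, `fetch_and_browse`), the triangular influence function with window `k`, the multiplicative use of tag weights, and the use of `min` to combine query terms. It is not the authors' implementation.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Fetch step: score tokenized documents with a common BM25 variant."""
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    df = Counter(t for d in docs for t in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        s = 0.0
        for t in query:
            if tf[t] == 0:
                continue
            idf = math.log(1 + (n - df[t] + 0.5) / (df[t] + 0.5))
            norm = tf[t] + k1 * (1 - b + b * len(d) / avgdl)
            s += idf * tf[t] * (k1 + 1) / norm
        scores.append(s)
    return scores


def proximity_score(query, doc, tag_weights, k=5):
    """Browse step (assumed form): tag-weighted proximity of query terms.

    `doc` is a list of (term, tag) pairs. Each occurrence of a query term
    spreads a triangular influence of half-width k, scaled by the weight of
    its tag. At each position the per-term influences are combined with min,
    so a position only counts if every query term is nearby.
    """
    if not query:
        return 0.0
    total = 0.0
    for x in range(len(doc)):
        per_term = []
        for t in query:
            best = 0.0
            for i, (term, tag) in enumerate(doc):
                if term == t:
                    w = tag_weights.get(tag, 1.0)  # untagged or unknown tags weigh 1.0
                    best = max(best, w * max(0.0, (k - abs(x - i)) / k))
            per_term.append(best)
        total += min(per_term)
    return total


def fetch_and_browse(query, docs, tag_weights, top_n=10, k=5):
    """Fetch the top_n documents by BM25, then rerank them by proximity."""
    terms_only = [[term for term, _ in d] for d in docs]
    bm25 = bm25_scores(query, terms_only)
    fetched = sorted(range(len(docs)), key=lambda i: bm25[i], reverse=True)[:top_n]
    return sorted(
        fetched,
        key=lambda i: proximity_score(query, docs[i], tag_weights, k),
        reverse=True,
    )


# Hypothetical toy data: query terms far apart in plain text versus
# adjacent inside a "title" tag.
far = [("xml", "p")] + [(w, "p") for w in "abcdefghij"] + [("retrieval", "p")]
close = [("xml", "title"), ("retrieval", "title"), ("is", "p"), ("fun", "p")]
ranking = fetch_and_browse(["xml", "retrieval"], [far, close], {"title": 2.0}, top_n=2)
```

In this toy setup both documents contain both query terms, so the fetch step retrieves both; the browse step then favors the document whose query terms are adjacent and marked by the more heavily weighted tag.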
Pages: 51-76
Number of pages: 26