ADDING COMPRESSION TO A FULL-TEXT RETRIEVAL-SYSTEM

被引:53
|
作者
ZOBEL, J [1 ]
MOFFAT, A [1 ]
机构
[1] UNIV MELBOURNE,DEPT COMP SCI,PARKVILLE,VIC 3052,AUSTRALIA
来源
SOFTWARE-PRACTICE & EXPERIENCE | 1995年 / 25卷 / 08期
关键词
FULL-TEXT RETRIEVAL; DATA COMPRESSION; TEXT COMPRESSION; HUFFMAN CODING; WORD-BASED MODEL;
D O I
10.1002/spe.4380250804
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We describe the implementation of a data compression scheme as an integral and transparent layer within a full-text retrieval system. Using a semi-static word-based compression model, the space needed to store the text is under 30 per cent of the original requirement. The model is used in conjunction with canonical Huffman coding and together these two paradigms provide fast decompression. Experiments with 500 Mb of newspaper articles show that in full-text retrieval environments compression not only saves space, it can also yield faster query processing - a win-win situation.
引用
收藏
页码:891 / 903
页数:13
相关论文
共 50 条
  • [41] INTEGRATION OF MENU RETRIEVAL AND BOOLEAN RETRIEVAL FROM A FULL-TEXT DATABASE
    WATTERS, CR
    SHEPHERD, MA
    GRUNDKE, EW
    BODORIK, P
    ONLINE REVIEW, 1985, 9 (05): : 391 - 401
  • [42] SUPPORT FOR BROWSING IN AN INTELLIGENT TEXT RETRIEVAL-SYSTEM
    THOMPSON, RH
    CROFT, WB
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1989, 30 (06): : 639 - 668
  • [43] FULL-TEXT RETRIEVAL FOR DOCUMENT DELIVERY - A VIABLE OPTION
    GLAVASH, K
    ONLINE, 1994, 18 (03): : 81 - 84
  • [44] Improved Compressed Indexes for Full-Text Document Retrieval
    Belazzougui, Djamal
    Navarro, Gonzalo
    STRING PROCESSING AND INFORMATION RETRIEVAL, 2011, 7024 : 386 - +
  • [45] Expanded information retrieval using full-text searching
    Kostoff, Ronald N.
    JOURNAL OF INFORMATION SCIENCE, 2010, 36 (01) : 104 - 113
  • [46] Subject retrieval from full-text databases in the humanities
    East, John W.
    PORTAL-LIBRARIES AND THE ACADEMY, 2007, 7 (02) : 227 - 241
  • [47] AN EXPERT SYSTEM FOR SEARCHING IN FULL-TEXT
    GAUCH, S
    SMITH, JB
    INFORMATION PROCESSING & MANAGEMENT, 1989, 25 (03) : 253 - 263
  • [48] Expert system for searching in full-text
    Gauch, Susan, 1600, (25):
  • [49] A COMPARISON OF INDEXING AND FULL-TEXT FOR THE RETRIEVAL OF CLINICAL MEDICAL LITERATURE
    SIEVERT, M
    MCKININ, EJ
    SLOUGH, M
    PROCEEDINGS OF THE ASIS ANNUAL MEETING, 1988, 25 : 143 - 146
  • [50] Using Syllables As Indexing Terms in Full-Text Information Retrieval
    Kettunen, Kimmo
    Mcnamee, Paul
    Baskaya, Feza
    HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, 2010, 219 : 225 - 232