ADDING COMPRESSION TO A FULL-TEXT RETRIEVAL-SYSTEM

被引:53
|
作者
ZOBEL, J [1 ]
MOFFAT, A [1 ]
机构
[1] UNIV MELBOURNE,DEPT COMP SCI,PARKVILLE,VIC 3052,AUSTRALIA
来源
SOFTWARE-PRACTICE & EXPERIENCE | 1995年 / 25卷 / 08期
关键词
FULL-TEXT RETRIEVAL; DATA COMPRESSION; TEXT COMPRESSION; HUFFMAN CODING; WORD-BASED MODEL;
D O I
10.1002/spe.4380250804
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We describe the implementation of a data compression scheme as an integral and transparent layer within a full-text retrieval system. Using a semi-static word-based compression model, the space needed to store the text is under 30 per cent of the original requirement. The model is used in conjunction with canonical Huffman coding and together these two paradigms provide fast decompression. Experiments with 500 Mb of newspaper articles show that in full-text retrieval environments compression not only saves space, it can also yield faster query processing - a win-win situation.
引用
收藏
页码:891 / 903
页数:13
相关论文
共 50 条
  • [21] AN EVALUATION OF THE APPLICABILITY OF RANKING ALGORITHMS TO IMPROVE THE EFFECTIVENESS OF FULL-TEXT RETRIEVAL .1. ON THE EFFECTIVENESS OF FULL-TEXT RETRIEVAL
    RO, JS
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1988, 39 (02): : 73 - 78
  • [22] LOCAL FEEDBACK IN FULL-TEXT RETRIEVAL SYSTEMS
    ATTAR, R
    FRAENKEL, AS
    JOURNAL OF THE ACM, 1977, 24 (03) : 397 - 417
  • [23] The Performance Study of Database Full-Text Retrieval
    Wu, Daiwen
    MODERN INDUSTRIAL IOT, BIG DATA AND SUPPLY CHAIN, IIOTBDSC 2020, 2021, 218 : 239 - 247
  • [24] A public library based on full-text retrieval
    Witten, IH
    Nevill-Manning, C
    McNab, R
    Cunningham, SJ
    COMMUNICATIONS OF THE ACM, 1998, 41 (04) : 71 - 75
  • [25] APPLICATION OF FULL-TEXT RETRIEVAL TO LITIGATION SUPPORT
    RUBIN, JS
    FORUM-AMERICAN BAR ASSOCIATION, 1976, 11 (04): : 1136 - 1141
  • [26] The Design and Implementation of Database On the Full-text Retrieval System of Multi Document
    Shi, Hui
    Cai, Jingjing
    MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 5964 - 5967
  • [27] Full-text ETD retrieval in library discovery system: designing a framework
    Sarkar, Prosenjit
    Mukhopadhyay, Parthasarathi
    ANNALS OF LIBRARY AND INFORMATION STUDIES, 2016, 63 (04) : 274 - 288
  • [28] FULL-TEXT COMPUTER RETRIEVAL OF MEDICAL LITERATURE
    LLAURADO, JG
    INTERNATIONAL JOURNAL OF BIO-MEDICAL COMPUTING, 1986, 18 (3-4): : 161 - 163
  • [29] Automated indexing for full-text information retrieval
    Berrios, DC
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, : 71 - 75
  • [30] A Method of Full-text Retrieval Based on Lucene
    Chen, Xiangrong
    Sun, Yong
    Ge, Xiaopei
    Wang, Congwei
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 217 - 220