Revisiting dictionary-based compression

被引:26
|
作者
Skibinski, P
Grabowski, S
Deorowicz, S
机构
[1] Tech Univ Lodz, Dept Comp Engn, PL-90924 Lodz, Poland
[2] Univ Wroclaw, Inst Comp Sci, PL-51151 Wroclaw, Poland
[3] Silesian Tech Univ, Inst Comp Sci, PL-44100 Gliwice, Poland
来源
SOFTWARE-PRACTICE & EXPERIENCE | 2005年 / 35卷 / 15期
关键词
lossless data compression; preprocessing; text compression; dictionary compression;
D O I
10.1002/spe.678
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
An attractive way to increase text compression is to replace words with references to a text dictionary given in advance. Although there exist a few works in this area, they do not fully exploit the compression possibilities or consider alternative preprocessing variants for various compressors in the latter phase. In this paper, we discuss several aspects of dictionary-based compression, including compact dictionary representation, and present a PPM/BWCA-oriented scheme, word replacing transformation, achieving compression ratios higher by 2-6% than the state-of-the-art StarNT (2003) text preprocessor, working at a greater speed. We also present an alternative scheme designed for LZ77 compressors, with the advantage over StarNT of reaching up to 14% in combination with gzip. Copyright (c) 2005 John Wiley & Sons, Ltd.
引用
收藏
页码:1455 / 1476
页数:22
相关论文
共 50 条
  • [31] On parsing optimality for dictionary-based text compression-the zip case
    Langiu, Alessio
    JOURNAL OF DISCRETE ALGORITHMS, 2013, 20 : 65 - 70
  • [32] Grayscale true two-dimensional dictionary-based image compression
    Brittain, Nathanael J.
    El-Sakka, Mahmoud R.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2007, 18 (01) : 35 - 44
  • [33] Joint LZW and Lightweight Dictionary-based Compression Techniques for Congested Network
    Kho, Lee Chin
    Tan, Yasuo
    Lim, Yuto
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATIONS, AND CONTROL TECHNOLOGY (I4CT), 2015,
  • [34] The greedy approach to dictionary-based static text compression on a distributed system
    De Agostino, Sergio
    JOURNAL OF DISCRETE ALGORITHMS, 2015, 34 : 54 - 61
  • [35] Dynamic dictionary-based data compression for level-1 caches
    Keramidas, G
    Aisopos, K
    Kaxiras, S
    ARCHITECTURE OF COMPUTING SYSTEMS - ARCS 2006, PROCEEDINGS, 2006, 3894 : 114 - 129
  • [36] Dictionary-based background subtraction
    Sang, N. (nsang@hust.edu.cn), 1600, Huazhong University of Science and Technology (41):
  • [37] A Novel Dictionary-Based Method for Test Data Compression Using Heuristic Algorithm
    Wu, Diancheng
    Li, Jiarui
    Wang, Leiou
    Wang, Donghui
    Hao, Chengpeng
    IEICE TRANSACTIONS ON ELECTRONICS, 2016, E99C (06): : 730 - 733
  • [38] Dictionary-based electric properties tomography
    Hampe, Nils
    Herrmann, Max
    Amthor, Thomas
    Findeklee, Christian
    Doneva, Mariya
    Katscher, Ulrich
    MAGNETIC RESONANCE IN MEDICINE, 2019, 81 (01) : 342 - 349
  • [39] DESIGN AND IMPLEMENTATION OF A DICTIONARY-BASED ARCHIVER
    Radescu, Radu
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2008, 70 (03): : 21 - 28
  • [40] Dictionary-based Order-preserving String Compression for Main Memory Column Stores
    Binnig, Carsten
    Hildenbrand, Stefan
    Faerber, Franz
    ACM SIGMOD/PODS 2009 CONFERENCE, 2009, : 283 - 295