Deplump for Streaming Data

Authors
Bartlett, Nicholas [1]
Wood, Frank [1]
Affiliations
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
DOI
10.1109/DCC.2011.43
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
We present a general-purpose, lossless compressor for streaming data. This compressor is based on the deplump probabilistic compressor for batch data. Approximations to the inference procedure used in the probabilistic model underpinning deplump are introduced that yield the computational asymptotics necessary for stream compression. We demonstrate the performance of this streaming deplump variant relative to the batch compressor on a benchmark corpus and find that it performs equivalently well despite these approximations. We also explore the performance of the streaming variant on corpora that are too large to be compressed by batch deplump and demonstrate excellent compression performance.
Pages: 363-372
Page count: 10