Deplump for Streaming Data

被引:0
|
作者
Bartlett, Nicholas [1 ]
Wood, Frank [1 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
关键词
D O I
10.1109/DCC.2011.43
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present a general-purpose, lossless compressor for streaming data. This compressor is based on the deplump probabilistic compressor for batch data. Approximations to the inference procedure used in the probabilistic model underpinning deplump are introduced that yield the computational asyptotics necessary for stream compression. We demonstrate the performance of this streaming deplump variant relative to the batch compressor on a benchmark corpus and find that it performs equivalently well despite these approximations. We also explore the performance of the streaming variant on corpora that are too large to be compressed by batch deplump and demonstrate excellent compression performance.
引用
收藏
页码:363 / 372
页数:10
相关论文
共 50 条
  • [41] The streaming data management challenge
    Wallace, K
    Shepherd, K
    SEA TECHNOLOGY, 2003, 44 (02) : 37 - +
  • [42] Sampling Streaming data with replacement
    Park, Byung-Hoon
    Ostrouchov, George
    Samatova, Nagiza F.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (02) : 750 - 762
  • [43] Private searching on streaming data
    Ostrovsky, Rafail
    Skeith, William E., III
    JOURNAL OF CRYPTOLOGY, 2007, 20 (04) : 397 - 430
  • [44] Streaming Massive Electric Power Data Analysis Based on Spark Streaming
    Zhang, Xudong
    Qian, Zhongwen
    Shen, Siqi
    Shi, Jia
    Wang, Shujun
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 200 - 212
  • [45] A data and query model for streaming geospatial image data
    Gertz, Michael
    Hart, Quinn
    Rueda, Carlos
    Singhal, Shefali
    Zhang, Jie
    CURRENT TRENDS IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 4254 : 687 - 699
  • [46] Dynamic data assigning assessment clustering of streaming data
    Georgieva, O.
    Klawonn, F.
    APPLIED SOFT COMPUTING, 2008, 8 (04) : 1305 - 1313
  • [47] Medical Data Opinion Retrieval on Twitter Streaming Data
    Sindhura, Vemuri
    Sandeep, Y.
    2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES, 2015,
  • [48] Data streaming architecture for visualizing cryptocurrency temporal data
    Bandi A.
    Lecture Notes on Data Engineering and Communications Technologies, 2021, 66 : 651 - 661
  • [49] SDPP: Streaming Data Payment Protocol for Data Economy
    Radhakrishnan, Rahul
    Ramachandran, Gowri Sankar
    Krishnamachari, Bhaskar
    2019 IEEE INTERNATIONAL CONFERENCE ON BLOCKCHAIN AND CRYPTOCURRENCY (ICBC), 2019, : 17 - 18
  • [50] An Anonymous Data Publishing Framework for Streaming Genomic Data
    Wu, Xiang
    Wang, Huanhuan
    Wei, Yuyang
    Mao, Yaqing
    Jiang, Shuguang
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2018, 8 (03) : 546 - 554