Distributed Wavelet Thresholding for Maximum Error Metrics

被引:4
|
作者
Mytilinis, Ioannis [1 ]
Tsoumakos, Dimitrios [2 ]
Koziris, Nectarios [1 ]
机构
[1] Natl Tech Univ Athens, Comp Syst Lab, Athens, Greece
[2] Ionian Univ, Dept Informat, Corfu, Greece
关键词
D O I
10.1145/2882903.2915230
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern data analytics involve simple and complex computations over enormous numbers of data records. The volume of data and the increasingly stringent response-time requirements place increasing emphasis on the efficiency of approximate query processing. A major challenge over the past years has been the efficient construction of fixed-space synopses that provide a deterministic quality guarantee, often expressed in terms of a maximum error metric. For data reduction, wavelet decomposition has proved to be a very effective tool, as it can successfully approximate sharp discontinuities and provide accurate answers to queries. However, existing polynomial time wavelet thresholding schemes that minimize maximum error metrics are constrained with impractical time and space complexities for large datasets. In order to provide a practical solution to the problem, we develop parallel algorithms that take advantage of key properties of the wavelet decomposition and allocate tasks to multiple workers. To that end, we present i) a general framework for the parallelization of existing dynamic programming algorithms, ii) a parallel version of one such DP-based algorithm and iii) a new parallel greedy algorithm for the problem. To the best of our knowledge, this is the first attempt to scale algorithms for wavelet thresholding for maximum error metrics via a state-of-the-art distributed run-time. Our extensive experiments on both real and synthetic datasets over Hadoop show that the proposed algorithms achieve linear scalability and superior running-time performance compared to their centralized counterparts. Furthermore, our distributed greedy algorithm outperforms the distributed version of the current state-of-the-art dynamic programming algorithm by 2 to 4 times, without compromising the quality of results.
引用
收藏
页码:663 / 677
页数:15
相关论文
共 50 条
  • [31] Deconvolution by thresholding in mirror wavelet bases
    Kalifa, J
    Mallat, S
    Rougé, B
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2003, 12 (04) : 446 - 457
  • [32] On posterior distribution of Bayesian wavelet thresholding
    Lian, Heng
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2011, 141 (01) : 318 - 324
  • [33] Posterior probability intervals for wavelet thresholding
    Barber, S
    Nason, GP
    Silverman, BW
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2002, 64 : 189 - 205
  • [34] Generalized cross validation for wavelet thresholding
    Jansen, M
    Malfait, M
    Bultheel, A
    SIGNAL PROCESSING, 1997, 56 (01) : 33 - 44
  • [35] Generalized cross validation for wavelet thresholding
    Katholieke Universiteit Leuven, Heverlee, Belgium
    Signal Process, 1 (33-44):
  • [36] Overview on thresholding functions of wavelet shrinkage
    Zhang Yi
    Li Jianping
    Wang Senhua
    Xiao Shucheng
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 102 - 103
  • [37] Multinomial Probability Estimation by Wavelet Thresholding
    Dong, Jianping
    Jiang, Renfang
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2009, 38 (09) : 1486 - 1507
  • [38] Wavelet thresholding based on image denoising
    Keita, A.
    Peng, J.
    Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 2001, 29 (06): : 13 - 15
  • [39] Denoising of infrared images by wavelet thresholding
    Wippig, Dietmar
    Klauer, Bernd
    Zeidler, Hans Christoph
    ADVANCES IN COMPUTER, INFORMATION, AND SYSTEMS SCIENCES AND ENGINEERING, 2006, : 103 - +
  • [40] Norm thresholding method in wavelet regression
    Wu, DF
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2002, 72 (03) : 233 - 245