Pseudo-random number generation for sketch-based estimations

被引:12
|
作者
Rusu, Florin [1 ]
Dobra, Alin [1 ]
机构
[1] Univ Florida, Dept Comp & Informat Sci & Engn, Gainesville, FL 32611 USA
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2007年 / 32卷 / 02期
关键词
algorithms; experimentation; performance; theory; sketches; data synopses; approximate query processing; fast range-summation;
D O I
10.1145/1242524.1242528
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The exact computation of aggregate queries, like the size of join of two relations, usually requires large amounts of memory (constrained in data-streaming) or communication (constrained in distributed computation) and large processing times. In this situation, approximation techniques with provable guarantees, like sketches, are one possible solution. The performance of sketches depends crucially on the ability to generate particular pseudo-random numbers. In this article we investigate both theoretically and empirically the problem of generating k-wise independent pseudo-random numbers and, in particular, that of generating 3- and 4-wise independent pseudo-random numbers that are fast range-summable (i.e., they can be summed in sublinear time). Our specific contributions are: (a) we provide a thorough comparison of the various pseudo-random number generating schemes; (b) we study both theoretically and empirically the fast range-summation property of 3- and 4-wise independent generating schemes; (c) we provide algorithms for the fast range-summation of two 3-wise independent schemes, BCH and extended Hamming; and (d) we show convincing theoretical and empirical evidence that the extended Hamming scheme performs as well as any 4-wise independent scheme for estimating the size of join of two relations using AMS sketches, even though it is only 3-wise independent. We use this scheme to generate estimators that significantly outperform state-of-the-art solutions for two problems, namely, size of spatial joins and selectivity estimation.
引用
收藏
页数:48
相关论文
共 50 条
  • [11] Pseudo-random number generation based on digit isolation referenced to entropy buffers
    Richardson, Joseph D.
    SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2022, 98 (05): : 389 - 406
  • [12] FPGA implementations of cellular automata for pseudo-random number generation
    Anghelescu, Petre
    Sofron, Emil
    Ionita, Silviu
    Ionescu, Laurentiu
    2006 INTERNATIONAL SEMICONDUCTOR CONFERENCE, VOLS 1 AND 2, 2007, : 371 - +
  • [13] Hardware acceleration of pseudo-random number generation for simulation applications
    McCollum, JM
    Lancaster, JM
    Bouldin, DW
    Peterson, GD
    PROCEEDINGS OF THE 35TH SOUTHEASTERN SYMPOSIUM ON SYSTEM THEORY, 2003, : 299 - 303
  • [14] GENERATION OF PSEUDO-RANDOM NUMBERS
    HOWELL, LW
    RHEINFURTH, MH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1984, 75 (02): : 639 - 639
  • [15] Pseudo-Random Number Generator Based on Fuzzy Logic
    Anikin, Igor V.
    Alnajjar, Khaled
    2016 INTERNATIONAL SIBERIAN CONFERENCE ON CONTROL AND COMMUNICATIONS (SIBCON), 2016,
  • [16] Sponge-Based Pseudo-Random Number Generators
    Bertoni, Guido
    Daemen, Joan
    Peeters, Michael
    Van Assche, Gilles
    CRYPTOGRAPHIC HARDWARE AND EMBEDDED SYSTEMS - CHES 2010, 2010, 6225 : 33 - 47
  • [17] Pseudo-random number generators based on the Collatz conjecture
    Xu D.
    Tamir D.E.
    International Journal of Information Technology, 2019, 11 (3) : 453 - 459
  • [18] A Forkcipher-Based Pseudo-Random Number Generator
    Andreeva, Elena
    Weninger, Andreas
    APPLIED CRYPTOGRAPHY AND NETWORK SECURITY, PT II, ACNS 2023, 2023, 13906 : 3 - 31
  • [19] A Horadam-based Pseudo-random Number Generator
    Bagdasar, Ovidiu D.
    Chen, Minsi
    2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, : 227 - 231
  • [20] Stream cipher based on pseudo-random number generation using optical affine transformation
    Sasaki, T
    Togo, H
    Tanida, J
    Ichioka, Y
    OPTICAL STORAGE AND OPTICAL INFORMATION PROCESSING, 2000, 4081 : 114 - 123