More accurate streaming cardinality estimation with vectorized counters

Cited by: 1
Authors
Bruschi, Valerio [1]
Reviriego, Pedro [2]
Pontarelli, Salvatore [3]
Ting, Daniel [4]
Bianchi, Giuseppe [1]
Affiliations
[1] Bruschi, Valerio
[2] Reviriego, Pedro
[3] Pontarelli, Salvatore
[4] Ting, Daniel
[5] Bianchi, Giuseppe
Source
Bruschi, Valerio (valerio.bruschi@uniroma2.it) | Year 1600 / Vol. Institute of Electrical and Electronics Engineers Inc. / Issue 03
DOI
10.1109/LNET.2021.3076048
Abstract
Cardinality estimation, also known as count-distinct, is the problem of finding the number of different elements in a set with repeated elements. Among the many approximate algorithms proposed for this task, HyperLogLog (HLL) has established itself as the state of the art due to its ability to accurately estimate cardinality over a large range of values using a small memory footprint. When elements arrive in a stream, as in the case of most networking applications, improved techniques are possible. We specifically propose a new algorithm that improves the accuracy of cardinality estimation by grouping counters, and by using their new organization to further track all updates within a given counter size range (compared with just the last update as in the standard HLL). Results show that when using the same number of counters, one configuration of the new scheme reduces the relative error by approximately 0.86x using the same amount of memory as the streaming HLL and another configuration achieves a similar accuracy reducing the memory needed by approximately 0.85x. © 2019 IEEE.
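For orientation, the HLL baseline that the abstract compares against can be sketched as below. This is a minimal illustration under stated assumptions, not the grouped-counter scheme proposed by the authors: the class name `HyperLogLogSketch`, the default `p=12`, the BLAKE2b hash, and the martingale-style running estimate used here as a stand-in for "streaming HLL" are all choices made for this example.

```python
import hashlib
import math


class HyperLogLogSketch:
    """Illustrative HLL with an additional running (streaming) estimate."""

    HASH_BITS = 64

    def __init__(self, p=12):
        self.p = p                      # index bits (assumed default)
        self.m = 1 << p                 # number of counters
        self.registers = [0] * self.m
        # Standard bias-correction constant, valid for m >= 128.
        self.alpha = 0.7213 / (1.0 + 1.079 / self.m)
        # Streaming state: running estimate, and the probability that a
        # previously unseen element would change some counter.
        self.running = 0.0
        self.q = 1.0

    def _hash(self, item):
        # 64-bit hash of the item's string form (hash choice is an assumption).
        digest = hashlib.blake2b(str(item).encode("utf-8"), digest_size=8).digest()
        return int.from_bytes(digest, "big")

    def add(self, item):
        h = self._hash(item)
        width = self.HASH_BITS - self.p
        idx = h >> width                    # top p bits select the counter
        rest = h & ((1 << width) - 1)       # remaining bits give the rank
        rank = width - rest.bit_length() + 1  # position of the leftmost 1-bit
        old = self.registers[idx]
        if rank > old:
            # Streaming update: credit 1/q for this change, then lower q.
            self.running += 1.0 / self.q
            self.q -= (2.0 ** -old - 2.0 ** -rank) / self.m
            self.registers[idx] = rank

    def estimate_hll(self):
        # Classic estimate computed from the final counter values only.
        z = sum(2.0 ** -r for r in self.registers)
        raw = self.alpha * self.m * self.m / z
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros > 0:
            # Small-range correction (linear counting).
            return self.m * math.log(self.m / zeros)
        return raw


if __name__ == "__main__":
    sketch = HyperLogLogSketch(p=12)
    for i in range(50000):
        sketch.add(f"flow-{i}")
        sketch.add(f"flow-{i}")  # repeated elements leave the sketch unchanged
    print("classic HLL estimate:", round(sketch.estimate_hll()))
    print("streaming estimate:  ", round(sketch.running))
```

The classic estimate depends only on the final counter values, whereas the running estimate uses the order of arrival, which is why it is only available when elements are processed as a stream.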
Pages: 75-79