Breaking the curse of cardinality on bitmap indexes

Authors
Wu, Kesheng [1]
Stockinger, Kurt [1]
Shoshani, Arie [1]
Affiliations
[1] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
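Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract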
Bitmap indexes are known to be efficient for ad hoc range queries that are common in data warehousing and scientific applications. However, they suffer from the curse of cardinality, that is, their efficiency deteriorates as attribute cardinalities increase. A number of strategies have been proposed, but none of them addresses the problem adequately. In this paper, we propose a novel binned bitmap index that greatly reduces the cost of answering queries, and therefore breaks the curse of cardinality. The key idea is to augment the binned index with an Order-preserving Bin-based Clustering (OrBiC) structure. This data structure significantly reduces the I/O operations needed to resolve records that cannot be resolved with the bitmaps. To further improve the proposed index structure, we also present a strategy to create single-valued bins for frequent values. This strategy reduces index sizes and improves query processing speed. Overall, the binned indexes with OrBiC greatly improve query processing speed, and are 3 to 25 times faster than the best available indexes for high-cardinality data.
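To make the idea in the abstract concrete, the following is a minimal Python sketch of a binned bitmap index in which the raw values are also stored grouped by bin. It is an illustration built only from the abstract's description, not the authors' OrBiC implementation: the class name `BinnedIndex`, the `edges` parameter, the in-memory `clusters` lists, and the use of Python integers as bitmaps are all assumptions. Bins that lie entirely inside the query range are answered from their bitmaps alone; only the bins that straddle a range boundary need a candidate check, and because each bin's values are stored together, that check touches one contiguous group rather than scattered records.

```python
from bisect import bisect_right


class BinnedIndex:
    """Toy binned bitmap index with values clustered by bin (hypothetical layout)."""

    def __init__(self, values, edges):
        # edges: sorted bin boundaries; bin i covers [edges[i], edges[i + 1]).
        # Assumption: every value satisfies edges[0] <= value < edges[-1].
        self.edges = edges
        nbins = len(edges) - 1
        self.bitmaps = [0] * nbins                  # Python ints as bitmaps; bit r = row r
        self.clusters = [[] for _ in range(nbins)]  # (row, value) pairs, grouped per bin
        for row, v in enumerate(values):
            b = bisect_right(edges, v) - 1
            self.bitmaps[b] |= 1 << row
            self.clusters[b].append((row, v))

    def query(self, lo, hi):
        """Return the sorted row ids whose value v satisfies lo <= v < hi."""
        hits = 0
        for b in range(len(self.bitmaps)):
            b_lo, b_hi = self.edges[b], self.edges[b + 1]
            if lo <= b_lo and b_hi <= hi:
                # Bin lies fully inside the range: the bitmap alone answers it.
                hits |= self.bitmaps[b]
            elif b_lo < hi and lo < b_hi:
                # Edge bin: candidate check, reading only this bin's cluster.
                for row, v in self.clusters[b]:
                    if lo <= v < hi:
                        hits |= 1 << row
        return [r for r in range(hits.bit_length()) if (hits >> r) & 1]


idx = BinnedIndex([5, 17, 23, 8, 31, 12, 29, 3], edges=[0, 10, 20, 30, 40])
print(idx.query(8, 25))
```

In this sketch the query `[8, 25)` resolves the bin `[10, 20)` from its bitmap and performs candidate checks only on the two edge bins `[0, 10)` and `[20, 30)`. An on-disk variant would store each cluster contiguously so that a candidate check costs one sequential read per edge bin, which is the kind of I/O saving the abstract attributes to OrBiC.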
Pages: 348-365
Number of pages: 18