Breaking the curse of cardinality on bitmap indexes

被引:0
|
作者
Wu, Kesheng [1 ]
Stockinger, Kurt [1 ]
Shoshani, Arie [1 ]
机构
[1] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
来源
SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS | 2008年 / 5069卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bitmap indexes are known to be efficient for ad-hoe range queries that are common in data warehousing and scientific applications. However, they suffer from the curse of cardinality, that is, their efficiency deteriorates as attribute cardinalities increase. A number of strategies have been proposed, but none of them addresses the problem adequately. In this paper, we propose a novel binned bitmap index that greatly reduces the cost to answer queries, and therefore breaks the curse of cardinality. The key idea is to augment the binned index with an Order-preserving Bin-based Clustering (OrBiC) structure. This data structure significantly reduces the I/O operations needed to resolve records that can not be resolved with the bitmaps. To further improve the proposed index structure, we also present a strategy to create single-valued bins for frequent values. This strategy reduces index sizes and improves query processing speed. Overall, the binned indexes with OrBiC great improves the query processing speed, and are 3 - 25 times faster than the best available indexes for high-cardinality data.
引用
收藏
页码:348 / 365
页数:18
相关论文
共 50 条
  • [21] Breaking the curse of dimensionality in nonparametric testing
    Lavergne, Pascal
    Patilea, Valentin
    JOURNAL OF ECONOMETRICS, 2008, 143 (01) : 103 - 122
  • [22] Breaking the Winner's Curse in Outsourcing
    Jiang, Bin
    Talluri, Srinivas
    Yao, Tao
    Moon, Yongma
    DECISION SCIENCES, 2010, 41 (03) : 573 - 594
  • [23] Curse of Dimensionality in Pivot-based Indexes
    Volnyansky, Ilya
    Pestov, Vladimir
    SISAP 2009: 2009 SECOND INTERNATIONAL WORKSHOP ON SIMILARITY SEARCH AND APPLICATIONS, PROCEEDINGS, 2009, : 39 - 46
  • [24] Real-time creation of bitmap indexes on streaming network data
    Francesco Fusco
    Michail Vlachos
    Marc Ph. Stoecklin
    The VLDB Journal, 2012, 21 : 287 - 307
  • [25] A linear programming approach for bitmap join indexes selection in data warehouses
    Toumi, Lyazid
    Moussaoui, Abdelouahab
    Ugur, Ahmet
    6TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2015), THE 5TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2015), 2015, 52 : 161 - 169
  • [26] Real-time creation of bitmap indexes on streaming network data
    Fusco, Francesco
    Vlachos, Michail
    Stoecklin, Marc Ph
    VLDB JOURNAL, 2012, 21 (03): : 287 - 307
  • [27] Generalized bitmap indexes for multi-way equijoin query processing
    Scott, K
    Perrizo, W
    Zou, QH
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2000, : 542 - 547
  • [28] Breaking Prometheus's curse for cartilage regeneration
    Florence Apparailly
    Nature Reviews Rheumatology, 2017, 13 : 516 - 518
  • [29] Breaking the Curse of Dimensionality with Convex Neural Networks
    Bach, Francis
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18