Breaking the curse of cardinality on bitmap indexes

被引:0
|
作者
Wu, Kesheng [1 ]
Stockinger, Kurt [1 ]
Shoshani, Arie [1 ]
机构
[1] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
来源
SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS | 2008年 / 5069卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bitmap indexes are known to be efficient for ad-hoe range queries that are common in data warehousing and scientific applications. However, they suffer from the curse of cardinality, that is, their efficiency deteriorates as attribute cardinalities increase. A number of strategies have been proposed, but none of them addresses the problem adequately. In this paper, we propose a novel binned bitmap index that greatly reduces the cost to answer queries, and therefore breaks the curse of cardinality. The key idea is to augment the binned index with an Order-preserving Bin-based Clustering (OrBiC) structure. This data structure significantly reduces the I/O operations needed to resolve records that can not be resolved with the bitmaps. To further improve the proposed index structure, we also present a strategy to create single-valued bins for frequent values. This strategy reduces index sizes and improves query processing speed. Overall, the binned indexes with OrBiC great improves the query processing speed, and are 3 - 25 times faster than the best available indexes for high-cardinality data.
引用
收藏
页码:348 / 365
页数:18
相关论文
共 50 条
  • [31] Breaking a curse - Islam between civilization and barbarism
    Couland, Jacques
    PENSEE, 2009, (357): : 161 - 161
  • [32] Secondary Indexing in One Dimension: Beyond B-trees and Bitmap Indexes
    Pagh, Rasmus
    Satti, Srinivasa Rao
    PODS'09: PROCEEDINGS OF THE TWENTY-EIGHTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2009, : 177 - 185
  • [33] Analyses of Multi-Level and Multi-Component Compressed Bitmap Indexes
    Wu, Kesheng
    Shoshani, Arie
    Stockinger, Kurt
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2010, 35 (01):
  • [34] Particle swarm optimization for bitmap join indexes selection problem in data warehouses
    Toumi, Lyazid
    Moussaoui, Abdelouahab
    Ugur, Ahmet
    JOURNAL OF SUPERCOMPUTING, 2014, 68 (02): : 672 - 708
  • [35] Static and incrementale selection of bitmap join indexes: Concepts, algorithms, performance study
    Bouchakri, Rima
    Bellatreche, Ladjel
    JOURNAL OF DECISION SYSTEMS, 2012, 21 (01) : 51 - 70
  • [36] Particle swarm optimization for bitmap join indexes selection problem in data warehouses
    Lyazid Toumi
    Abdelouahab Moussaoui
    Ahmet Ugur
    The Journal of Supercomputing, 2014, 68 : 672 - 708
  • [37] Breaking the resource curse: Heterogeneous effects of digital government
    Xue, Yan
    Chen, Li
    Feng, Zhiying
    Huang, Yunying
    RESOURCES POLICY, 2024, 90
  • [38] Breaking the Curse of Class Imbalance: Bangla Text Classification
    Rafi-Ur-Rashid, Md
    Mahbub, Mahim
    Adnan, Muhammad Abdullah
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
  • [39] Taxation, accountability, and cash transfers: Breaking the resource curse
    Devarajan, Shantayanan
    Do, Quy-Toan
    JOURNAL OF PUBLIC ECONOMICS, 2023, 218
  • [40] Breaking the curse of dimensionality for machine learning on genomic data
    O'Brien, A.
    Szul, P.
    Dunne, R.
    Bauer, D. C.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2018, 26 : 727 - 728