Multi-resolution bitmap indexes for scientific data

Cited by: 42
Authors
Sinha, Rishi Rakesh [1]
Winslett, Marianne [1]
Affiliations
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
Source
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2007 / Volume 32 / Issue 03
Keywords
performance; algorithm; query processing; bitmap index; scientific data management; parallel index
DOI
10.1145/1272743.1272746
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The unique characteristics of scientific data and queries cause traditional indexing techniques to perform poorly on scientific workloads, occupy excessive space, or both. Refinements of bitmap indexes have been proposed previously as a solution to this problem. In this article, we describe the difficulties we encountered in deploying bitmap indexes with scientific data and queries from two real-world domains. In particular, previously proposed methods of binning, encoding, and compressing bitmap vectors either were quite slow for processing the large-range query conditions our scientists used, or required excessive storage space. Nor could the indexes easily be built or used on parallel platforms. In this article, we show how to solve these problems through the use of multi-resolution, parallelizable bitmap indexes, which support a fine-grained trade-off between storage requirements and query performance. Our experiments with large data sets from two scientific domains show that multi-resolution, parallelizable bitmap indexes occupy an acceptable amount of storage while improving range query performance by roughly a factor of 10, compared to a single-resolution bitmap index of reasonable size.
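To make the trade-off described in the abstract concrete, here is a minimal sketch of a binned bitmap index kept at two resolutions. It is not the authors' implementation: the function names, the use of Python integers as uncompressed bitsets, the half-open range semantics, and the `factor` grouping of fine bins into coarse bins are all assumptions made for illustration. A range query ORs coarse bitmaps where a whole coarse bin lies inside the range, falls back to fine bitmaps elsewhere, and checks the raw values only for the partially covered boundary bins.

```python
from bisect import bisect_left, bisect_right

def build_bitmaps(values, edges):
    """One bitmap (a Python int used as a bitset) per bin [edges[i], edges[i+1]).
    Assumes every value lies in [edges[0], edges[-1])."""
    bitmaps = [0] * (len(edges) - 1)
    for row, v in enumerate(values):
        bitmaps[bisect_right(edges, v) - 1] |= 1 << row
    return bitmaps

def coarsen(fine, factor):
    """Coarse bin j is the OR of fine bins j*factor .. (j+1)*factor - 1."""
    coarse = []
    for j in range(len(fine) // factor):
        merged = 0
        for bm in fine[j * factor:(j + 1) * factor]:
            merged |= bm
        coarse.append(merged)
    return coarse

def rows_of(bitmap):
    """Yield the row numbers whose bits are set."""
    row = 0
    while bitmap:
        if bitmap & 1:
            yield row
        bitmap >>= 1
        row += 1

def range_query(values, edges, fine, coarse, factor, lo, hi):
    """Return (sorted rows with lo <= value < hi, number of bitmaps ORed)."""
    first = bisect_left(edges, lo)       # first fine bin starting at or after lo
    last = bisect_right(edges, hi) - 1   # fine bins first..last-1 lie fully inside
    result, ops, i = 0, 0, first
    while i < last:
        if i % factor == 0 and i + factor <= last:
            result |= coarse[i // factor]  # one coarse bitmap replaces `factor` fine ones
            i += factor
        else:
            result |= fine[i]
            i += 1
        ops += 1
    # Boundary bins are only partially covered, so check the raw values.
    for b in {first - 1, last}:
        if 0 <= b < len(fine):
            for row in rows_of(fine[b]):
                if lo <= values[row] < hi:
                    result |= 1 << row
    return sorted(rows_of(result)), ops

values = [3, 17, 25, 42, 58, 61, 77, 89, 94, 12]
edges = list(range(0, 101, 10))          # ten fine bins of width 10
fine = build_bitmaps(values, edges)
coarse = coarsen(fine, factor=5)         # two coarse bins of width 50
rows, ops = range_query(values, edges, fine, coarse, 5, 0, 60)
print(rows, ops)                         # [0, 1, 2, 3, 4, 9] 2
```

In this toy example the range [0, 60) is answered with two bitmap operations (one coarse, one fine) instead of the six a single-resolution index over the same fine bins would need. The price is the additional storage for the coarse bitmaps, which is the kind of storage-versus-query-time trade-off the abstract refers to.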
Pages: 39