Multi-resolution bitmap indexes for scientific data

被引:42
|
作者
Sinha, Rishi Rakesh [1 ]
Winslett, Marianne [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2007年 / 32卷 / 03期
关键词
performance; algorithm; query processing; bitmap index; scientific data management; parallel index;
D O I
10.1145/1272743.1272746
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The unique characteristics of scientific data and queries cause traditional indexing techniques to perform poorly on scientific workloads, occupy excessive space, or both. Refinements of bitmap indexes have been proposed previously as a solution to this problem. In this article, we describe the difficulties we encountered in deploying bitmap indexes with scientific data and queries from two real-world domains. In particular, previously proposed methods of binning, encoding, and compressing bitmap vectors either were quite slow for processing the large-range query conditions our scientists used, or required excessive storage space. Nor could the indexes easily be built or used on parallel platforms. In this article, we show how to solve these problems through the use of multi-resolution, parallelizable bitmap indexes, which support a fine-grained trade-off between storage requirements and query performance. Our experiments with large data sets from two scientific domains show that multi-resolution, parallelizable bitmap indexes occupy an acceptable amount of storage while improving range query performance by roughly a factor of 10, compared to a single-resolution bitmap index of reasonable size.
引用
收藏
页数:39
相关论文
共 50 条
  • [41] Compression of Multi-resolution Terrain Data based on Binary Tree
    Bai Jianjun
    Yan Chaode
    ADVANCES IN MANUFACTURING TECHNOLOGY, PTS 1-4, 2012, 220-223 : 2628 - 2634
  • [42] Static and incremental dynamic approaches for multi-objective bitmap join indexes selection in data warehouses
    Lyazid Toumi
    Ahmet Ugur
    The Journal of Supercomputing, 2021, 77 : 3933 - 3958
  • [43] Static and incremental dynamic approaches for multi-objective bitmap join indexes selection in data warehouses
    Toumi, Lyazid
    Ugur, Ahmet
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (04): : 3933 - 3958
  • [44] Generation of synthetic multi-resolution time series load data
    Pinceti, Andrea
    Sankar, Lalitha
    Kosut, Oliver
    IET SMART GRID, 2023, 6 (05) : 492 - 502
  • [45] Scalable Multi-resolution Spatial Visualization for Anthropogenic Litter Data
    Kang, Yunfan
    Zhao, Ziang
    Magdy, Amr
    Cowger, Win Colton
    Gray, Andrew Bruce
    27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 560 - 563
  • [46] HiMuV: Hierarchical Framework for Modeling Multi-Modality Multi-Resolution Data
    Li, Jianbo
    He, Jingrui
    Zhu, Yada
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 267 - 276
  • [47] MULTI-RESOLUTION STATISTICAL ANALYSIS ON GRAPH STRUCTURED DATA IN NEUROIMAGING
    Kim, Won Hwa
    Singh, Vikas
    Chung, Moo K.
    Adluru, Nagesh
    Bendlin, Barbara B.
    Johnson, Sterling C.
    2015 IEEE 12TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2015, : 1548 - 1551
  • [48] Spatiotemporal multi-resolution approximations for analyzing global environmental data
    Appel, Marius
    Pebesma, Edzer
    SPATIAL STATISTICS, 2020, 38
  • [49] Hierarchical Key Management for Multi-resolution Load Data Representation
    Peer, Christian D.
    Engel, Dominik
    Wicker, Stephen B.
    2014 IEEE INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2014, : 926 - 932
  • [50] The benefits of visual multi-source, multi-resolution data analysis and fusion
    Powell, AM
    Zuzolo, PA
    16TH INTERNATIONAL CONFERENCE ON INTERACTIVE INFORMATION AND PROCESSING SYSTEMS (IIPS) FOR METEOROLOGY, OCEANOGRAPHY AND HYDROLOGY, 2000, : 132 - 135