The PH-Tree - A Space-Efficient Storage Structure and Multi-Dimensional Index

被引:28
|
作者
Zaschke, Tilmann [1 ]
Zimmerli, Christoph [1 ]
Norrie, Moira C. [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Inst Informat Syst, Zurich, Switzerland
关键词
Multi-dimensional index; space efficiency; spatial index; patricia-trie; hypercube; quadtree; skewed data; PATRICIA; SEARCH;
D O I
10.1145/2588555.2588564
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose the PATRICIA-hypercube-tree, or PH-tree, a multi-dimensional data storage and indexing structure. It is based on binary PATRICIA-tries combined with hyper-cubes for efficient data access. Space efficiency is achieved by combining prefix sharing with a space optimised implementation. This leads to storage space requirements that are comparable or below storage of the same data in non-index structures such as arrays of objects. The storage structure also serves as a multi-dimensional index on all dimensions of the stored data. This enables efficient access to stored data via point and range queries. We explain the concept of the PH-tree and demonstrate the performance of a sample implementation on various datasets and compare it to other spatial indices such as the kD-tree. The experiments show that for larger datasets beyond 10(7) entries, the PH-tree increasingly and consistently outperforms other structures in terms of space efficiency, query performance and update performance. For some highly skewed datasets, it even shows super-constant performance, becoming faster for larger datasets.
引用
收藏
页码:397 / 408
页数:12
相关论文
共 50 条
  • [41] Data space mapping for efficient I/O in large multi-dimensional databases
    Ferhatosmanoglu, Hakan
    Ramachandran, Aravind
    Agrawal, Divyakant
    El Abbadi, Amr
    INFORMATION SYSTEMS, 2007, 32 (01) : 83 - 103
  • [42] An efficient architecture for multi-dimensional convolution
    Elnaggar, A
    Aboelaze, M
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-ANALOG AND DIGITAL SIGNAL PROCESSING, 2000, 47 (12): : 1520 - 1523
  • [43] Space-efficient sparse matrix storage formats for massively parallel systems
    Simecek, I.
    Langr, D.
    Tvrdik, P.
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 54 - 60
  • [44] Space-efficient FTL for Mobile Storage via Tiny Neural Nets
    Marcus, Ron
    Rashelbach, Alon
    Ben-Zur, Ori
    Lifshits, Pavel
    Silberstein, Mark
    PROCEEDINGS OF THE 17TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, SYSTOR 2024, 2024, : 146 - 161
  • [45] Compact suffix array -: A space-efficient full-text index
    Mäkinen, V
    FUNDAMENTA INFORMATICAE, 2003, 56 (1-2) : 191 - 210
  • [46] The DCB-Tree: A Space-Efficient Delta Coded Cache Conscious B-Tree
    Binna, Robert
    Pacher, Dominic
    Meindl, Thomas
    Specht, Guenther
    IN MEMORY DATA MANAGEMENT AND ANALYSIS, 2015, 8921 : 126 - 138
  • [47] Secure cloud-of-clouds storage with space-efficient secret sharing
    Niknia, Ahad
    Correia, Miguel
    Karimpour, Jaber
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2021, 59
  • [48] Multi-Dimensional Scheduling in Cloud Storage Systems
    Yao, Zhihao
    Papapanagiotou, Ioannis
    Callaway, Robert D.
    2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2015, : 395 - 400
  • [49] A HYBRID INDEX FOR MULTI-DIMENSIONAL QUERY IN HBASE
    Tang, Xiaosheng
    Han, Boda
    Chen, Han
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 332 - 336
  • [50] cBiK: A Space-Efficient Data Structure for Spatial Keyword Queries
    Sanjuan-Contreras, Carlos E.
    Gutierrez Retamal, Gilberto
    Martinez-Prieto, Miguel A.
    Seco, Diego
    IEEE ACCESS, 2020, 8 (08): : 98827 - 98846