Comparison of LSM indexing techniques for storing spatial data

被引:1
|
作者
Mao, Qizhong [1 ]
Qader, Mohiuddin Abdul [1 ]
Hristidis, Vagelis [1 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
基金
美国国家科学基金会;
关键词
Spatial index; LSM; Merge policy; Stack-based; Leveled; R-tree; Partition;
D O I
10.1186/s40537-023-00734-3
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the pre-big data era, many traditional databases supported spatial queries via spatial indexes. However, modern applications are seeing a rapid increase of the volume and ingestion rate of spatial data. Log-structured Merge (LSM) tree is used by many big data systems as their storage structure in order to support write-intensive large-volume workloads, which are usually only optimized for single-dimensional data. Research has studied how spatial indexes can be supported on LSM systems, but focused mainly on the local index organization, that is, how data is organized inside a single LSM component. This paper studies various aspects of LSM spatial indexing, including spatial merge policies, which determine when and how spatial components are merged. Three stack-based and one leveled merge policies have been studied, which have been implemented on a common big data system Apache AsterixDB. The write and read performance on various workloads is evaluated, and our findings and recommendations are discussed. A key finding is that Leveled policies underperform other stack-based merge policies for most types of spatial workloads.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Comparison of LSM indexing techniques for storing spatial data
    Qizhong Mao
    Mohiuddin Abdul Qader
    Vagelis Hristidis
    Journal of Big Data, 10
  • [2] SYSTEM DESIGN FOR STORING AND INDEXING SPATIAL DATA IN DATABASE
    Helvaci, Cuneyd
    Gumusay, M. Umit
    SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2010, 28 (04): : 357 - 368
  • [3] Comprehensive Comparison of LSM Architectures for Spatial Data
    Mao, Qizhong
    Qader, Mohiuddin Abdul
    Hristidis, Vagelis
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 455 - 460
  • [4] Storing and Indexing Spatial Data in P2P Systems
    Kantere, Verena
    Skiadopoulos, Spiros
    Sellis, Timos
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (02) : 287 - 300
  • [5] Hierarchical Quadrant Spatial LSM Tree for Indexing Blockchain-Based Geospatial Point Data
    Lee, Junghyun
    Kwon, Taehyeon
    Seo, Minjun
    Jung, Sungwon
    IEEE ACCESS, 2023, 11 : 118088 - 118104
  • [6] A Comparative Study of Secondary Indexing Techniques in LSM-based NoSQL Databases
    Qader, Mohiuddin Abdul
    Cheng, Shiwen
    Hristidis, Vagelis
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 551 - 566
  • [7] Storing and Indexing RDF Data in a Column-Oriented DBMS
    Wang, Xin
    Wang, Shuyi
    Du, Pufeng
    Feng, Zhiyong
    2010 2ND INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS PROCEEDINGS (DBTA), 2010,
  • [8] HMIBase: An Hierarchical Indexing System for Storing and Querying Big Data
    Shengmei Luo
    Di Zhao
    Wei Ge
    Rong Gu
    Chunfeng Yuan
    Yihua Huang
    ZTE Communications, 2014, 12 (04) : 8 - 15
  • [9] ON INDEXING SPATIAL AND TEMPORAL DATA
    SALZBERG, B
    INFORMATION SYSTEMS, 1994, 19 (06) : 447 - 465
  • [10] Techniques for storing XML data in relations
    Saliu, MO
    Shafique, M
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING: I, 2003, : 228 - 233