Interval Indexing and Querying on Key-Value Cloud Stores

被引:0
|
作者
Sfakianakis, George [1 ]
Patlakas, Ioannis [1 ]
Ntarmos, Nikos [1 ]
Triantafillou, Peter [1 ]
机构
[1] Univ Patras, Comp Engn & Informat Dept, Rion 26500, Greece
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud key-value stores are becoming increasingly more important. Challenging applications, requiring efficient and scalable access to massive data, arise every day. We focus on supporting interval queries (which are prevalent in several data intensive applications, such as temporal querying for temporal analytics), an efficient solution for which is lacking. We contribute a compound interval index structure, comprised of two tiers: (i) the MRSegmentTree (MRST), a key-value representation of the Segment Tree, and (ii) the Endpoints Index (EPI), a column family index that stores information for interval endpoints. In addition to the above, our contributions include: (i) algorithms for efficiently constructing and populating our indices using MapReduce jobs, (ii) techniques for efficient and scalable index maintenance, and (iii) algorithms for processing interval queries. We have implemented all algorithms using HBase and Hadoop, and conducted a detailed performance evaluation. We quantify the costs associated with the construction of the indices, and evaluate our query processing algorithms using queries on real data sets. We compare the performance of our approach to two alternatives: the native support for interval queries provided in HBase, and the execution of such queries using the Hive query execution tool. Our results show a significant speedup, far outperforming the state of the art.
引用
收藏
页码:805 / 816
页数:12
相关论文
共 50 条
  • [31] Taming Tail Latency in Key-Value Stores: A Scheduling Perspective
    Ben Mokhtar, Sonia
    Canon, Louis-Claude
    Dugois, Anthony
    Marchal, Loris
    Riviere, Etienne
    EURO-PAR 2021: PARALLEL PROCESSING, 2021, 12820 : 136 - 150
  • [32] Totally Ordered Replication for Massive Scale Key-Value Stores
    Ribeiro, Jose
    Machado, Nuno
    Maia, Francisco
    Matos, Miguel
    DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS (DAIS 2018), 2018, 10853 : 58 - 74
  • [33] Enabling Encrypted Rich Queries in Distributed Key-Value Stores
    Guo, Yu
    Yuan, Xingliang
    Wang, Xinyu
    Wang, Cong
    Li, Baochun
    Jia, Xiaohua
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (06) : 1283 - 1297
  • [34] Rethink the Linearizability Constraints of Raft for Distributed Key-Value Stores
    Wang, Yangyang
    Wang, Zikai
    Chai, Yunpeng
    Wang, Xin
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1877 - 1882
  • [35] KVSwitch: An In-network Load Balancer for Key-Value Stores
    Shi, Yang
    Fei, Jiawei
    Wen, Mei
    Zhang, Chunyuan
    2019 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2019, : 657 - 663
  • [36] Ontology-Mediated Query Answering for Key-Value Stores
    Bienvenu, Meghyn
    Bourhis, Pierre
    Mugnier, Marie-Laure
    Tison, Sophie
    Ulliana, Federico
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 844 - 851
  • [37] GeoWave: Utilizing Distributed Key-Value Stores for Multidimensional Data
    Whitby, Michael A.
    Fecher, Rich
    Bennight, Chris
    ADVANCES IN SPATIAL AND TEMPORAL DATABASES, SSTD 2017, 2017, 10411 : 105 - 122
  • [38] SplinterDB: Closing the Bandwidth Gap for NVMe Key-Value Stores
    Conway, Alex
    Gupta, Abhishek
    Chidambaran, Vijay
    Farach-Colton, Martin
    Spillane, Rick
    Tai, Amy
    Johnson, Rob
    PROCEEDINGS OF THE 2020 USENIX ANNUAL TECHNICAL CONFERENCE, 2020, : 49 - 63
  • [39] Accordion: Better Memory Organization for LSM Key-Value Stores
    Bortnikov, Edward
    Braginsky, Anastasia
    Hillel, Eshcar
    Keidar, Idit
    Sheffi, Gali
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (12): : 1863 - 1875
  • [40] Brief Announcement: Robust Data Sharing with Key-Value Stores
    Basescu, Cristina
    Cachin, Christian
    Eyal, Ittay
    Haas, Robert
    Vukolic, Marko
    PODC 11: PROCEEDINGS OF THE 2011 ACM SYMPOSIUM PRINCIPLES OF DISTRIBUTED COMPUTING, 2011, : 221 - 222