Parallax: Hybrid Key-Value Placement in LSM-based Key-Value Stores

被引:11
|
作者
Xanthakis, Giorgos [1 ,2 ]
Saloustros, Giorgos [1 ]
Batsaras, Nikos [1 ,2 ]
Papagiannis, Anastasios [1 ,2 ,3 ]
Bilas, Angelos [1 ,2 ]
机构
[1] Fdn Res & Technol Hellas FORTH, Inst Comp Sci ICS, Iraklion, Greece
[2] Univ Crete, Dept Comp Sci, Rethimnon, Greece
[3] Facebook, London, England
基金
欧盟地平线“2020”;
关键词
D O I
10.1145/3472883.3487012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Key-value (KV) separation is a technique that introduces randomness in the I/O access patterns to reduce I/O amplification in LSM-based key-value stores. KV separation has a significant drawback that makes it less attractive: Delete and update operations in modern workloads result in frequent and expensive garbage collection (GC) in the value log. In this paper, we design and implement Parallax, which proposes hybrid KV placement to reduce GC overhead significantly and increases the benefits of using a log. We first model the benefits of KV separation for different KV pair sizes. We use this model to classify KV pairs in three categories small, medium, and large. Then, Parallax uses different approaches for each KV category: It always places large values in a log and small values in place. For medium values it uses a mixed strategy that combines the benefits of using a log and eliminates GC overhead as follows: It places medium values in a log for all but the last few (typically one or two) levels in the LSM structure, where it performs a full compaction, merges values in place, and reclaims log space without the need for GC. We evaluate Parallax against RocksDB that places all values in place and BlobDB that always performs KV separation. We find that Parallax increases throughput by up to 12.4x and 17.83x, decreases I/O amplification by up to 27.1x and 26x, and increases CPU efficiency by up to 18.7x and 28x, respectively, for all but scan-based YCSB workloads.
引用
收藏
页码:305 / 318
页数:14
相关论文
共 50 条
  • [21] Towards Elastic Key-value Stores on IaaS
    Li, Han
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2013, : 302 - 305
  • [22] A Design for Scalable and Secure Key-value Stores
    Chen, Longbin
    Dai, Wenyun
    Qiu, Meikang
    Jiang, Ning
    2017 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2017, : 216 - 221
  • [23] Re-LSM: A ReRAM-based Processing-in-Memory Framework for LSM-based Key-Value Store
    Wei, Qian
    Shen, Zhaoyan
    Tong, Yiheng
    Jia, Zhiping
    Ju, Lei
    Chen, Jiezhi
    Li, Bingzhe
    2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,
  • [24] Robust Data Sharing with Key-Value Stores
    Basescu, Cristina
    Cachin, Christian
    Eyal, Ittay
    Haas, Robert
    Sorniotti, Alessandro
    Vukolic, Marko
    Zachevsky, Ido
    2012 42ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2012,
  • [25] Integrity Verification in Cloud Key-Value Stores
    Weintraub, Grisha
    Rise, Leonid
    Kadosh, Alon
    PROCEEDINGS OF THE 15TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS AND STORAGE, SYSTOR 2022, 2022, : 140 - 140
  • [26] A Correlation-Aware Data Placement Strategy for Key-Value Stores
    Vilaca, Ricardo
    Oliveira, Rui
    Pereira, Jose
    DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS, 2011, 6723 : 214 - 227
  • [27] Coordinating Compaction between LSM-tree based Key-Value Stores for Edge Federation
    Kim, Jeeseob
    Yoo, Honghyeon
    Lee, Seungjae
    Byun, Hongsu
    Park, Sungyong
    2024 IEEE 17TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD 2024, 2024, : 419 - 429
  • [28] gLSM: Using GPGPU to Accelerate Compactions in LSM-tree-based Key-value Stores
    Sun, Hui
    Xu, Jinfeng
    Jiang, Xiangxiang
    Chen, Guanzhong
    Yue, Yinliang
    Qin, Xiao
    ACM TRANSACTIONS ON STORAGE, 2024, 20 (01)
  • [29] On the Support of Versioning in Distributed Key-Value Stores
    Felber, Pascal
    Pasin, Marcelo
    Riviere, Etienne
    Schiavoni, Valerio
    Sutra, Pierre
    Coelho, Fabio
    Oliveira, Rui
    Matos, Miguel
    Vilaca, Ricardo
    2014 IEEE 33RD INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2014, : 95 - 104
  • [30] Elevating Performance of LSM-Tree-Based Key-Value Stores with Gradient Data Hierarchy
    Sun, Hui
    Xu, Jinfeng
    Qin, Xiao
    2023 IEEE 16TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD, 2023, : 360 - 369