Parallax: Hybrid Key-Value Placement in LSM-based Key-Value Stores

被引:11
|
作者
Xanthakis, Giorgos [1 ,2 ]
Saloustros, Giorgos [1 ]
Batsaras, Nikos [1 ,2 ]
Papagiannis, Anastasios [1 ,2 ,3 ]
Bilas, Angelos [1 ,2 ]
机构
[1] Fdn Res & Technol Hellas FORTH, Inst Comp Sci ICS, Iraklion, Greece
[2] Univ Crete, Dept Comp Sci, Rethimnon, Greece
[3] Facebook, London, England
基金
欧盟地平线“2020”;
关键词
D O I
10.1145/3472883.3487012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Key-value (KV) separation is a technique that introduces randomness in the I/O access patterns to reduce I/O amplification in LSM-based key-value stores. KV separation has a significant drawback that makes it less attractive: Delete and update operations in modern workloads result in frequent and expensive garbage collection (GC) in the value log. In this paper, we design and implement Parallax, which proposes hybrid KV placement to reduce GC overhead significantly and increases the benefits of using a log. We first model the benefits of KV separation for different KV pair sizes. We use this model to classify KV pairs in three categories small, medium, and large. Then, Parallax uses different approaches for each KV category: It always places large values in a log and small values in place. For medium values it uses a mixed strategy that combines the benefits of using a log and eliminates GC overhead as follows: It places medium values in a log for all but the last few (typically one or two) levels in the LSM structure, where it performs a full compaction, merges values in place, and reclaims log space without the need for GC. We evaluate Parallax against RocksDB that places all values in place and BlobDB that always performs KV separation. We find that Parallax increases throughput by up to 12.4x and 17.83x, decreases I/O amplification by up to 27.1x and 26x, and increases CPU efficiency by up to 18.7x and 28x, respectively, for all but scan-based YCSB workloads.
引用
收藏
页码:305 / 318
页数:14
相关论文
共 50 条
  • [1] AC-Key: Adaptive Caching for LSM-based Key-Value Stores
    Wu, Fenggang
    Yang, Ming-Hong
    Zhang, Baoquan
    Du, David H. C.
    PROCEEDINGS OF THE 2020 USENIX ANNUAL TECHNICAL CONFERENCE, 2020, : 603 - 615
  • [2] SineKV: Decoupled Secondary Indexing for LSM-based Key-Value Stores
    Li, Fei
    Lu, Youyou
    Yang, Zhe
    Shu, Jiwu
    2020 IEEE 40TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2020, : 1112 - 1122
  • [3] SPEICHER: Securing LSM-based Key-Value Stores using Shielded Execution
    Bailleu, Maurice
    Thalheim, Jorg
    Bhatotia, Pramod
    Fetzer, Christof
    Honda, Michio
    Vaswani, Kapil
    PROCEEDINGS OF THE 17TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, 2019, : 173 - 190
  • [4] Accelerating Native Transaction Processing in LSM-Based Persistent Key-Value Stores
    Xue, Jin
    Shao, Zili
    2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 1156 - 1158
  • [5] Can Modern LLMs Tune and Configure LSM-based Key-Value Stores?
    Thakkar, Viraj
    Sukumar, Madhumitha
    Dai, Jiaxin
    Singh, Kaushiki
    Cao, Zhichao
    PROCEEDINGS OF THE 2024 16TH ACM WORKSHOP ON HOT TOPICS IN STORAGE AND FILE SYSTEMS, HOTSTORAGE 2024, 2024, : 116 - 123
  • [6] Index Shipping for Efficient Replication in LSM Key-Value Stores with Hybrid KV Placement
    Stylianakis, Giorgos
    Saloustros, Giorgos
    Chiotakis, Orestis
    Xanthakis, Giorgos
    Forth, Angelos Bilas
    ACM TRANSACTIONS ON STORAGE, 2024, 20 (03)
  • [7] FPGA-Accelerated Compactions for LSM-based Key-Value Store
    Zhang, Teng
    Wang, Jianying
    Cheng, Xuntao
    Xu, Hao
    Yu, Nanlong
    Huang, Gui
    Zhang, Tieying
    He, Dengcheng
    Li, Feifei
    Cao, Wei
    Huang, Zhongdong
    Sun, Jianling
    PROCEEDINGS OF THE 18TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, 2020, : 225 - 237
  • [8] Optimization of LSM-Tree for Key-Value Stores
    Wu S.
    Xie J.
    Wang Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (11): : 2432 - 2441
  • [9] Studies on LSM-tree Based NoSQL Key-Value Stores
    Xu, Ming
    Xu, Xianbin
    Zheng, Fang
    Yin, Mengjia
    2016 3RD INTERNATIONAL CONFERENCE ON ECONOMIC, BUSINESS MANAGEMENT AND EDUCATIONAL INNOVATION (EBMEI 2016), PT 2, 2016, 55 : 726 - 730
  • [10] HiLSM: An LSM-Based Key-Value Store for Hybrid NVM-SSD Storage Systems
    Li, Wenjie
    Jiang, Dejun
    Xiong, Jin
    Bao, Yungang
    17TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2020 (CF 2020), 2020, : 208 - 216