XStore: Fast RDMA-Based Ordered Key-Value Store Using Remote Learned Cache

被引：5

作者：

Wei, Xingda ^{[1
,2
]}

Chen, Rong ^{[1
,2
]}

Chen, Haibo ^{[1
,3
]}

Zang, Binyu ^{[1
,3
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

[2] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China

[3] Minist Educ, Engn Res Ctr Domain Specif Operating Syst, Shanghai, Peoples R China

来源：

ACM TRANSACTIONS ON STORAGE | 2021年 / 17卷 / 03期

基金：

中国国家自然科学基金;

关键词：

RDMA-based key-value store; machine learning model; tree-based index structure; index caching; DISTRIBUTED TRANSACTIONS;

D O I：

10.1145/3468520

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

RDMA(Remote Direct MemoryAccess) has gained considerable interests in network-attached in-memory key-value stores. However, traversing the remote tree-based index in ordered key-value stores with RDMA becomes a critical obstacle, causing an order-of-magnitude slowdown and limited scalability due to multiple round trips. Using index cache with conventional wisdom-caching partial data and traversing them locally-usually leads to limited effect because of unavoidable capacity misses, massive random accesses, and costly cache invalidations. We argue that the machine learning (ML) model is a perfect cache structure for the tree-based index, termed learned cache. Based on it, we design and implement XStore, an RDMA-based ordered key-value store with a new hybrid architecture that retains a tree-based index at the server to perform dynamic workloads (e.g., inserts) and leverages a learned cache at the client to perform static workloads (e.g., gets and scans). The key idea is to decouple ML model retraining from index updating by maintaining a layer of indirection from logical to actual positions of key-value pairs. It allows a stale learned cache to continue predicting a correct position for a lookup key. XStore ensures correctness using a validation mechanism with a fallback path and further uses speculative execution to minimize the cost of cache misses. Evaluations with YCSB benchmarks and production workloads show that a single XStore server can achieve over 80 million read-only requests per second. This number outperforms state-of-the-art RDMA-based ordered key-value stores (namely, DrTMTree, Cell, and eRPC+Masstree) by up to 5.9x (from 3.7x). For workloads with inserts, XStore still provides up to 3.5x (from 2.7x) throughput speedup, achieving 53M reqs/s. The learned cache can also reduce clientside memory usage and further provides an efficient memory-performance tradeoff, e.g., saving 99% memory at the cost of 20% peak throughput.

引用

页数：32

共 50 条

[1] Fast RDMA-based Ordered Key-Value Store using Remote Learned Cache
Wei, Xingda
Chen, Rong
Chen, Haibo
PROCEEDINGS OF THE 14TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDI '20), 2020, : 117 - 135
[2] InnerCache: A Tactful Cache Mechanism for RDMA-Based Key-Value Store
Yang, Min
Yu, Songping
Yu, Rujie
Xiao, Nong
Liu, Fang
Chen, Wei
2016 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS), 2016, : 646 - 649
[3] AStore: Uniformed Adaptive Learned Index and Cache for RDMA-Enabled Key-Value Store
Qiao, Pengpeng
Zhang, Zhiwei
Li, Yuntong
Yuan, Ye
Wang, Shuliang
Wang, Guoren
Yu, Jeffrey Xu
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 2877 - 2894
[4] Fast In-Memory Key-Value Cache System with RDMA
Ghen, Wei
Yu, Songping
Wang, Zhiying
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2019, 28 (05)
[5] Exploiting Hybrid Index Scheme for RDMA-based Key-Value Stores
Han, Shukai
Zhang, Mi
Jiang, Dejun
Xiong, Jin
PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, SYSTOR 2023, 2023, : 49 - 59
[6] Accelerating I/O Performance of Big Data Analytics on HPC Clusters through RDMA-based Key-Value Store
Islam, Nusrat Sharmin
Shankar, Dipti
Lu, Xiaoyi
Wasi-ur-Rahman, Md.
Panda, Dhabaleswar K.
2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 280 - 289
[7] A Fast Learned Key-Value Store for Concurrent and Distributed Systems
Li, Pengfei
Hua, Yu
Jia, Jingnan
Zuo, Pengfei
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (06) : 2301 - 2315
[8] ROLEX: A Scalable RDMA-oriented Learned Key-Value Store for Disaggregated Memory Systems
Li, Pengfei
Hua, Yu
Zuo, Pengfei
Chen, Zhangyu
Sheng, Jiajie
PROCEEDINGS OF THE 21ST USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, FAST 2023, 2023, : 99 - 113
[9] Nessie: A Decoupled, Client-Driven Key-Value Store Using RDMA
Cassell, Benjamin
Szepesi, Tyler
Wong, Bernard
Brecht, Tim
Ma, Jonathan
Liu, Xiaoyi
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (12) : 3537 - 3552
[10] A High-performance RDMA-oriented Learned Key-value Store for Disaggregated Memory Systems
Li, Pengfei
Hua, Yu
Zuo, Pengfei
Chen, Zhangyu
Sheng, Jiajie
ACM TRANSACTIONS ON STORAGE, 2023, 19 (04)

← 1 2 3 4 5 →