XStore: Fast RDMA-Based Ordered Key-Value Store Using Remote Learned Cache

被引:5
|
作者
Wei, Xingda [1 ,2 ]
Chen, Rong [1 ,2 ]
Chen, Haibo [1 ,3 ]
Zang, Binyu [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
[3] Minist Educ, Engn Res Ctr Domain Specif Operating Syst, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
RDMA-based key-value store; machine learning model; tree-based index structure; index caching; DISTRIBUTED TRANSACTIONS;
D O I
10.1145/3468520
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
RDMA(Remote Direct MemoryAccess) has gained considerable interests in network-attached in-memory key-value stores. However, traversing the remote tree-based index in ordered key-value stores with RDMA becomes a critical obstacle, causing an order-of-magnitude slowdown and limited scalability due to multiple round trips. Using index cache with conventional wisdom-caching partial data and traversing them locally-usually leads to limited effect because of unavoidable capacity misses, massive random accesses, and costly cache invalidations. We argue that the machine learning (ML) model is a perfect cache structure for the tree-based index, termed learned cache. Based on it, we design and implement XStore, an RDMA-based ordered key-value store with a new hybrid architecture that retains a tree-based index at the server to perform dynamic workloads (e.g., inserts) and leverages a learned cache at the client to perform static workloads (e.g., gets and scans). The key idea is to decouple ML model retraining from index updating by maintaining a layer of indirection from logical to actual positions of key-value pairs. It allows a stale learned cache to continue predicting a correct position for a lookup key. XStore ensures correctness using a validation mechanism with a fallback path and further uses speculative execution to minimize the cost of cache misses. Evaluations with YCSB benchmarks and production workloads show that a single XStore server can achieve over 80 million read-only requests per second. This number outperforms state-of-the-art RDMA-based ordered key-value stores (namely, DrTMTree, Cell, and eRPC+Masstree) by up to 5.9x (from 3.7x). For workloads with inserts, XStore still provides up to 3.5x (from 2.7x) throughput speedup, achieving 53M reqs/s. The learned cache can also reduce clientside memory usage and further provides an efficient memory-performance tradeoff, e.g., saving 99% memory at the cost of 20% peak throughput.
引用
收藏
页数:32
相关论文
共 50 条
  • [1] Fast RDMA-based Ordered Key-Value Store using Remote Learned Cache
    Wei, Xingda
    Chen, Rong
    Chen, Haibo
    PROCEEDINGS OF THE 14TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDI '20), 2020, : 117 - 135
  • [2] InnerCache: A Tactful Cache Mechanism for RDMA-Based Key-Value Store
    Yang, Min
    Yu, Songping
    Yu, Rujie
    Xiao, Nong
    Liu, Fang
    Chen, Wei
    2016 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS), 2016, : 646 - 649
  • [3] AStore: Uniformed Adaptive Learned Index and Cache for RDMA-Enabled Key-Value Store
    Qiao, Pengpeng
    Zhang, Zhiwei
    Li, Yuntong
    Yuan, Ye
    Wang, Shuliang
    Wang, Guoren
    Yu, Jeffrey Xu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 2877 - 2894
  • [4] Fast In-Memory Key-Value Cache System with RDMA
    Ghen, Wei
    Yu, Songping
    Wang, Zhiying
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2019, 28 (05)
  • [5] Exploiting Hybrid Index Scheme for RDMA-based Key-Value Stores
    Han, Shukai
    Zhang, Mi
    Jiang, Dejun
    Xiong, Jin
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, SYSTOR 2023, 2023, : 49 - 59
  • [6] Accelerating I/O Performance of Big Data Analytics on HPC Clusters through RDMA-based Key-Value Store
    Islam, Nusrat Sharmin
    Shankar, Dipti
    Lu, Xiaoyi
    Wasi-ur-Rahman, Md.
    Panda, Dhabaleswar K.
    2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 280 - 289
  • [7] A Fast Learned Key-Value Store for Concurrent and Distributed Systems
    Li, Pengfei
    Hua, Yu
    Jia, Jingnan
    Zuo, Pengfei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (06) : 2301 - 2315
  • [8] ROLEX: A Scalable RDMA-oriented Learned Key-Value Store for Disaggregated Memory Systems
    Li, Pengfei
    Hua, Yu
    Zuo, Pengfei
    Chen, Zhangyu
    Sheng, Jiajie
    PROCEEDINGS OF THE 21ST USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, FAST 2023, 2023, : 99 - 113
  • [9] Nessie: A Decoupled, Client-Driven Key-Value Store Using RDMA
    Cassell, Benjamin
    Szepesi, Tyler
    Wong, Bernard
    Brecht, Tim
    Ma, Jonathan
    Liu, Xiaoyi
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (12) : 3537 - 3552
  • [10] A High-performance RDMA-oriented Learned Key-value Store for Disaggregated Memory Systems
    Li, Pengfei
    Hua, Yu
    Zuo, Pengfei
    Chen, Zhangyu
    Sheng, Jiajie
    ACM TRANSACTIONS ON STORAGE, 2023, 19 (04)