WIPE: A Write-Optimized Learned Index for Persistent Memory

被引:2
|
作者
Wang, Zhonghua [1 ]
Ding, Chen [1 ]
Song, Fengguang [2 ]
Lu, Kai [1 ]
Wan, Jiguang [1 ]
Tan, Zhihu [1 ]
Xie, Changsheng [1 ]
Li, Guokuan [1 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Hubei, Peoples R China
[2] Indiana Univ Purdue Univ Indianapolis, Indianapolis, IN 46202 USA
基金
中国国家自然科学基金;
关键词
NVM-based learned index; learned index; persistent index; key-value store; RANDOM-ACCESS MEMORY; HIGH-PERFORMANCE; CHALLENGES; TREES;
D O I
10.1145/3634915
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Learned Index, which utilizes effective machine learning models to accelerate locating sorted data positions, has gained increasing attention in many big data scenarios. Using efficient learned models, the learned indexes build large nodes and flat structures, thereby greatly improving the performance. However, most of the state-of-the-art learned indexes are designed for DRAM, and there is hence an urgent need to enable high-performance learned indexes for emerging Non-Volatile Memory (NVM). In this article, we first evaluate and analyze the performance of the existing learned indexes on NVM. We discover that these learned indexes encounter severe write amplification and write performance degradation due to the requirements of maintaining large sorted/semi-sorted data nodes. To tackle the problems, we propose a novel three-tiered architecture of write-optimized persistent learned index, which is named WIPE, by adopting unsorted fine-granularity data nodes to achieve high write performance on NVM. There into, we devise a new root node construction algorithm to accelerate searching numerous small data nodes. The algorithm ensures stable flat structure and high read performance in large-size datasets by introducing an intermediate layer (i.e., index nodes) and achieving accurate prediction of index node positions from the root node. Our extensive experiments on Intel DCPMM show that WIPE can improve write throughput and read throughput by up to 3.9x and 7x, respectively, compared to the state-of-the-art learned indexes. Also, WIPE can recover from a system crash in similar to 18ms. WIPE is free as an open-source software package.(1)
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Write-Optimized Dynamic Hashing for Persistent Memory
    Nam, Moohyeon
    Cha, Hokeun
    Choi, Young-ri
    Noh, Sam H.
    Nam, Beomseok
    PROCEEDINGS OF THE 17TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, 2019, : 31 - 44
  • [2] Write-Optimized B+ Tree Index Technology for Persistent Memory
    Ma, Rui-Xiang
    Wu, Fei
    Dong, Bu-Rong
    Zhang, Meng
    Li, Wei-Jun
    Xie, Chang-Sheng
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2021, 36 (05) : 1037 - 1050
  • [3] Write-Optimized B+ Tree Index Technology for Persistent Memory
    Rui-Xiang Ma
    Fei Wu
    Bu-Rong Dong
    Meng Zhang
    Wei-Jun Li
    Chang-Sheng Xie
    Journal of Computer Science and Technology, 2021, 36 : 1037 - 1050
  • [4] Write-Optimized and High-Performance Hashing Index Scheme for Persistent Memory
    Zuo, Pengfei
    Hua, Yu
    Wu, Jie
    PROCEEDINGS OF THE 13TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, 2018, : 461 - 476
  • [5] PMEH: A Parallel and Write-Optimized Extendible Hashing for Persistent Memory
    Hu, Jing
    Chen, Jianxi
    Zhu, Yifeng
    Yang, Qing
    Peng, Zhouxuan
    Yu, Ya
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (11) : 3801 - 3814
  • [6] A highly write-optimized concurrent B plus -tree for persistent memory
    Yan, Wei
    Zhang, Xingjun
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 155 : 219 - 230
  • [7] ReoFS: A Read-Efficient and Write-Optimized File System for Persistent Memory
    Yan, Yan
    Huang, Kaixin
    Zheng, Shengan
    Xue, Dongliang
    Huang, Linpeng
    2020 25TH INTERNATIONAL CONFERENCE ON ENGINEERING OF COMPLEX COMPUTER SYSTEMS (ICECCS 2020), 2020, : 177 - 186
  • [8] TLBtree: A Read/Write-Optimized Tree Index for Non-Volatile Memory
    Luo, Yongping
    Jin, Peiquan
    Zhang, Qinglin
    Cheng, Bin
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1889 - 1894
  • [9] Zallocator: A High Throughput Write-Optimized Persistent Allocator for Non-Volatile Memory
    Wu, You
    Li, Lin
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2022, 18 (04)
  • [10] Sherman: A Write-Optimized Distributed B+Tree Index on Disaggregated Memory
    Wang, Qing
    Lu, Youyou
    Shu, Jiwu
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 1033 - 1048