A Performance Optimization Method for Key-Value Store Based on LSM-tree

Cited by: 0
Authors
Wang H. [1, 2]
Li Z. [1, 2]
Zhang X. [1, 2]
Zhao X. [1, 2]
Affiliations
[1] School of Computer Science, Northwestern Polytechnical University, Xi'an
[2] Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an
Funding
National Natural Science Foundation of China
Keywords
File system; Key-value store; LSM-tree; Write performance; Write-ahead log (WAL)
DOI
10.7544/issn1000-1239.2019.20190110
Abstract
Persistent key-value (KV) stores play a critical role in many modern data-intensive applications, such as Web indexing, e-commerce, and cloud data storage systems. KV stores based on the log-structured merge tree (LSM-tree) have attracted growing attention because they eliminate random writes while maintaining acceptable read performance. However, they suffer from two performance issues. First, they rely on write-ahead log (WAL) files to guarantee the atomicity and safety of write operations and to enable recovery after a crash. Frequent WAL file updates cause severe write amplification and metadata overhead, which degrades performance. Second, these KV stores usually keep their data on a conventional local filesystem, whose unnecessary operations further harm performance. In this paper, we present RocksFS, a filesystem optimized for LSM-tree-based KV stores. We simplify the filesystem by removing unnecessary functions and attributes to reduce filesystem overhead, and we redesign the format and I/O path of the WAL file to decrease metadata overhead. We compare RocksFS with conventional filesystems under RocksDB, a popular LSM-tree-based KV store. The experimental results show that RocksFS improves the write performance of RocksDB for small key-value data by up to 8x compared with traditional filesystems on both hard disk drives and solid-state disks. © 2019, Science Press. All rights reserved.
Pages: 1792-1802
Page count: 10