Survey on Cloud-native Databases

被引:0
|
作者
Dong H.-W. [1 ]
Zhang C. [1 ]
Li G.-L. [1 ]
Feng J.-H. [1 ]
机构
[1] Department of Computer Science and Technology, Tsinghua University, Beijing
来源
Ruan Jian Xue Bao/Journal of Software | 2024年 / 35卷 / 02期
关键词
cloud-native database; compute-storage disaggregation; database-as-a-service (DBaaS);
D O I
10.13328/j.cnki.jos.006952
中图分类号
学科分类号
摘要
The virtualization, high availability, high scheduling elasticity, and other characteristics of cloud infrastructure provide cloud databases with many advantages, such as the out-of-the-box feature, high reliability and availability, and pay-as-you-go model. Cloud databases can be divided into two categories according to the architecture design: cloud-hosted databases and cloud-native databases. Cloud-hosted databases, deploying the database system in the virtual machine environment on the cloud, offer the advantages of low cost, easy operation and maintenance, and high reliability. Besides, cloud-native databases take full advantage of the characteristic elastic scaling of the cloud infrastructure. The disaggregated compute and storage architecture is adopted to achieve the independent scaling of computing and storage resources and further increase the cost-performance ratio of the databases. However, the disaggregated compute and storage architecture poses new challenges to the design of database systems. This survey is an in-depth analysis of the architecture and technology of the cloud-native database system. Specifically, the architectures of cloud-native online transaction processing (OLTP) and online analytical processing (OLAP) databases are classified and analyzed, respectively, according to the difference in the resource disaggregation mode, and the advantages and limitations of each architecture are compared. Then, on the basis of the disaggregated compute and storage architectures, this study explores the key technologies of cloud-native databases in depth by functional modules. The technologies under discussion include those of cloud-native OLTP (data organization, replica consistency, main/standby synchronization, failure recovery, and mixed workload processing) and those of cloud-native OLAP (storage management, query processing, serverless-aware compute, data protection, and machine learning optimization). At last, the study summarizes the technical challenges for existing cloud-native databases and suggests the directions for future research. © 2024 Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:899 / 926
页数:27
相关论文
共 62 条
  • [21] Gutmans A., Cloud blog databases, Introducing AlloyDB for PostgreSQL: Free yourself from expensive, legacy databases, (2022)
  • [22] Azure Managed Disks pricing, (2022)
  • [23] Depoutovitch A, Chen C, Chen J, Larson P, Lin S, Ng J, Cui WL, Liu Q, Huang W, Xiao Y, He YJ., Taurus database: How to be fast, available, and frugal in the cloud, Proc. of the 2020 ACM SIGMOD Int’l Conf. on Management of Data, pp. 1463-1478, (2020)
  • [24] Verbitski A, Gupta A, Saha D, Corey J, Gupta K, Brahmadesam M, Mittal R, Krishnamurthy S, Maurice S, Kharatishvilli T, Bao XF., Amazon aurora: On avoiding distributed consensus for I/Os, commits, and membership changes, Proc. of the 2018 Int’l Conf. on Management of Data, pp. 789-796, (2018)
  • [25] Cao W, Liu ZJ, Wang P, Chen S, Zhu CF, Zheng S, Wang YH, Ma GQ., PolarFS: An ultra-low latency and failure resilient distributed file system for shared storage cloud database, Proc. of the VLDB Endowment, 11, 12, pp. 1849-1862, (2018)
  • [26] Zhang YQ, Ruan CY, Li C, Yang XJ, Cao W, Li FF, Wang B, Fang J, Wang YH, Huo JZ, Bi C., Towards cost-effective and elastic cloud database deployment via memory disaggregation, Proc. of the VLDB Endowment, 14, 10, pp. 1900-1912, (2021)
  • [27] Ranganath Sheshadri, Murthy Ravi, Cloud Blog Databases. AlloyDB for PostgreSQL under the hood: Columnar engine, (2022)
  • [28] Huang DX, Liu Q, Cui Q, Fang ZH, Ma XY, Xu F, Shen L, Tang L, Zhou YX, Huang ML, Wei W, Liu C, Zhang J, Li JJ, Wu XL, Song LY, Sun RX, Yu SP, Zhao L, Cameron N, Pei LQ, Tang X., TiDB: A raft-based HTAP database, Proc. of the VLDB Endowment, 13, 12, pp. 3072-3084, (2020)
  • [29] Prout A, Wang SP, Victor J, Sun Z, Li YZ, Chen J, Bergeron E, Hanson E, Walzer R, Gomes R, Shamgunov N., Cloud-native transactions and analytics in SingleStore, Proc. of the 2022 Int’l Conf. on Management of Data, pp. 2340-2352, (2022)
  • [30] Skeen D., A Quorum-based Commit Protocol, (1982)