Survey on Cloud-native Databases

被引:0
|
作者
Dong H.-W. [1 ]
Zhang C. [1 ]
Li G.-L. [1 ]
Feng J.-H. [1 ]
机构
[1] Department of Computer Science and Technology, Tsinghua University, Beijing
来源
Ruan Jian Xue Bao/Journal of Software | 2024年 / 35卷 / 02期
关键词
cloud-native database; compute-storage disaggregation; database-as-a-service (DBaaS);
D O I
10.13328/j.cnki.jos.006952
中图分类号
学科分类号
摘要
The virtualization, high availability, high scheduling elasticity, and other characteristics of cloud infrastructure provide cloud databases with many advantages, such as the out-of-the-box feature, high reliability and availability, and pay-as-you-go model. Cloud databases can be divided into two categories according to the architecture design: cloud-hosted databases and cloud-native databases. Cloud-hosted databases, deploying the database system in the virtual machine environment on the cloud, offer the advantages of low cost, easy operation and maintenance, and high reliability. Besides, cloud-native databases take full advantage of the characteristic elastic scaling of the cloud infrastructure. The disaggregated compute and storage architecture is adopted to achieve the independent scaling of computing and storage resources and further increase the cost-performance ratio of the databases. However, the disaggregated compute and storage architecture poses new challenges to the design of database systems. This survey is an in-depth analysis of the architecture and technology of the cloud-native database system. Specifically, the architectures of cloud-native online transaction processing (OLTP) and online analytical processing (OLAP) databases are classified and analyzed, respectively, according to the difference in the resource disaggregation mode, and the advantages and limitations of each architecture are compared. Then, on the basis of the disaggregated compute and storage architectures, this study explores the key technologies of cloud-native databases in depth by functional modules. The technologies under discussion include those of cloud-native OLTP (data organization, replica consistency, main/standby synchronization, failure recovery, and mixed workload processing) and those of cloud-native OLAP (storage management, query processing, serverless-aware compute, data protection, and machine learning optimization). At last, the study summarizes the technical challenges for existing cloud-native databases and suggests the directions for future research. © 2024 Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:899 / 926
页数:27
相关论文
共 62 条
  • [1] Global database management system (DBMS) market outlook, (2023)
  • [2] Database-as-a-service (DBaaS): Global strategic business report, (2023)
  • [3] China database industry market depth research and investment outlook report on 2023–2028, (2022)
  • [4] Li GL, Dong HW, Zhang C., Cloud databases: New techniques, challenges, and opportunities, Proc. of the VLDB Endowment, 15, 12, pp. 3758-3761, (2022)
  • [5] Mathew S, Varia J., Overview of Amazon Web services, Amazon Whitepapers, 105, pp. 1-22, (2014)
  • [6] Mazumdar P, Agarwal S, Banerjee A, Mazumdar P, Agarwal S, Banerjee A., Azure SQL database, Pro SQL Server on Microsoft Azure, pp. 129-156, (2016)
  • [7] Krishnan SP, Gonzalez JL, Krishnan SP, Gonzalez JL., Google Cloud SQL. Building Your Next Big Thing with Google Cloud Platform: A Guide for Developers and Enterprise Architects, (2015)
  • [8] Verbitski A, Gupta A, Saha D, Brahmadesam M, Gupta K, Mittal R, Krishnamurthy S, Maurice S, Kharatishvili T, Bao XF., Amazon aurora: Design considerations for high throughput cloud-native relational databases, Proc. of the 2017 ACM Int’l Conf. on Management of Data, pp. 1041-1052, (2017)
  • [9] Dageville B, Cruanes T, Zukowski M, Antonov V, Avanes A, Bock J, Claybaugh J, Engovatov D, Hentschel M, Huang JS, Lee AW, Motivala A, Munir AQ, Pelley S, Povinec P, Rahn G, Triantafyllis S, Unterbrunner P., The snowflake elastic data warehouse, Proc. of the 2016 Int’l Conf. on Management of Data, pp. 215-226, (2016)
  • [10] Bisong E., Google BigQuery, Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners, pp. 485-517, (2019)