AI-based Database Performance Diagnosis

被引:0
|
作者
Jin L.-Y. [1 ]
Li G.-L. [1 ]
机构
[1] Department of Computer Science and Technology, Tsinghua University, Beijing
来源
Ruan Jian Xue Bao/Journal of Software | 2021年 / 32卷 / 03期
基金
中国国家自然科学基金;
关键词
Anomaly diagnosis; Artificial intelligence; OLTP database;
D O I
10.13328/j.cnki.jos.006177
中图分类号
学科分类号
摘要
Database is a kind of important and fundamental computer system software. With the development of database application in all walks of life, a growing number of people begin to concern the stability of the database. Because of the numerous internal of external effect, performance anomaly may emerge when the Database running and it may cause huge economic loss. People usually diagnose database anomaly by analyzing monitoring metrics. However, there are hundreds of metrics in the system and ordinary database users are unable to extract valuable information from them. Some major companies employ DBA to manage the databases but the cost is unacceptable for many other companies. Achieving automatic database monitor and diagnose with low cost is a challenging problem. Current methods have many limitations, including high cost of metrics information collection, narrow range of application or poor stability. This study proposes an anomaly diagnose framework AutoMonitor which is deployed on the PostgreSQL database. The framework contains LSTM-based anomaly detection module and modified K nearest-neighbor algorithm-based root cause diagnose module. Framework consists of an offline training and an online diagnose stage. The evaluations on the datasets show that the proposed framework has high diagnose accuracy with minor overload to system performance. © Copyright 2021, Institute of Software, the Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:845 / 858
页数:13
相关论文
共 18 条
  • [1] Dang Y, Lin Q, Huang P., AIOps: Real-world challenges and research innovations, Proc. of the 41st IEEE/ACM Int'l Conf. on Software Engineering: Companion (ICSE-Companion 2019), pp. 4-5, (2019)
  • [2] Cao W, Gao Y, Lin B, Feng X, Xie Y, Lou X, Wang P., TCPRT: Instrument and diagnostic analysis system for service quality of cloud databases at massive scale in real-time, Proc. of the SIGMOD, (2018)
  • [3] Xu Y, Sui K, Yao R, Zhang H, Lin Q, Dang Y, Li P, Jiang K, Zhang W, Lou JG, Chintalapati M, Zhang D., Improving service availability of cloud systems by predicting disk error, Proc. of the UsenixAtc, pp. 481-494, (2018)
  • [4] Zhou X, Peng X, Xie T, Sun J, Ji C, Liu D, Xiang Q, He C., Latent error prediction and fault localization for microservice applications by learning from system trace logs, Proc. of the 2019 27th ACM Joint Meeting European Software Engineering Conf. and Symp. on the Foundations of Software Engineering (ESEC/FSE 2019), pp. 683-694, (2019)
  • [5] Li GL, Zhou XH, Sun J, Yu X, Yuan HT, Liu JB, Han Y., A survey of machine-learning-based database techniques, Chinese Journal of Computers, (2019)
  • [6] Benoit DG., Automatic diagnosis of performance problems in database management systems, Proc. of the 2nd Int'l Conf. on Autonomic Computing (ICAC), pp. 326-327, (2005)
  • [7] Dias K, Ramacher M, Shaft U, Venkataramani V, Wood G., Automatic performance diagnosis and tuning in oracle, Proc. of the CIDR, (2005)
  • [8] Yoon DY, Niu N, Mozafari B., DBSherlock: A performance diagnostic tool for transactional databases, Proc. of the SIGMOD, (2016)
  • [9] Ma M, Yin Z, Zhang S, Wang S, Zheng C, Jiang X., Diagnosing root causes of intermittent slow queries in cloud databases, Proc. of the PVLDB Endowment, pp. 1176-1189, (2020)
  • [10] Borisov N, Uttamchandani S, Routray R, Singh A., Why did my query slow down?, Proc. of the CIDR, (2009)