WATuning: A Workload-Aware Tuning System with Attention-Based Deep Reinforcement Learning

被引:10
|
作者
Ge, Jia-Ke [1 ,2 ]
Chai, Yan-Feng [2 ,3 ]
Chai, Yun-Peng [1 ,2 ]
机构
[1] Renmin Univ China, Key Lab Data Engn & Knowledge Engn Minist Educ, Beijing 100872, Peoples R China
[2] Renmin Univ China, Sch Informat, Beijing 100872, Peoples R China
[3] Taiyuan Univ Sci & Technol, Coll Comp Sci & Technol, Taiyuan 030027, Peoples R China
基金
中国国家自然科学基金;
关键词
attention mechanism; auto-tuning system; reinforcement learning (RL); workload-aware; TIME;
D O I
10.1007/s11390-021-1350-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Configuration tuning is essential to optimize the performance of systems (e.g., databases, key-value stores). High performance usually indicates high throughput and low latency. At present, most of the tuning tasks of systems are performed artificially (e.g., by database administrators), but it is hard for them to achieve high performance through tuning in various types of systems and in various environments. In recent years, there have been some studies on tuning traditional database systems, but all these methods have some limitations. In this article, we put forward a tuning system based on attention-based deep reinforcement learning named WATuning, which can adapt to the changes of workload characteristics and optimize the system performance efficiently and effectively. Firstly, we design the core algorithm named ATT-Tune for WATuning to achieve the tuning task of systems. The algorithm uses workload characteristics to generate a weight matrix and acts on the internal metrics of systems, and then ATT-Tune uses the internal metrics with weight values assigned to select the appropriate configuration. Secondly, WATuning can generate multiple instance models according to the change of the workload so that it can complete targeted recommendation services for different types of workloads. Finally, WATuning can also dynamically fine-tune itself according to the constantly changing workload in practical applications so that it can better fit to the actual environment to make recommendations. The experimental results show that the throughput and the latency of WATuning are improved by 52.6% and decreased by 31%, respectively, compared with the throughput and the latency of CDBTune which is an existing optimal tuning method.
引用
收藏
页码:741 / 761
页数:21
相关论文
共 50 条
  • [21] K2vTune: A workload-aware configuration tuning for RocksDB
    Lee, Jieun
    Seo, Sangmin
    Choi, Jonghwan
    Park, Sanghyun
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (01)
  • [22] Ensemble CNN Attention-Based BiLSTM Deep Learning Architecture for Multivariate Cloud Workload Prediction
    Kaim, Ananya
    Singh, Surjit
    Patel, Yashwant Singh
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, ICDCN 2023, 2023, : 342 - 348
  • [23] Attention-Based Highway Safety Planner for Autonomous Driving via Deep Reinforcement Learning
    Chen, Guoxi
    Zhang, Ya
    Li, Xinde
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (01) : 162 - 175
  • [24] Attention-based Deep Reinforcement Learning Model for Pair-Wise Interaction Recommendation
    Zhang, Chenyan
    Guan, Huanmei
    Li, Ni
    2019 6TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2019), 2019, : 169 - 174
  • [25] Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning
    Chiu, Zih-Yun
    Tuan, Yi-Lin
    Wang, William Yang
    Yip, Michael C.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [26] Attention-based Hierarchical Deep Reinforcement Learning for Lane Change Behaviors in Autonomous Driving
    Chen, Yilun
    Dong, Chiyu
    Palanisamy, Praveen
    Mudalige, Priyantha
    Muelling, Katharina
    Dolan, John M.
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 3697 - 3703
  • [27] Attention-based Hierarchical Deep Reinforcement Learning for Lane Change Behaviors in Autonomous Driving
    Chen, Yilun
    Dong, Chiyu
    Palanisamy, Praveen
    Mudalige, Priyantha
    Muelling, Katharina
    Dolan, John M.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1326 - 1334
  • [28] Workload-Aware DRAM Error Prediction using Machine Learning
    Mukhanov, Lev
    Tovletoglou, Konstantinos
    Vandierendonck, Hans
    Nikolopoulos, Dimitrios S.
    Karakonstantis, Georgios
    PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2019), 2019, : 106 - 118
  • [29] Federated learning with workload-aware client scheduling in heterogeneous systems
    Li, Li
    Liu, Duo
    Duan, Moming
    Zhang, Yu
    Ren, Ao
    Chen, Xianzhang
    Tan, Yujuan
    Wang, Chengliang
    NEURAL NETWORKS, 2022, 154 : 560 - 573
  • [30] A Heterogeneous Acceleration System for Attention-Based Multi-Agent Reinforcement Learning
    Wiggins, Samuel
    Meng, Yuan
    Iyer, Mahesh A.
    Prasanna, Viktor
    2024 34TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL 2024, 2024, : 236 - 242