WATuning: A Workload-Aware Tuning System with Attention-Based Deep Reinforcement Learning

被引：10

作者：

Ge, Jia-Ke ^{[1
,2
]}

Chai, Yan-Feng ^{[2
,3
]}

Chai, Yun-Peng ^{[1
,2
]}

机构：

[1] Renmin Univ China, Key Lab Data Engn & Knowledge Engn Minist Educ, Beijing 100872, Peoples R China

[2] Renmin Univ China, Sch Informat, Beijing 100872, Peoples R China

[3] Taiyuan Univ Sci & Technol, Coll Comp Sci & Technol, Taiyuan 030027, Peoples R China

来源：

JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY | 2021年 / 36卷 / 04期

基金：

中国国家自然科学基金;

关键词：

attention mechanism; auto-tuning system; reinforcement learning (RL); workload-aware; TIME;

D O I：

10.1007/s11390-021-1350-8

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Configuration tuning is essential to optimize the performance of systems (e.g., databases, key-value stores). High performance usually indicates high throughput and low latency. At present, most of the tuning tasks of systems are performed artificially (e.g., by database administrators), but it is hard for them to achieve high performance through tuning in various types of systems and in various environments. In recent years, there have been some studies on tuning traditional database systems, but all these methods have some limitations. In this article, we put forward a tuning system based on attention-based deep reinforcement learning named WATuning, which can adapt to the changes of workload characteristics and optimize the system performance efficiently and effectively. Firstly, we design the core algorithm named ATT-Tune for WATuning to achieve the tuning task of systems. The algorithm uses workload characteristics to generate a weight matrix and acts on the internal metrics of systems, and then ATT-Tune uses the internal metrics with weight values assigned to select the appropriate configuration. Secondly, WATuning can generate multiple instance models according to the change of the workload so that it can complete targeted recommendation services for different types of workloads. Finally, WATuning can also dynamically fine-tune itself according to the constantly changing workload in practical applications so that it can better fit to the actual environment to make recommendations. The experimental results show that the throughput and the latency of WATuning are improved by 52.6% and decreased by 31%, respectively, compared with the throughput and the latency of CDBTune which is an existing optimal tuning method.

引用

页码：741 / 761

页数：21

共 50 条

[21] K2vTune: A workload-aware configuration tuning for RocksDB
Lee, Jieun
Seo, Sangmin
Choi, Jonghwan
Park, Sanghyun
INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (01)
[22] Ensemble CNN Attention-Based BiLSTM Deep Learning Architecture for Multivariate Cloud Workload Prediction
Kaim, Ananya
Singh, Surjit
Patel, Yashwant Singh
PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, ICDCN 2023, 2023, : 342 - 348
[23] Attention-Based Highway Safety Planner for Autonomous Driving via Deep Reinforcement Learning
Chen, Guoxi
Zhang, Ya
Li, Xinde
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (01) : 162 - 175
[24] Attention-based Deep Reinforcement Learning Model for Pair-Wise Interaction Recommendation
Zhang, Chenyan
Guan, Huanmei
Li, Ni
2019 6TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2019), 2019, : 169 - 174
[25] Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning
Chiu, Zih-Yun
Tuan, Yi-Lin
Wang, William Yang
Yip, Michael C.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[26] Attention-based Hierarchical Deep Reinforcement Learning for Lane Change Behaviors in Autonomous Driving
Chen, Yilun
Dong, Chiyu
Palanisamy, Praveen
Mudalige, Priyantha
Muelling, Katharina
Dolan, John M.
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 3697 - 3703
[27] Attention-based Hierarchical Deep Reinforcement Learning for Lane Change Behaviors in Autonomous Driving
Chen, Yilun
Dong, Chiyu
Palanisamy, Praveen
Mudalige, Priyantha
Muelling, Katharina
Dolan, John M.
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1326 - 1334
[28] Workload-Aware DRAM Error Prediction using Machine Learning
Mukhanov, Lev
Tovletoglou, Konstantinos
Vandierendonck, Hans
Nikolopoulos, Dimitrios S.
Karakonstantis, Georgios
PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2019), 2019, : 106 - 118
[29] Federated learning with workload-aware client scheduling in heterogeneous systems
Li, Li
Liu, Duo
Duan, Moming
Zhang, Yu
Ren, Ao
Chen, Xianzhang
Tan, Yujuan
Wang, Chengliang
NEURAL NETWORKS, 2022, 154 : 560 - 573
[30] A Heterogeneous Acceleration System for Attention-Based Multi-Agent Reinforcement Learning
Wiggins, Samuel
Meng, Yuan
Iyer, Mahesh A.
Prasanna, Viktor
2024 34TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL 2024, 2024, : 236 - 242

← 1 2 3 4 5 →