Multiagent Meta-Reinforcement Learning for Optimized Task Scheduling in Heterogeneous Edge Computing Systems

Cited by: 7

Authors:

Niu, Liwen [1]

Chen, Xianfu [2]

Zhang, Ning [3]

Zhu, Yongdong [4]

Yin, Rui [5]

Wu, Celimuge [6, 7]

Cao, Yangjie [1]

Affiliations:

[1] Zhengzhou Univ, Sch Cyber Sci & Engn, Zhengzhou 450001, Peoples R China

[2] VTT Tech Res Ctr Finland, Oulu 90570, Finland

[3] Univ Windsor, Dept Elect & Comp Engn, Windsor, ON N9B 3P4, Canada

[4] Zhejiang Lab, Intelligent Network Res, Hangzhou 311121, Peoples R China

[5] Zhejiang Univ City Coll, Informat Sci & Elect Engn, Hangzhou 310015, Peoples R China

[6] Univ Electro Commun, Grad Sch Informat & Engn, Tokyo 1828585, Japan

[7] Univ Electro Commun, Meta Networking Res Ctr, Tokyo 1828585, Japan

Source:

IEEE INTERNET OF THINGS JOURNAL | 2023 / Vol. 10 / No. 12

Funding:

National Natural Science Foundation of China

Keywords:

Wireless fidelity; Task analysis; Processor scheduling; Edge computing; Servers; Scheduling; Training; Computation task scheduling; heterogeneous edge computing systems; Markov decision process (MDP); meta-learning; multiagent proximal policy optimization (PPO); RESOURCE-ALLOCATION;

DOI:

10.1109/JIOT.2023.3241222

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Mobile-edge computing (MEC) brings the potential to address the ever-increasing computation demands from mobile users (MUs). In addition to local processing, the resource-constrained MUs in an MEC system can also offload computation to nearby servers for remote execution. With the explosive growth of mobile devices, computation offloading faces the challenge of spectrum congestion, which, in turn, deteriorates the overall quality of computation experience. This article, hence, investigates computation task scheduling in a heterogeneous cellular and WiFi MEC system. Such a system provides both licensed and unlicensed spectrum opportunities. Due to the sharing of communication and computation resources as well as the uncertainties, we formulate the problem of computation task scheduling among the competing MUs in a stationary heterogeneous edge computing system as a noncooperative stochastic game. We propose an approximation-based multiagent Markov decision process without the global system state observations, under which a multiagent proximal policy optimization (PPO) algorithm is derived to solve for the corresponding Nash equilibrium. When expanding to a nonstationary heterogeneous edge computing system, the obtained algorithm suffers from slow convergence due to constrained adaptability. Accordingly, we explore meta-learning and propose a multiagent meta-PPO algorithm, which rapidly adapts the control policy learning to the nonstationarity. Numerical experiments demonstrate performance gains from our proposed algorithms.

Pages: 10519-10531

Page count: 13

