Multiagent Meta-Reinforcement Learning for Optimized Task Scheduling in Heterogeneous Edge Computing Systems

被引:7
|
作者
Niu, Liwen [1 ]
Chen, Xianfu [2 ]
Zhang, Ning [3 ]
Zhu, Yongdong [4 ]
Yin, Rui [5 ]
Wu, Celimuge [6 ,7 ]
Cao, Yangjie [1 ]
机构
[1] Zhengzhou Univ, Sch Cyber Sci & Engn, Zhengzhou 450001, Peoples R China
[2] VTT Tech Res Ctr Finland, Oulu 90570, Finland
[3] Univ Windsor, Dept Elect & Comp Engn, Windsor, ON N9B 3P4, Canada
[4] Zhejiang Lab, Intelligent Network Res, Hangzhou 311121, Peoples R China
[5] Zhejiang Univ City Coll, Informat Sci & Elect Engn, Hangzhou 310015, Peoples R China
[6] Univ Electro Commun, Grad Sch Informat & Engn, Tokyo 1828585, Japan
[7] Univ Electro Commun, Meta Networking Res Ctr, Tokyo 1828585, Japan
基金
中国国家自然科学基金;
关键词
Wireless fidelity; Task analysis; Processor scheduling; Edge computing; Servers; Scheduling; Training; Computation task scheduling; heterogeneous edge computing systems; Markov decision process (MDP); meta-learning; multiagent proximal policy optimization (PPO); RESOURCE-ALLOCATION;
D O I
10.1109/JIOT.2023.3241222
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mobile-edge computing (MEC) brings the potential to address the ever increasing computation demands from the mobile users (MUs). In addition to local processing, the resource-constrained MUs in an MEC system can also offload computation to the nearby servers for remote execution. With the explosive growth of mobile devices, computation offloading faces the challenge of spectrum congestion, which, in turn, deteriorates the overall quality of computation experience. This article, hence, investigates computation task scheduling in a heterogeneous cellular and WiFi MEC system. Such a system provides both licensed and unlicensed spectrum opportunities. Due to the sharing of communication and computation resources as well as the uncertainties, we formulate the problem of computation task scheduling among the competing MUs in a stationary heterogeneous edge computing system as a noncooperative stochastic game. We propose an approximation-based multiagent Markov decision process without the global system state observations, under which a multiagent proximal policy optimization (PPO) algorithm is derived to solve the corresponding Nash equilibrium. When expanding to a nonstationary heterogeneous edge computing system, the obtained algorithm suffers from the slow convergence due to constrained adaptability. Accordingly, we explore meta-learning and propose a multiagent meta-PPO algorithm, which rapidly adapts the control policy learning to the nonstationarity. Numerical experiments demonstrate performance gains from our proposed algorithms.
引用
收藏
页码:10519 / 10531
页数:13
相关论文
共 50 条
  • [41] Deep Reinforcement Learning for Task Offloading in Edge Computing
    Xie, Bo
    Cui, Haixia
    2024 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND INTELLIGENT SYSTEMS ENGINEERING, MLISE 2024, 2024, : 250 - 254
  • [42] Meta-Reinforcement Learning for Adaptive Control of Second Order Systems
    McClement, Daniel G.
    Lawrence, Nathan P.
    Forbes, Michael G.
    Loewen, Philip D.
    Backstrom, Johan U.
    Gopaluni, R. Bhushan
    2022 IEEE INTERNATIONAL SYMPOSIUM ON ADVANCED CONTROL OF INDUSTRIAL PROCESSES (ADCONIP 2022), 2022, : 78 - 83
  • [43] A Kind of Reinforcement Learning to Improve Genetic Algorithm for Multiagent Task Scheduling
    Li, Zhipeng
    Wei, Xiumei
    Jiang, Xuesong
    Pang, Yewen
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [44] Real-Time Microgrid Energy Scheduling Using Meta-Reinforcement Learning
    Shen, Huan
    Shen, Xingfa
    Chen, Yiming
    ENERGIES, 2024, 17 (10)
  • [45] Software-Defined Heterogeneous Edge Computing Network Resource Scheduling Based on Reinforcement Learning
    Li, Yaofang
    Wu, Bin
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [46] Adaptable Image Quality Assessment Using Meta-Reinforcement Learning of Task Amenability
    Saeed, Shaheer U.
    Fu, Yunguan
    Stavrinides, Vasilis
    Baum, Zachary M. C.
    Yang, Qianye
    Rusu, Mirabela
    Fan, Richard E.
    Sonn, Geoffrey A.
    Noble, J. Alison
    Barratt, Dean C.
    Hu, Yipeng
    SIMPLIFYING MEDICAL ULTRASOUND, 2021, 12967 : 191 - 201
  • [47] Meta-Reinforcement Learning-Based Transferable Scheduling Strategy for Energy Management
    Xiong, Luolin
    Tang, Yang
    Liu, Chensheng
    Mao, Shuai
    Meng, Ke
    Dong, Zhaoyang
    Qian, Feng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (04) : 1685 - 1695
  • [48] Fast Adaptive Task Offloading and Resource Allocation via Multiagent Reinforcement Learning in Heterogeneous Vehicular Fog Computing
    Gao, Zhen
    Yang, Lei
    Dai, Yu
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (08) : 6818 - 6835
  • [49] On benchmarking task scheduling algorithms for heterogeneous computing systems
    Ashish Kumar Maurya
    Anil Kumar Tripathi
    The Journal of Supercomputing, 2018, 74 : 3039 - 3070
  • [50] MAPPING AND SCHEDULING WITH TASK CLUSTERING FOR HETEROGENEOUS COMPUTING SYSTEMS
    Lam, Y. M.
    Coutinho, J. G. F.
    Luk, W.
    Leong, P. H. W.
    2008 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE AND LOGIC APPLICATIONS, VOLS 1 AND 2, 2008, : 275 - +