Multiagent Meta-Reinforcement Learning for Optimized Task Scheduling in Heterogeneous Edge Computing Systems

被引:7
|
作者
Niu, Liwen [1 ]
Chen, Xianfu [2 ]
Zhang, Ning [3 ]
Zhu, Yongdong [4 ]
Yin, Rui [5 ]
Wu, Celimuge [6 ,7 ]
Cao, Yangjie [1 ]
机构
[1] Zhengzhou Univ, Sch Cyber Sci & Engn, Zhengzhou 450001, Peoples R China
[2] VTT Tech Res Ctr Finland, Oulu 90570, Finland
[3] Univ Windsor, Dept Elect & Comp Engn, Windsor, ON N9B 3P4, Canada
[4] Zhejiang Lab, Intelligent Network Res, Hangzhou 311121, Peoples R China
[5] Zhejiang Univ City Coll, Informat Sci & Elect Engn, Hangzhou 310015, Peoples R China
[6] Univ Electro Commun, Grad Sch Informat & Engn, Tokyo 1828585, Japan
[7] Univ Electro Commun, Meta Networking Res Ctr, Tokyo 1828585, Japan
基金
中国国家自然科学基金;
关键词
Wireless fidelity; Task analysis; Processor scheduling; Edge computing; Servers; Scheduling; Training; Computation task scheduling; heterogeneous edge computing systems; Markov decision process (MDP); meta-learning; multiagent proximal policy optimization (PPO); RESOURCE-ALLOCATION;
D O I
10.1109/JIOT.2023.3241222
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mobile-edge computing (MEC) brings the potential to address the ever increasing computation demands from the mobile users (MUs). In addition to local processing, the resource-constrained MUs in an MEC system can also offload computation to the nearby servers for remote execution. With the explosive growth of mobile devices, computation offloading faces the challenge of spectrum congestion, which, in turn, deteriorates the overall quality of computation experience. This article, hence, investigates computation task scheduling in a heterogeneous cellular and WiFi MEC system. Such a system provides both licensed and unlicensed spectrum opportunities. Due to the sharing of communication and computation resources as well as the uncertainties, we formulate the problem of computation task scheduling among the competing MUs in a stationary heterogeneous edge computing system as a noncooperative stochastic game. We propose an approximation-based multiagent Markov decision process without the global system state observations, under which a multiagent proximal policy optimization (PPO) algorithm is derived to solve the corresponding Nash equilibrium. When expanding to a nonstationary heterogeneous edge computing system, the obtained algorithm suffers from the slow convergence due to constrained adaptability. Accordingly, we explore meta-learning and propose a multiagent meta-PPO algorithm, which rapidly adapts the control policy learning to the nonstationarity. Numerical experiments demonstrate performance gains from our proposed algorithms.
引用
收藏
页码:10519 / 10531
页数:13
相关论文
共 50 条
  • [31] Meta Reinforcement Learning for Multi-Task Offloading in Vehicular Edge Computing
    Dai, Penglin
    Huang, Yaorong
    Hu, Kaiwen
    Wu, Xiao
    Xing, Huanlai
    Yu, Zhaofei
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (03) : 2123 - 2138
  • [32] Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning
    Wang, Mingyang
    Bing, Zhenshan
    Yao, Xiangtong
    Wang, Shuai
    Kai, Huang
    Su, Hang
    Yang, Chenguang
    Knoll, Alois
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10157 - 10165
  • [33] Deep Reinforcement Learning based Task Scheduling Scheme in Mobile Edge Computing Network
    Zhao, Qi
    Feng, Mingjie
    Li, Li
    Li, Yi
    Liu, Hang
    Chen, Genshe
    SENSORS AND SYSTEMS FOR SPACE APPLICATIONS XIV, 2021, 11755
  • [34] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
    Yuan, Haoqi
    Lu, Zongqing
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [35] DDDQN-TS: A task scheduling and load balancing method based on optimized deep reinforcement learning in heterogeneous computing environment
    Sun, Changyong
    Yang, Tan
    Lei, Youxun
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 9138 - 9172
  • [36] On task matching and scheduling in heterogeneous computing systems
    Chuang, PJ
    Wei, CH
    PDPTA'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, 2001, : 901 - 907
  • [37] Contrastive meta-reinforcement learning for heterogeneous graph neural architecture search
    Xu, Zixuan
    Wu, Jia
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 260
  • [38] Reinforcement Learning With Task Decomposition for Cooperative Multiagent Systems
    Sun, Changyin
    Liu, Wenzhang
    Dong, Lu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 2054 - 2065
  • [39] Harnessing Meta-Reinforcement Learning for Enhanced Tracking in Geofencing Systems
    Famili, Alireza
    Sun, Shihua
    Atalay, Tolga
    Stavrou, Angelos
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2025, 6 : 944 - 960
  • [40] Adaptive Inference Reinforcement Learning for Task Offloading in Vehicular Edge Computing Systems
    Tang, Dian
    Zhang, Xuefei
    Li, Meng
    Tao, Xiaofeng
    2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,