Computing Over the Sky: Joint UAV Trajectory and Task Offloading Scheme Based on Optimization-Embedding Multi-Agent Deep Reinforcement Learning

被引：8

作者：

Li, Xuanheng ^{[1
]}

Du, Xinyang ^{[1
]}

Zhao, Nan ^{[1
]}

Wang, Xianbin ^{[2
]}

机构：

[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China

[2] Western Univ, Dept Elect & Comp Engn, London, ON N6A 5B9, Canada

来源：

IEEE TRANSACTIONS ON COMMUNICATIONS | 2024年 / 72卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Autonomous aerial vehicles; Task analysis; Trajectory; Heuristic algorithms; Delays; Reinforcement learning; Resource management; Unmanned aerial vehicle; mobile edge computing; computation offloading; trajectory control; reinforcement learning; RESOURCE-ALLOCATION; ENERGY EFFICIENCY; ALGORITHM; NETWORKS; TIME;

D O I：

10.1109/TCOMM.2023.3331029

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Unmanned aerial vehicle (UAV)-assisted mobile edge computing (MEC) has emerged to support computation-intensive tasks in 6G systems. Since the battery capacity of a UAV is limited, to serve as many users as possible, a joint design on UAV trajectory and offloading strategy with consideration for service fairness is essential to provide energy-efficient computation offloading to the users in UAV-MEC networks. Unfortunately, such a joint decision-making problem is not straightforward due to various task types required from users and various functionalities of different UAVs enabled by different application programs. Considering the above issues, we take energy efficiency and service fairness as the objective, and propose a Multi-Agent Energy-Efficient joint Trajectory and Computation Offloading (MA-ETCO) scheme. To adapt to dynamic demands of users, we develop an optimization-embedding multi-agent deep reinforcement learning (OMADRL) algorithm. Each UAV autonomously learns the trajectory control decision based on MADRL to adapt to dynamic demands. Then, it will obtain the optimal computation offloading decision by solving a mixed-integer nonlinear programming problem. The computation offloading result, in turn, will be used as an indicator to guide UAVs' trajectory design. Compared to relying solely on deep reinforcement learning, such an optimization-embedding way reduces action space dimension and improves convergence efficiency.

引用

页码：1355 / 1369

页数：15

共 50 条

[21] Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning
Zhou, Xuanhan
Xiong, Jun
Zhao, Haitao
Liu, Xiaoran
Ren, Baoquan
Zhang, Xiaochen
Wei, Jibo
Yin, Hao
SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (03)
[22] Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning
Xuanhan ZHOU
Jun XIONG
Haitao ZHAO
Xiaoran LIU
Baoquan REN
Xiaochen ZHANG
Jibo WEI
Hao YIN
ScienceChina(InformationSciences), 2024, 67 (03) : 225 - 245
[23] Multi-agent reinforcement learning for vehicular task offloading with multi-step trajectory prediction
Zhang, Xinyi
Zhu, Yanmin
Wang, Chunyang
Cao, Jian
Chen, Yirong
Wang, Jie
CCF TRANSACTIONS ON PERVASIVE COMPUTING AND INTERACTION, 2024, 6 (02) : 101 - 114
[24] Multi-agent Deep Reinforcement Learning-based Trajectory Design for UAV-aided Edge Computing System
Lu, Gengyuan
Chang, Zheng
2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
[25] Multi-agent Computation Offloading in UAV Assisted MEC via Deep Reinforcement Learning
He, Hang
Ren, Tao
Qiu, Yuan
Hu, Zheyuan
Li, Yanqi
SMART COMPUTING AND COMMUNICATION, 2022, 13202 : 416 - 426
[26] Multi-Agent Reinforcement Learning for Cooperative Task Offloading in Distributed Edge Cloud Computing
Ding, Shiyao
Lin, Donghui
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (05) : 936 - 945
[27] Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks
Chen, Binqiang
Liu, Dong
Hanzo, Lajos
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 3983 - 3988
[28] Task Offloading and Trajectory Optimization in UAV Networks: A Deep Reinforcement Learning Method Based on SAC and A-Star
Liu, Jianhua
Xie, Peng
Liu, Jiajia
Tu, Xiaoguang
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 141 (02): : 1243 - 1273
[29] Task offloading in hybrid-decision-based multi-cloud computing network: a cooperative multi-agent deep reinforcement learning
Juan Chen
Peng Chen
Xianhua Niu
Zongling Wu
Ling Xiong
Canghong Shi
Journal of Cloud Computing, 11
[30] Task offloading in hybrid-decision-based multi-cloud computing network: a cooperative multi-agent deep reinforcement learning
Chen, Juan
Chen, Peng
Niu, Xianhua
Wu, Zongling
Xiong, Ling
Shi, Canghong
JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01):

← 1 2 3 4 5 →