Computing Over the Sky: Joint UAV Trajectory and Task Offloading Scheme Based on Optimization-Embedding Multi-Agent Deep Reinforcement Learning

被引:8
|
作者
Li, Xuanheng [1 ]
Du, Xinyang [1 ]
Zhao, Nan [1 ]
Wang, Xianbin [2 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China
[2] Western Univ, Dept Elect & Comp Engn, London, ON N6A 5B9, Canada
基金
中国国家自然科学基金;
关键词
Autonomous aerial vehicles; Task analysis; Trajectory; Heuristic algorithms; Delays; Reinforcement learning; Resource management; Unmanned aerial vehicle; mobile edge computing; computation offloading; trajectory control; reinforcement learning; RESOURCE-ALLOCATION; ENERGY EFFICIENCY; ALGORITHM; NETWORKS; TIME;
D O I
10.1109/TCOMM.2023.3331029
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned aerial vehicle (UAV)-assisted mobile edge computing (MEC) has emerged to support computation-intensive tasks in 6G systems. Since the battery capacity of a UAV is limited, to serve as many users as possible, a joint design on UAV trajectory and offloading strategy with consideration for service fairness is essential to provide energy-efficient computation offloading to the users in UAV-MEC networks. Unfortunately, such a joint decision-making problem is not straightforward due to various task types required from users and various functionalities of different UAVs enabled by different application programs. Considering the above issues, we take energy efficiency and service fairness as the objective, and propose a Multi-Agent Energy-Efficient joint Trajectory and Computation Offloading (MA-ETCO) scheme. To adapt to dynamic demands of users, we develop an optimization-embedding multi-agent deep reinforcement learning (OMADRL) algorithm. Each UAV autonomously learns the trajectory control decision based on MADRL to adapt to dynamic demands. Then, it will obtain the optimal computation offloading decision by solving a mixed-integer nonlinear programming problem. The computation offloading result, in turn, will be used as an indicator to guide UAVs' trajectory design. Compared to relying solely on deep reinforcement learning, such an optimization-embedding way reduces action space dimension and improves convergence efficiency.
引用
收藏
页码:1355 / 1369
页数:15
相关论文
共 50 条
  • [21] Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning
    Zhou, Xuanhan
    Xiong, Jun
    Zhao, Haitao
    Liu, Xiaoran
    Ren, Baoquan
    Zhang, Xiaochen
    Wei, Jibo
    Yin, Hao
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (03)
  • [22] Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning
    Xuanhan ZHOU
    Jun XIONG
    Haitao ZHAO
    Xiaoran LIU
    Baoquan REN
    Xiaochen ZHANG
    Jibo WEI
    Hao YIN
    ScienceChina(InformationSciences), 2024, 67 (03) : 225 - 245
  • [23] Multi-agent reinforcement learning for vehicular task offloading with multi-step trajectory prediction
    Zhang, Xinyi
    Zhu, Yanmin
    Wang, Chunyang
    Cao, Jian
    Chen, Yirong
    Wang, Jie
    CCF TRANSACTIONS ON PERVASIVE COMPUTING AND INTERACTION, 2024, 6 (02) : 101 - 114
  • [24] Multi-agent Deep Reinforcement Learning-based Trajectory Design for UAV-aided Edge Computing System
    Lu, Gengyuan
    Chang, Zheng
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [25] Multi-agent Computation Offloading in UAV Assisted MEC via Deep Reinforcement Learning
    He, Hang
    Ren, Tao
    Qiu, Yuan
    Hu, Zheyuan
    Li, Yanqi
    SMART COMPUTING AND COMMUNICATION, 2022, 13202 : 416 - 426
  • [26] Multi-Agent Reinforcement Learning for Cooperative Task Offloading in Distributed Edge Cloud Computing
    Ding, Shiyao
    Lin, Donghui
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (05) : 936 - 945
  • [27] Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks
    Chen, Binqiang
    Liu, Dong
    Hanzo, Lajos
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 3983 - 3988
  • [28] Task Offloading and Trajectory Optimization in UAV Networks: A Deep Reinforcement Learning Method Based on SAC and A-Star
    Liu, Jianhua
    Xie, Peng
    Liu, Jiajia
    Tu, Xiaoguang
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 141 (02): : 1243 - 1273
  • [29] Task offloading in hybrid-decision-based multi-cloud computing network: a cooperative multi-agent deep reinforcement learning
    Juan Chen
    Peng Chen
    Xianhua Niu
    Zongling Wu
    Ling Xiong
    Canghong Shi
    Journal of Cloud Computing, 11
  • [30] Task offloading in hybrid-decision-based multi-cloud computing network: a cooperative multi-agent deep reinforcement learning
    Chen, Juan
    Chen, Peng
    Niu, Xianhua
    Wu, Zongling
    Xiong, Ling
    Shi, Canghong
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01):