Reinforcement Learning-Based Resource Allocation and Energy Efficiency Optimization for a Space-Air-Ground-Integrated Network

被引:2
|
作者
Chen, Zhiyu [1 ]
Zhou, Hongxi [1 ]
Du, Siyuan [2 ]
Liu, Jiayan [2 ]
Zhang, Luyang [2 ]
Liu, Qi [3 ]
机构
[1] State Grid Informat & Telecommun Branch, Beijing 100761, Peoples R China
[2] North China Elect Power Univ, Sch Elect & Elect Engn, Beijing 102206, Peoples R China
[3] Beijing FibrLink Commun Co Ltd, Beijing 100071, Peoples R China
关键词
space-air-ground-integrated network (SAGIN); Low Earth Orbit (LEO) satellites; dynamic resource allocation; multi-agent reinforcement learning (RL); Markov Decision Process (MDP); K-armed bandit; POWER; INTERNET; IOT;
D O I
10.3390/electronics13091792
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the construction and development of the smart grid, the power business puts higher requirements on the communication capability of the network. In order to improve the energy efficiency of the space-air-ground-integrated power three-dimensional fusion communication network, we establish an optimization problem for joint air platform (AP) flight path selection, ground power facility (GPF) association, and power control. In solving the problem, we decompose the problem into two subproblems, one is the AP flight path selection subproblem and the other is the GPF association and power control subproblem. Firstly, based on the GPF distribution and throughput weights, we model the AP flight path selection subproblem as a Markov Decision Process (MDP) and propose a multi-agent iterative optimization algorithm based on the comprehensive judgment of GPF positions and workload. Secondly, we model the GPF association and power control subproblem as a multi-agent, time-varying K-armed bandit model and propose an algorithm based on multi-agent Temporal Difference (TD) learning. Then, by alternately iterating between the two subproblems, we propose a reinforcement learning (RL)-based joint optimization algorithm. Finally, the simulation results indicate that compared to the three baseline algorithms (random path, average transmit power, and random device association), the proposed algorithm improves an overall energy efficiency of the system of 16.23%, 86.29%, and 5.11% under various conditions (including different noise power levels, GPF bandwidth, and GPF quantities), respectively.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Learning-Based Queue-Aware Task Offloading and Resource Allocation for Space-Air-Ground-Integrated Power IoT
    Liao, Haijun
    Zhou, Zhenyu
    Zhao, Xiongwen
    Wang, Yang
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (07): : 5250 - 5263
  • [2] A Resource Allocation Algorithm for Space-Air-Ground Integrated Network Based on Deep Reinforcement Learning
    Liu, Xuefang
    Mao, Weihao
    Yang, Qinghai
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (07): : 2831 - 2841
  • [3] Dynamic SFC Embedding Algorithm Assisted by Federated Learning in Space-Air-Ground-Integrated Network Resource Allocation Scenario
    Zhang, Peiying
    Zhang, Yi
    Kumar, Neeraj
    Guizani, Mohsen
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (11) : 9308 - 9318
  • [4] Deep Reinforcement Learning-Based Energy Minimization Task Offloading and Resource Allocation for Air Ground Integrated Heterogeneous Networks
    Qin, Peng
    Wang, Shuo
    Lu, Zhou
    Xie, Yuanbo
    Zhao, Xiongwen
    IEEE SYSTEMS JOURNAL, 2023, 17 (03): : 4958 - 4968
  • [5] Graphic Deep Reinforcement Learning for Dynamic Resource Allocation in Space-Air-Ground Integrated Networks
    Cai, Yue
    Cheng, Peng
    Chen, Zhuo
    Xiang, Wei
    Vucetic, Branka
    Li, Yonghui
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2025, 43 (01) : 334 - 349
  • [6] Deep Reinforcement Learning-Based Resource Allocation for Integrated Sensing, Communication, and Computation in Vehicular Network
    Yang, Liu
    Wei, Yifei
    Feng, Zhiyong
    Zhang, Qixun
    Han, Zhu
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (12) : 18608 - 18622
  • [7] Reinforcement Learning-Based Intelligent Resource Allocation for Integrated VLCP Systems
    Yang, Helin
    Du, Pengfei
    Zhong, Wen-De
    Chen, Chen
    Alphones, Arokiaswami
    Zhang, Sheng
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2019, 8 (04) : 1204 - 1207
  • [8] Reinforcement Learning-Based Resource Allocation for Streaming in a Multi-Modal Deep Space Network
    Ha, Taeyun
    Oh, Junsuk
    Lee, Donghyun
    Lee, Jeonghwa
    Jeon, Yongin
    Cho, Sungrae
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 201 - 206
  • [9] Demand-Driven Task Scheduling and Resource Allocation in Space-Air-Ground Integrated Network: A Deep Reinforcement Learning Approach
    Fan, Kexin
    Feng, Bowen
    Zhang, Xilin
    Zhang, Qinyu
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 13053 - 13067
  • [10] Distributed Deep Reinforcement Learning Assisted Resource Allocation Algorithm for Space-Air-Ground Integrated Networks
    Zhang, Peiying
    Li, Yuanjie
    Kumar, Neeraj
    Chen, Ning
    Hsu, Ching-Hsien
    Barnawi, Ahmed
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (03): : 3348 - 3358