Reinforcement Learning-Based Resource Allocation and Energy Efficiency Optimization for a Space-Air-Ground-Integrated Network

被引:2
|
作者
Chen, Zhiyu [1 ]
Zhou, Hongxi [1 ]
Du, Siyuan [2 ]
Liu, Jiayan [2 ]
Zhang, Luyang [2 ]
Liu, Qi [3 ]
机构
[1] State Grid Informat & Telecommun Branch, Beijing 100761, Peoples R China
[2] North China Elect Power Univ, Sch Elect & Elect Engn, Beijing 102206, Peoples R China
[3] Beijing FibrLink Commun Co Ltd, Beijing 100071, Peoples R China
关键词
space-air-ground-integrated network (SAGIN); Low Earth Orbit (LEO) satellites; dynamic resource allocation; multi-agent reinforcement learning (RL); Markov Decision Process (MDP); K-armed bandit; POWER; INTERNET; IOT;
D O I
10.3390/electronics13091792
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the construction and development of the smart grid, the power business puts higher requirements on the communication capability of the network. In order to improve the energy efficiency of the space-air-ground-integrated power three-dimensional fusion communication network, we establish an optimization problem for joint air platform (AP) flight path selection, ground power facility (GPF) association, and power control. In solving the problem, we decompose the problem into two subproblems, one is the AP flight path selection subproblem and the other is the GPF association and power control subproblem. Firstly, based on the GPF distribution and throughput weights, we model the AP flight path selection subproblem as a Markov Decision Process (MDP) and propose a multi-agent iterative optimization algorithm based on the comprehensive judgment of GPF positions and workload. Secondly, we model the GPF association and power control subproblem as a multi-agent, time-varying K-armed bandit model and propose an algorithm based on multi-agent Temporal Difference (TD) learning. Then, by alternately iterating between the two subproblems, we propose a reinforcement learning (RL)-based joint optimization algorithm. Finally, the simulation results indicate that compared to the three baseline algorithms (random path, average transmit power, and random device association), the proposed algorithm improves an overall energy efficiency of the system of 16.23%, 86.29%, and 5.11% under various conditions (including different noise power levels, GPF bandwidth, and GPF quantities), respectively.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Edge-Cloud Resource Scheduling in Space-Air-Ground-Integrated Networks for Internet of Vehicles
    Cao, Bin
    Zhang, Jintong
    Liu, Xin
    Sun, Zhiheng
    Cao, Wenxi
    Nowak, Robert M.
    Lv, Zhihan
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (08): : 5765 - 5772
  • [22] Reinforcement Learning-Based UAVs Resource Allocation for Integrated Sensing and Communication (ISAC) System
    Wang, Min
    Chen, Peng
    Cao, Zhenxin
    Chen, Yun
    ELECTRONICS, 2022, 11 (03)
  • [23] A Deep Reinforcement Learning-Based Dynamic Traffic Offloading in Space-Air-Ground Integrated Networks (SAGIN)
    Tang, Fengxiao
    Hofner, Hans
    Kato, Nei
    Kaneko, Kazuma
    Yamashita, Yasutaka
    Hangai, Masatake
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2022, 40 (01) : 276 - 289
  • [24] Energy-Efficient Resource Allocation for Space-Air-Ground Integrated Industrial Power Internet of Things Network
    Qin, Peng
    Zhao, Honghao
    Fu, Yang
    Geng, Suiyan
    Chen, Zhiyu
    Zhou, Hongxi
    Zhao, Xiongwen
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (04) : 5274 - 5284
  • [25] Location Hijacking Attack in Software-Defined Space-Air-Ground-Integrated Vehicular Network
    Wang, Jiadai
    Liu, Jiajia
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (08) : 5971 - 5981
  • [26] Energy Aware Space-Air-Ground Integrated Network Resource Orchestration Algorithm
    Zhang, Peiying
    Li, Zhiqiang
    Guizani, Mohsen
    Kumar, Neeraj
    Yu, Keping
    Wang, Jian
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (12) : 18950 - 18960
  • [27] Resource allocation for joint energy and spectral efficiency in cloud radio access network based on deep reinforcement learning
    Iqbal, Amjad
    Tham, Mau-Luen
    Chang, Yoong Choon
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (04)
  • [28] A Reinforcement Learning-Based Resource Allocation Scheme for Cloud Robotics
    Liu, Hang
    Liu, Shiwen
    Zheng, Kan
    IEEE ACCESS, 2018, 6 : 17215 - 17222
  • [29] AI-based 6G Space-Air-Ground Integrated Network Resource Allocation Algorithm
    Guan, Mingxiang
    Wu, Zhou
    Lv, Changwei
    Gan, Yuxi
    Guo, Bin
    Liu, Xing
    Lu, Chen
    Wang, Le
    Gao, Kuandong
    Feng, Yongpan
    2024 IEEE INTERNATIONAL WORKSHOP ON RADIO FREQUENCY AND ANTENNA TECHNOLOGIES, IWRF&AT 2024, 2024, : 340 - 344
  • [30] Physical Layer Security of HAPS-Based Space-Air-Ground-Integrated Network With Hybrid FSO/RF Communication
    Bankey, Vinay
    Sharma, Shubha
    Swaminathan, R.
    Madhukumar, A. S.
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (04) : 4680 - 4688