Distributed Multiagent Reinforcement Learning With Action Networks for Dynamic Economic Dispatch

被引：8

作者：

Hu, Chengfang ^{[1
]}

Wen, Guanghui ^{[2
]}

Wang, Shuai ^{[3
,4
]}

Fu, Junjie ^{[2
]}

Yu, Wenwu ^{[2
]}

机构：

[1] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China

[2] Southeast Univ, Sch Math, Dept Syst Sci, Nanjing 211189, Peoples R China

[3] Beihang Univ, Res Inst Frontier Sci, Beijing 100191, Peoples R China

[4] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Power demand; Heuristic algorithms; Prediction algorithms; Couplings; Approximation algorithms; Power system stability; Convex functions; Distributed optimization; dynamic economic dispatch; multiagent reinforcement learning (MARL); smart grids; VISIBLE IMAGE FUSION; PERFORMANCE; INFORMATION; ALGORITHM; PROTEIN;

D O I：

10.1109/TNNLS.2023.3234049

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A new class of distributed multiagent reinforcement learning (MARL) algorithm suitable for problems with coupling constraints is proposed in this article to address the dynamic economic dispatch problem (DEDP) in smart grids. Specifically, the assumption made commonly in most existing results on the DEDP that the cost functions are known and/or convex is removed in this article. A distributed projection optimization algorithm is designed for the generation units to find the feasible power outputs satisfying the coupling constraints. By using a quadratic function to approximate the state-action value function of each generation unit, the approximate optimal solution of the original DEDP can be obtained by solving a convex optimization problem. Then, each action network utilizes a neural network (NN) to learn the relationship between the total power demand and the optimal power output of each generation unit, such that the algorithm obtains the generalization ability to predict the optimal power output distribution on an unseen total power demand. Furthermore, an improved experience replay mechanism is introduced into the action networks to improve the stability of the training process. Finally, the effectiveness and robustness of the proposed MARL algorithm are verified by simulation.

引用

页码：9553 / 9564

页数：12

共 50 条

[31] Distributed dynamic economic dispatch of power generators with storage
Cherukuri, Ashish
Cortes, Jorge
2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 2365 - 2370
[32] Stochastic Economic Dispatch of Integrated Transmission and Distribution Networks Using Distributed Approximate Dynamic Programming
Fan, Guansheng
Lin, Shunjiang
Feng, Xiangyong
Wang, Qiong
Liu, Mingbo
IEEE SYSTEMS JOURNAL, 2022, 16 (04): : 5985 - 5996
[33] Distributed Dynamic Reinforcement of Efficient Outcomes in Multiagent Coordination and Network Formation
Georgios C. Chasparis
Jeff S. Shamma
Dynamic Games and Applications, 2012, 2 : 18 - 50
[34] Distributed Dynamic Reinforcement of Efficient Outcomes in Multiagent Coordination and Network Formation
Chasparis, Georgios C.
Shamma, Jeff S.
DYNAMIC GAMES AND APPLICATIONS, 2012, 2 (01) : 18 - 50
[35] Optimal Economic Gas Turbine Dispatch with Deep Reinforcement Learning
Sage, Manuel
Staniszewski, Martin
Zhao, Yaoyao Fiona
IFAC PAPERSONLINE, 2023, 56 (02): : 10039 - 10044
[36] A Reinforcement Learning Algorithm to Economic Dispatch Considering Transmission Losses
Jasmin, E. A.
Ahamed, T. P. Imthias
Jagathiraj, V. P.
2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 837 - 842
[37] Contingency-constrained economic dispatch with safe reinforcement learning
Eichelbeck, Michael
Markgraf, Hannah
Althoff, Matthias
2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 597 - 602
[38] A Reinforcement Learning Algorithm based on Neural Network for Economic Dispatch
Yu, Liying
Li, Ning
PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 1637 - 1642
[39] Dynamic Multichannel Access Based on Deep Reinforcement Learning in Distributed Wireless Networks
Cui, Qimei
Zhang, Ziyuan
Shi, Yanpeng
Ni, Wei
Zeng, Ming
Zhou, Mingyu
IEEE SYSTEMS JOURNAL, 2022, 16 (04): : 5831 - 5834
[40] Multiagent value iteration algorithms in dynamic programming and reinforcement learning
Bertsekas, Dimitri
RESULTS IN CONTROL AND OPTIMIZATION, 2020, 1

← 1 2 3 4 5 →