Distributed Multiagent Reinforcement Learning With Action Networks for Dynamic Economic Dispatch

被引：8

作者：

Hu, Chengfang ^{[1
]}

Wen, Guanghui ^{[2
]}

Wang, Shuai ^{[3
,4
]}

Fu, Junjie ^{[2
]}

Yu, Wenwu ^{[2
]}

机构：

[1] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China

[2] Southeast Univ, Sch Math, Dept Syst Sci, Nanjing 211189, Peoples R China

[3] Beihang Univ, Res Inst Frontier Sci, Beijing 100191, Peoples R China

[4] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Power demand; Heuristic algorithms; Prediction algorithms; Couplings; Approximation algorithms; Power system stability; Convex functions; Distributed optimization; dynamic economic dispatch; multiagent reinforcement learning (MARL); smart grids; VISIBLE IMAGE FUSION; PERFORMANCE; INFORMATION; ALGORITHM; PROTEIN;

D O I：

10.1109/TNNLS.2023.3234049

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A new class of distributed multiagent reinforcement learning (MARL) algorithm suitable for problems with coupling constraints is proposed in this article to address the dynamic economic dispatch problem (DEDP) in smart grids. Specifically, the assumption made commonly in most existing results on the DEDP that the cost functions are known and/or convex is removed in this article. A distributed projection optimization algorithm is designed for the generation units to find the feasible power outputs satisfying the coupling constraints. By using a quadratic function to approximate the state-action value function of each generation unit, the approximate optimal solution of the original DEDP can be obtained by solving a convex optimization problem. Then, each action network utilizes a neural network (NN) to learn the relationship between the total power demand and the optimal power output of each generation unit, such that the algorithm obtains the generalization ability to predict the optimal power output distribution on an unseen total power demand. Furthermore, an improved experience replay mechanism is introduced into the action networks to improve the stability of the training process. Finally, the effectiveness and robustness of the proposed MARL algorithm are verified by simulation.

引用

页码：9553 / 9564

页数：12

共 50 条

[21] A Distributed Optimization Algorithm Based on Multiagent Network for Economic Dispatch With Region Partitioning
Liu, Qingshan
Le, Xinyi
Li, Kaixuan
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (05) : 2466 - 2475
[22] Reinforcement Learning with Enhanced Safety for Optimal Dispatch of Distributed Energy Resources in Active Distribution Networks
Xu Yang
Haotian Liu
Wenchuan Wu
Qi Wang
Peng Yu
Jiawei Xing
Yuejiao Wang
Journal of Modern Power Systems and Clean Energy, 2024, 12 (05) : 1484 - 1494
[23] Reinforcement Learning with Enhanced Safety for Optimal Dispatch of Distributed Energy Resources in Active Distribution Networks
Yang, Xu
Liu, Haotian
Wu, Wenchuan
Wang, Qi
Yu, Peng
Xing, Jiawei
Wang, Yuejiao
JOURNAL OF MODERN POWER SYSTEMS AND CLEAN ENERGY, 2024, 12 (05) : 1484 - 1494
[24] A Multiagent Fuzzy Reinforcement Learning Approach for Economic Power Dispatch Considering Multiple Plug-In Electric Vehicle Loads
Nandan Kumar Navin
Arabian Journal for Science and Engineering, 2021, 46 : 1431 - 1449
[25] Dynamic pricing based on asymmetric multiagent reinforcement learning
Könönen, V
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2006, 21 (01) : 73 - 98
[26] A Multiagent Fuzzy Reinforcement Learning Approach for Economic Power Dispatch Considering Multiple Plug-In Electric Vehicle Loads
Navin, Nandan Kumar
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2021, 46 (02) : 1431 - 1449
[27] Distributed Value Function Approximation for Collaborative Multiagent Reinforcement Learning
Stankovic, Milos S.
Beko, Marko
Stankovic, Srdjan S.
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2021, 8 (03): : 1270 - 1280
[28] Distributed response to network intrusions using multiagent reinforcement learning
Malialis, Kleanthis
Kudenko, Daniel
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 41 : 270 - 284
[29] Distributed Coordination of DERs With Storage for Dynamic Economic Dispatch
Cherukuri, Ashish
Cortes, Jorge
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2018, 63 (03) : 835 - 842
[30] Dynamic Economic Dispatch for Microgrids: A Fully Distributed Approach
Zheng, Weiye
Wu, Wenchuan
Zhang, Boming
Sun, Hongbin
Guo, Qinglai
Lin, Chenhui
2016 IEEE/PES TRANSMISSION AND DISTRIBUTION CONFERENCE AND EXPOSITION (T&D), 2016,

← 1 2 3 4 5 →