Safe multi-agent deep reinforcement learning for decentralized low-carbon operation in active distribution networks and multi-microgrids

被引:0
|
作者
Ye, Tong [1 ,2 ]
Huang, Yuping [1 ,2 ,3 ,4 ]
Yang, Weijia [2 ,3 ,4 ]
Cai, Guotian [1 ,2 ,3 ,4 ]
Yang, Yuyao [5 ]
Pan, Feng [5 ]
机构
[1] Univ Sci & Technol China, Sch Energy Sci & Engn, Hefei 230026, Peoples R China
[2] Chinese Acad Sci, Guangzhou Inst Energy Convers, Guangzhou 510640, Peoples R China
[3] CAS Key Lab Renewable Energy, Guangzhou 510640, Peoples R China
[4] Guangdong Prov Key Lab Renewable Energy, Guangzhou 510640, Peoples R China
[5] Guangdong Power Grid Co Ltd, Metrol Ctr, Qingyuan 511545, Peoples R China
关键词
Active distribution network; Carbon emission allocation; Low-carbon economic operation; Multi-microgrid operation; Safe multi-agent deep reinforcement learning;
D O I
10.1016/j.apenergy.2025.125609
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Due to fundamental differences in operational entities between distribution networks and microgrids, the equitable allocation of carbon responsibilities remains challenging. Furthermore, achieving real-time, efficient, and secure low-carbon economic dispatch in decentralized multi-entities continues to face obstacles. Therefore, we propose a co-optimization framework for Active Distribution Networks (ADNs) and multi-Microgrids (MMGs) to improve operational efficiency and reduce carbon emissions through adaptive coordination and decisionmaking. To facilitate decentralized low-carbon decision-making, we introduce the Spatiotemporal Carbon Intensity Equalization Method (STCIEM). This method ensures privacy and fairness by processing local data and equitably distributing carbon responsibilities. Additionally, we propose a non-cooperative optimization strategy that enables entities to optimize their operations independently while considering both economic and environmental interests. To address the challenges of real-time decision-making and the non-convex nature of lowcarbon optimization inherent in traditional approaches, we have developed the Enhanced Action Projection Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (EAP-MATD3) algorithm. This algorithm enhances the actor's objective to address the actor-critic mismatch problem, thereby outperforming conventional safe multi-agent deep reinforcement learning methods by generating optimized actions that adhere to physical system constraints. Experiments conducted on the modified IEEE 33-bus network and IEEE 123-bus network demonstrate the superiority of our approach in effectively balancing economic and environmental objectives within complex energy systems.
引用
收藏
页数:21
相关论文
共 50 条
  • [11] A coordinated active and reactive power optimization approach for multi-microgrids connected to distribution networks with multi-actor-attention-critic deep reinforcement learning
    Dong, Lei
    Lin, Hao
    Qiao, Ji
    Zhang, Tao
    Zhang, Shiming
    Pu, Tianjiao
    APPLIED ENERGY, 2024, 373
  • [12] Decentralized Anomaly Detection via Deep Multi-Agent Reinforcement Learning
    Szostak, Hadar
    Cohen, Kobi
    2022 58TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2022,
  • [13] Decentralized Multi-agent Formation Control via Deep Reinforcement Learning
    Gutpa, Aniket
    Nallanthighal, Raghava
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 1, 2021, : 289 - 295
  • [14] Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning
    Qu, Chao
    Mannor, Shie
    Xu, Huan
    Qi, Yuan
    Song, Le
    Xiong, Junwu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [15] Decentralized Policy Gradient Descent Ascent for Safe Multi-Agent Reinforcement Learning
    Lu, Songtao
    Zhang, Kaiqing
    Chen, Tianyi
    Basar, Tamer
    Horesh, Lior
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8767 - 8775
  • [16] Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks
    Chen, Binqiang
    Liu, Dong
    Hanzo, Lajos
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 3983 - 3988
  • [17] Transform networks for cooperative multi-agent deep reinforcement learning
    Hongbin Wang
    Xiaodong Xie
    Lianke Zhou
    Applied Intelligence, 2023, 53 : 9261 - 9269
  • [18] Decentralized coordination between active distribution network and multi-microgrids through a fast decentralized adjustable robust operation framework✩
    Chen, Xiao
    Zhai, Junyi
    Jiang, Yuning
    Ni, Chenyixuan
    Wang, Sheng
    Nimmegeers, Philippe
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2023, 34
  • [19] Assured Deep Multi-Agent Reinforcement Learning for Safe Robotic Systems
    Riley, Joshua
    Calinescu, Radu
    Paterson, Colin
    Kudenko, Daniel
    Banks, Alec
    AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2021, 2022, 13251 : 158 - 180
  • [20] Transform networks for cooperative multi-agent deep reinforcement learning
    Wang, Hongbin
    Xie, Xiaodong
    Zhou, Lianke
    APPLIED INTELLIGENCE, 2023, 53 (08) : 9261 - 9269