Safe multi-agent deep reinforcement learning for decentralized low-carbon operation in active distribution networks and multi-microgrids

被引:0
|
作者
Ye, Tong [1 ,2 ]
Huang, Yuping [1 ,2 ,3 ,4 ]
Yang, Weijia [2 ,3 ,4 ]
Cai, Guotian [1 ,2 ,3 ,4 ]
Yang, Yuyao [5 ]
Pan, Feng [5 ]
机构
[1] Univ Sci & Technol China, Sch Energy Sci & Engn, Hefei 230026, Peoples R China
[2] Chinese Acad Sci, Guangzhou Inst Energy Convers, Guangzhou 510640, Peoples R China
[3] CAS Key Lab Renewable Energy, Guangzhou 510640, Peoples R China
[4] Guangdong Prov Key Lab Renewable Energy, Guangzhou 510640, Peoples R China
[5] Guangdong Power Grid Co Ltd, Metrol Ctr, Qingyuan 511545, Peoples R China
关键词
Active distribution network; Carbon emission allocation; Low-carbon economic operation; Multi-microgrid operation; Safe multi-agent deep reinforcement learning;
D O I
10.1016/j.apenergy.2025.125609
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Due to fundamental differences in operational entities between distribution networks and microgrids, the equitable allocation of carbon responsibilities remains challenging. Furthermore, achieving real-time, efficient, and secure low-carbon economic dispatch in decentralized multi-entities continues to face obstacles. Therefore, we propose a co-optimization framework for Active Distribution Networks (ADNs) and multi-Microgrids (MMGs) to improve operational efficiency and reduce carbon emissions through adaptive coordination and decisionmaking. To facilitate decentralized low-carbon decision-making, we introduce the Spatiotemporal Carbon Intensity Equalization Method (STCIEM). This method ensures privacy and fairness by processing local data and equitably distributing carbon responsibilities. Additionally, we propose a non-cooperative optimization strategy that enables entities to optimize their operations independently while considering both economic and environmental interests. To address the challenges of real-time decision-making and the non-convex nature of lowcarbon optimization inherent in traditional approaches, we have developed the Enhanced Action Projection Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (EAP-MATD3) algorithm. This algorithm enhances the actor's objective to address the actor-critic mismatch problem, thereby outperforming conventional safe multi-agent deep reinforcement learning methods by generating optimized actions that adhere to physical system constraints. Experiments conducted on the modified IEEE 33-bus network and IEEE 123-bus network demonstrate the superiority of our approach in effectively balancing economic and environmental objectives within complex energy systems.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Deep Multi-Agent Reinforcement Learning for Decentralized Active Hypothesis Testing
    Szostak, Hadar
    Cohen, Kobi
    IEEE ACCESS, 2024, 12 : 130444 - 130459
  • [2] Real-Time Operation Optimization in Active Distribution Networks Based on Multi-Agent Deep Reinforcement Learning
    Xu, Jie
    Gao, Hongjun
    Wang, Renjun
    Liu, Junyong
    JOURNAL OF MODERN POWER SYSTEMS AND CLEAN ENERGY, 2024, 12 (03) : 886 - 899
  • [3] Real-time Operation Optimization in Active Distribution Networks Based on Multi-agent Deep Reinforcement Learning
    Jie Xu
    Hongjun Gao
    Renjun Wang
    Junyong Liu
    Journal of Modern Power Systems and Clean Energy, 2024, 12 (03) : 886 - 899
  • [4] Multi-Agent Reinforcement Learning With Decentralized Distribution Correction
    Li, Kuo
    Jia, Qing-Shan
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 1684 - 1696
  • [5] Multi-Agent Reinforcement Learning With Decentralized Distribution Correction
    Li, Kuo
    Jia, Qing-Shan
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 1684 - 1696
  • [6] Dynamic Safe Interruptibility for Decentralized Multi-Agent Reinforcement Learning
    El Mhamdi, El Mandi
    Guerraoui, Rachid
    Hendrikx, Hadrien
    Maurer, Alexandre
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [7] Multi-Agent Reinforcement Learning for Microgrids
    Dimeas, A. L.
    Hatziargyriou, N. D.
    IEEE POWER AND ENERGY SOCIETY GENERAL MEETING 2010, 2010,
  • [8] Decentralized Multi-Agent Pursuit Using Deep Reinforcement Learning
    de Souza, Cristino, Jr.
    Newbury, Rhys
    Cosgun, Akansel
    Castillo, Pedro
    Vidolov, Boris
    Kulic, Dana
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03): : 4552 - 4559
  • [9] Decentralized Deterministic Multi-Agent Reinforcement Learning
    Grosnit, Antoine
    Cai, Desmond
    Wynter, Laura
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1548 - 1553
  • [10] Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks
    Wang, Jianhong
    Xu, Wangkun
    Gu, Yunjie
    Song, Wenbin
    Green, Tim C.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34