Safe multi-agent deep reinforcement learning for decentralized low-carbon operation in active distribution networks and multi-microgrids

被引:0
|
作者
Ye, Tong [1 ,2 ]
Huang, Yuping [1 ,2 ,3 ,4 ]
Yang, Weijia [2 ,3 ,4 ]
Cai, Guotian [1 ,2 ,3 ,4 ]
Yang, Yuyao [5 ]
Pan, Feng [5 ]
机构
[1] Univ Sci & Technol China, Sch Energy Sci & Engn, Hefei 230026, Peoples R China
[2] Chinese Acad Sci, Guangzhou Inst Energy Convers, Guangzhou 510640, Peoples R China
[3] CAS Key Lab Renewable Energy, Guangzhou 510640, Peoples R China
[4] Guangdong Prov Key Lab Renewable Energy, Guangzhou 510640, Peoples R China
[5] Guangdong Power Grid Co Ltd, Metrol Ctr, Qingyuan 511545, Peoples R China
关键词
Active distribution network; Carbon emission allocation; Low-carbon economic operation; Multi-microgrid operation; Safe multi-agent deep reinforcement learning;
D O I
10.1016/j.apenergy.2025.125609
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Due to fundamental differences in operational entities between distribution networks and microgrids, the equitable allocation of carbon responsibilities remains challenging. Furthermore, achieving real-time, efficient, and secure low-carbon economic dispatch in decentralized multi-entities continues to face obstacles. Therefore, we propose a co-optimization framework for Active Distribution Networks (ADNs) and multi-Microgrids (MMGs) to improve operational efficiency and reduce carbon emissions through adaptive coordination and decisionmaking. To facilitate decentralized low-carbon decision-making, we introduce the Spatiotemporal Carbon Intensity Equalization Method (STCIEM). This method ensures privacy and fairness by processing local data and equitably distributing carbon responsibilities. Additionally, we propose a non-cooperative optimization strategy that enables entities to optimize their operations independently while considering both economic and environmental interests. To address the challenges of real-time decision-making and the non-convex nature of lowcarbon optimization inherent in traditional approaches, we have developed the Enhanced Action Projection Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (EAP-MATD3) algorithm. This algorithm enhances the actor's objective to address the actor-critic mismatch problem, thereby outperforming conventional safe multi-agent deep reinforcement learning methods by generating optimized actions that adhere to physical system constraints. Experiments conducted on the modified IEEE 33-bus network and IEEE 123-bus network demonstrate the superiority of our approach in effectively balancing economic and environmental objectives within complex energy systems.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Decentralized Multi-agent Reinforcement Learning with Shared Actions
    Mishra, Rajesh K.
    Vasal, Deepanshu
    Vishwanath, Sriram
    2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
  • [32] Multi-Agent Deep Reinforcement Learning for Decentralized Cooperative Traffic Signal Control
    Zhao, Yang
    Hu, Jian-Ming
    Gao, Ming-Yang
    Zhang, Zuo
    CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 458 - 470
  • [33] Decentralized Multi-Agent Deep Reinforcement Learning in Swarms of Drones for Flood Monitoring
    Baldazo, David
    Parras, Juan
    Zazo, Santiago
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [34] Optimal energy management of multi-microgrids connected to distribution system based on deep reinforcement learning
    Guo, Chenyu
    Wang, Xin
    Zheng, Yihui
    Zhang, Feng
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2021, 131
  • [35] Energy management based on safe multi-agent reinforcement learning for smart buildings in distribution networks
    Sun, Yiyun
    Zhang, Senlin
    Liu, Meiqin
    Zheng, Ronghao
    Dong, Shanling
    ENERGY AND BUILDINGS, 2024, 318
  • [36] Multi-Agent Based Optimal Scheduling and Trading for Multi-Microgrids Integrated With Urban Transportation Networks
    Liu, Yun
    Wang, Yu
    Li, Yuanzheng
    Gooi, Hoay Beng
    Xin, Huanhai
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2021, 36 (03) : 2197 - 2210
  • [37] Multi-Agent Deep Reinforcement Learning for Decentralized Voltage-Var Control in Distribution Power System
    Zhang, Mengfan
    Xu, Qianwen
    Magnusson, Sindri
    Pilawa-Podgurski, Robert C. N.
    Guo, Guodong
    2022 IEEE ENERGY CONVERSION CONGRESS AND EXPOSITION (ECCE), 2022,
  • [38] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [39] HALFTONING WITH MULTI-AGENT DEEP REINFORCEMENT LEARNING
    Jiang, Haitian
    Xiong, Dongliang
    Jiang, Xiaowen
    Yin, Aiguo
    Ding, Li
    Huang, Kai
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 641 - 645
  • [40] Deep reinforcement learning for multi-agent interaction
    Ahmed, Ibrahim H.
    Brewitt, Cillian
    Carlucho, Ignacio
    Christianos, Filippos
    Dunion, Mhairi
    Fosong, Elliot
    Garcin, Samuel
    Guo, Shangmin
    Gyevnar, Balint
    McInroe, Trevor
    Papoudakis, Georgios
    Rahman, Arrasy
    Schafer, Lukas
    Tamborski, Massimiliano
    Vecchio, Giuseppe
    Wang, Cheng
    Albrecht, Stefano, V
    AI COMMUNICATIONS, 2022, 35 (04) : 357 - 368