Fault diagnosis and protection strategy based on spatio-temporal multi-agent reinforcement learning for active distribution system using phasor measurement units

Cited by: 4
Authors
Zhang, Tong [1]
Liu, Jianchang [2]
Wang, Honghai [2]
Li, Yong [3]
Wang, Nan [4]
Kang, Chengming [5]
Affiliations
[1] Shenyang Univ Technol, Sch Artificial Intelligence, Shenyang Key Lab Informat Percept & Edge Comp, Shenliao West Rd 111, Shenyang, Peoples R China
[2] Northeastern Univ, Sch Informat Sci & Engn, Shenyang, Peoples R China
[3] Shenyang Univ Technol, Sch Elect Engn, Shenyang, Peoples R China
[4] Shenyang Univ, Coll Mech Engn, Shenyang 110000, Peoples R China
[5] Shenyang Pharmaceut Univ, Sch Pharmaceut Engn, Shenyang, Peoples R China
Funding
China Postdoctoral Science Foundation;
Keywords
Phasor measurement unit; Active distribution network; Fault diagnosis and protection; Multi-agent reinforcement learning; Dynamic angles; ADAPTATION; FILTER;
DOI
10.1016/j.measurement.2023.113291
Chinese Library Classification (CLC) number
T [Industrial Technology];
Discipline classification code
08;
Abstract
An active distribution system (ADS) requires intelligent sensors to provide real-time data. Owing to harmonic distortion and a sparse reward function, multi-agent reinforcement learning strategies exhibit fuzzy characteristics and slow convergence. This work proposes a model-free spatio-temporal multi-agent reinforcement learning (STMARL) strategy for spatio-temporal fault diagnosis and protection. An augmented-state extended Kalman filter tracks the spatio-temporal sequences measured by phasor measurement units (PMUs) and feeds them into the diagnosis model. A supervised multi-residual generation learning (SMGL) model is constructed to diagnose single-phase-to-ground faults. Based on the spatio-temporal sequences, the SMGL diagnosis model formulates ADS protection as a Markov decision process, and the protection operation is quantified as the STMARL reward. In the hybrid multi-agent framework, the STMARL protection strategy converges faster by relying on suggestions from the higher-level agent rather than on a global reward. The STMARL protection strategy is validated on the IEEE 34-bus distribution test system with 10 PMUs. Compared with the SOGI, WNN, Sarsa and DDPG algorithms under common fault conditions, the STMARL protection strategy performs better in highly dynamic environments, with a response time of 1.274 s and a diagnosis accuracy of 97.125%. The STMARL diagnosis and protection strategy guides the ADS toward stable operation in coordination with all PMUs, which lays a foundation for synchronous measurement applications in the smart grid.
Pages: 12