Rolling horizon wind-thermal unit commitment optimization based on deep reinforcement learning

被引:3
|
作者
Shi, Jinhao [1 ]
Wang, Bo [1 ]
Yuan, Ran [1 ]
Wang, Zhi [1 ]
Chen, Chunlin [1 ]
Watada, Junzo [2 ]
机构
[1] Nanjing Univ, Sch Management & Engn, Nanjing 210093, Peoples R China
[2] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu 8080135, Japan
基金
中国国家自然科学基金;
关键词
Unit commitment; Rolling optimization; Deep reinforcement learning; Wind power; Stochastic uncertainty; ROBUST ECONOMIC-DISPATCH; MANAGEMENT; REDUCTION; SCENARIOS;
D O I
10.1007/s10489-023-04489-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growing penetration of renewable energy has brought significant challenges for modern power system operation. Academic research and industrial practice show that adjusting unit commitment (UC) scheduling periodically according to new forecasts of renewable power provides a promising way to improve system stability and economy; however, this greatly increases the computational burden for solution methods. In this paper, a deep reinforcement learning (DRL) method is proposed to obtain timely and reliable solutions for rolling-horizon UC (RHUC). First, based on historical data and day-ahead point forecasting, a data-driven method is designed to construct typical wind power scenarios that are regarded as components of the state space of DRL. Second, a rolling mechanism is proposed to dynamically update the state space based on real-time wind power data. Third, unlike existing reinforcement learning-based UC solution methods that segment the continuous outputs of generators as discrete variables, all the variables in RHUC are regarded as continuous. Additionally, a series of updating regulations are defined to ensure that the model is realistic. Thus, a DRL algorithm, the twin delayed deep deterministic policy gradient (TD3), can be utilized to effectively solve the problem. Finally, several case studies are conducted based on different test systems to demonstrate the efficiency of the proposed method. According to the experimental results, the proposed algorithm can obtain high-quality solutions in a considerably shorter time than traditional methods, which leads to a reduction of at least 1.1% in the power system operation cost.
引用
收藏
页码:19591 / 19609
页数:19
相关论文
共 50 条
  • [31] Decentralized yaw optimization for maximizing wind farm production based on deep reinforcement learning
    Deng, Zhiwen
    Xu, Chang
    Han, Xingxing
    Cheng, Zhe
    Xue, Feifei
    ENERGY CONVERSION AND MANAGEMENT, 2023, 286
  • [32] Reinforcement learning and A* search for the unit commitment problem
    de Mars, Patrick
    OSullivan, Aidan
    ENERGY AND AI, 2022, 9
  • [33] Operation Optimization of Wind-Thermal Systems Considering Emission Problem
    Zhang, Yang
    Yao, Fang
    Iu, Herbert H. C.
    Fernando, Tyrone
    Trinh, Hieu
    IECON 2014 - 40TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2014, : 2199 - 2205
  • [34] A Multiobjective Optimization Dispatch Method of Wind-Thermal Power System
    Guo, Xiaoxuan
    Gong, Renxi
    Bao, Haibo
    Lu, Zhenkun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (12) : 2549 - 2558
  • [35] Wind-thermal systems operation optimization considering emission problem
    Zhang, Yang
    Yao, Fang
    Iu, Herbert H. C.
    Fernando, Tyrone
    Hieu Trinh
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2015, 65 : 238 - 245
  • [36] Wind generation effects on Thermal unit commitment
    Otton, Chris
    9TH INTERNATIONAL WORKSHOP ON LARGE-SCALE INTEGRATION OF WIND POWER INTO POWER SYSTEMS AS WELL AS ON TRANSMISSION NETWORKS FOR OFFSHORE WIND POWER PLANTS, 2010, : 281 - 287
  • [37] Rolling unit commitment for systems with significant installed wind capacity
    Tuohy, Aidan
    Denny, Eleanor
    O'Malley, Mark
    2007 IEEE LAUSANNE POWERTECH, VOLS 1-5, 2007, : 1380 - 1385
  • [38] Deep Reinforcement Learning Based on Proximal Policy Optimization for the Maintenance of a Wind Farm with Multiple Crews
    Pinciroli, Luca
    Baraldi, Piero
    Ballabio, Guido
    Compare, Michele
    Zio, Enrico
    ENERGIES, 2021, 14 (20)
  • [39] Container stacking optimization based on Deep Reinforcement Learning
    Jin, Xin
    Duan, Zhentang
    Song, Wen
    Li, Qiqiang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [40] Aerodynamic optimization of airfoil based on deep reinforcement learning
    Lou, Jinhua
    Chen, Rongqian
    Liu, Jiaqi
    Bao, Yue
    You, Yancheng
    Chen, Zhengwu
    PHYSICS OF FLUIDS, 2023, 35 (03)