Energy saving control for subway station air conditioning systems based on reinforcement learning

被引:0
|
作者
Jiao H.-Y. [1 ]
Feng H.-D. [1 ]
Wei D. [1 ,2 ]
Ran Y.-B. [1 ,2 ]
Hu C.-W. [1 ,3 ]
机构
[1] School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing
[2] Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing
[3] Beijing Xingchuang Land Real Estate Development Co., Ltd, Beijing
来源
Kongzhi yu Juece/Control and Decision | 2022年 / 37卷 / 12期
关键词
deep deterministic policy gradient; energy saving control; multi-step prediction; neural networks; reinforcement learning; subway station air conditioning systems;
D O I
10.13195/j.kzyjc.2021.0778
中图分类号
学科分类号
摘要
The subway station air conditioning system consumes a lot of energy, and traditional control methods cannot take into account the comfort and energy saving issues together, resulting in poor control effect. Moreover, the current subway station air conditioning control systems control the air system and water system separately, which cannot guarantee the energy saving effect of the whole system. Therefore, this paper proposes an energy-saving control strategy for the system based on reinforcement learning. Firstly, this paper uses a neural network to establish an air conditioning system model as a simulation environment for offline training of the agent to solve the problem of long convergence time of model-free reinforcement learning methods for online training. Then, in order to improve the efficiency of the algorithm and also to address the characteristics of the multidimensional continuous action space of the air conditioning systems, this paper proposes a deep deterministic policy gradient algorithm based on multi-step prediction and designs an agent framework that will be used to interact with the environment model for training. In addition, in order to determine the optimal number of training times, the agent training termination condition is also set, which further improves the algorithm efficiency. Finally, simulation experiments are conducted based on the measured operational data of a subway station in Wuhan, and the results show that the proposed control strategy has better temperature tracking performance and can ensure the comfort of the platform, and the energy saving is about 17.908% compared with the current actual system. © 2022 Northeast University. All rights reserved.
引用
收藏
页码:3139 / 3148
页数:9
相关论文
共 24 条
  • [1] Zeng Y T, Zhao L., Research on thermal comfort and energy saving strategies of ventilation and air-conditioning system in underground subway stations, Railway Standard Design, 63, 3, pp. 178-183, (2019)
  • [2] Maddalena E T, Lian Y Z, Jones C N., Data-driven methods for building control — A review and promising future directions, Control Engineering Practice, 95, (2020)
  • [3] Sutton R S, Barto A G., Reinforcement learning, A Bradford Book, 15, 7, pp. 665-685, (1998)
  • [4] Dai X Y, Zhang Y B, Gao J, Et al., Design and implementation of artificial intelligence-based energy-saving control system on IoT cloud platform, Electrotechnical Application, 38, 11, pp. 97-104, (2019)
  • [5] Yan J W, Huang Q, Zhou X., Energy-saving optimization operation of central air-conditioning system based on double-DQN algorithm, Journal of South China University of Technology: Natural Science Edition, 47, 1, pp. 135-144, (2019)
  • [6] Yuan X L, Pan Y Q, Yang J R, Et al., Study on the application of reinforcement learning in the operation optimization of HVAC system, Building Simulation, 14, 1, pp. 75-87, (2021)
  • [7] Dalamagkidis K, Kolokotsa D, Kalaitzakis K, Et al., Reinforcement learning for energy conservation and comfort in buildings, Building and Environment, 42, 7, pp. 2686-2698, (2007)
  • [8] Afram A, Janabi-Sharifi F., Review of modeling methods for HVAC systems, Applied Thermal Engineering, 67, 1, pp. 507-519, (2014)
  • [9] Zhao J, Wang X, Wang B, Et al., Multi-category target recognition based on neural network, Control and Decision, 35, 8, pp. 2037-2041, (2020)
  • [10] Liu S Y, Wu D., Fuzzy clustering smooth support vector machine, Control and Decision, 32, 3, pp. 547-551, (2017)