Energy saving control for subway station air conditioning systems based on reinforcement learning

被引：0

作者：

Jiao H.-Y. ^{[1
]}

Feng H.-D. ^{[1
]}

Wei D. ^{[1
,2
]}

Ran Y.-B. ^{[1
,2
]}

Hu C.-W. ^{[1
,3
]}

机构：

[1] School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing

[2] Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing

[3] Beijing Xingchuang Land Real Estate Development Co., Ltd, Beijing

来源：

Kongzhi yu Juece/Control and Decision | 2022年 / 37卷 / 12期

关键词：

deep deterministic policy gradient; energy saving control; multi-step prediction; neural networks; reinforcement learning; subway station air conditioning systems;

D O I：

10.13195/j.kzyjc.2021.0778

中图分类号：

学科分类号：

摘要：

The subway station air conditioning system consumes a lot of energy, and traditional control methods cannot take into account the comfort and energy saving issues together, resulting in poor control effect. Moreover, the current subway station air conditioning control systems control the air system and water system separately, which cannot guarantee the energy saving effect of the whole system. Therefore, this paper proposes an energy-saving control strategy for the system based on reinforcement learning. Firstly, this paper uses a neural network to establish an air conditioning system model as a simulation environment for offline training of the agent to solve the problem of long convergence time of model-free reinforcement learning methods for online training. Then, in order to improve the efficiency of the algorithm and also to address the characteristics of the multidimensional continuous action space of the air conditioning systems, this paper proposes a deep deterministic policy gradient algorithm based on multi-step prediction and designs an agent framework that will be used to interact with the environment model for training. In addition, in order to determine the optimal number of training times, the agent training termination condition is also set, which further improves the algorithm efficiency. Finally, simulation experiments are conducted based on the measured operational data of a subway station in Wuhan, and the results show that the proposed control strategy has better temperature tracking performance and can ensure the comfort of the platform, and the energy saving is about 17.908% compared with the current actual system. © 2022 Northeast University. All rights reserved.

引用

页码：3139 / 3148

页数：9

共 24 条

[1] Zeng Y T, Zhao L., Research on thermal comfort and energy saving strategies of ventilation and air-conditioning system in underground subway stations, Railway Standard Design, 63, 3, pp. 178-183, (2019)
[2] Maddalena E T, Lian Y Z, Jones C N., Data-driven methods for building control — A review and promising future directions, Control Engineering Practice, 95, (2020)
[3] Sutton R S, Barto A G., Reinforcement learning, A Bradford Book, 15, 7, pp. 665-685, (1998)
[4] Dai X Y, Zhang Y B, Gao J, Et al., Design and implementation of artificial intelligence-based energy-saving control system on IoT cloud platform, Electrotechnical Application, 38, 11, pp. 97-104, (2019)
[5] Yan J W, Huang Q, Zhou X., Energy-saving optimization operation of central air-conditioning system based on double-DQN algorithm, Journal of South China University of Technology: Natural Science Edition, 47, 1, pp. 135-144, (2019)
[6] Yuan X L, Pan Y Q, Yang J R, Et al., Study on the application of reinforcement learning in the operation optimization of HVAC system, Building Simulation, 14, 1, pp. 75-87, (2021)
[7] Dalamagkidis K, Kolokotsa D, Kalaitzakis K, Et al., Reinforcement learning for energy conservation and comfort in buildings, Building and Environment, 42, 7, pp. 2686-2698, (2007)
[8] Afram A, Janabi-Sharifi F., Review of modeling methods for HVAC systems, Applied Thermal Engineering, 67, 1, pp. 507-519, (2014)
[9] Zhao J, Wang X, Wang B, Et al., Multi-category target recognition based on neural network, Control and Decision, 35, 8, pp. 2037-2041, (2020)
[10] Liu S Y, Wu D., Fuzzy clustering smooth support vector machine, Control and Decision, 32, 3, pp. 547-551, (2017)

← 1 2 3 →