Towards optimal control of air handling units using deep reinforcement learning and recurrent neural network

Cited by: 115
Authors
Zou, Zhengbo [1]
Yu, Xinran [1]
Ergan, Semiha [1]
Affiliations
[1] NYU, Dept Civil & Urban Engn, MetroTech Ctr 15, Brooklyn, NY 11201 USA
Keywords
HVAC control; Energy consumption; Thermal comfort; Deep reinforcement learning; Long-short-term-memory network; ENERGY-CONSUMPTION; BUILDINGS; SYSTEMS; COMFORT; MODELS;
DOI
10.1016/j.buildenv.2019.106535
Chinese Library Classification
TU [Building Science];
Discipline classification code
0813;
Abstract
Optimal control of heating, ventilation and air conditioning (HVAC) systems aims to minimize the energy consumption of equipment while maintaining the thermal comfort of occupants. Traditional rule-based control methods are not optimized for HVAC systems with continuous sensor readings and actuator controls. Recent developments in deep reinforcement learning (DRL) have enabled control of HVAC systems with continuous sensor inputs and actions, while eliminating the need to build complex thermodynamic models. DRL control includes an environment, which approximates real-world HVAC operations, and an agent, which aims to achieve optimal control over the HVAC system. Existing DRL control frameworks use simulation tools (e.g., EnergyPlus) to build DRL training environments with HVAC system information, but oversimplify building geometries. This study proposes a framework that aims to achieve optimal control over air handling units (AHUs) by using long-short-term-memory (LSTM) networks to approximate real-world HVAC operations and thereby build DRL training environments. The framework also implements state-of-the-art DRL algorithms (e.g., deep deterministic policy gradient) for optimal control over the AHUs. Three AHUs, each with two years of building automation system (BAS) data, were used as testbeds for evaluation. Our LSTM-based DRL training environments, built using the first year's BAS data, achieved an average mean square error of 0.0015 across 16 normalized AHU parameters. When deployed in the testing environments, which were built using the second year's BAS data of the same AHUs, the DRL agents achieved 27%-30% energy savings compared to the actual energy consumption, while maintaining the predicted percentage of discomfort (PPD) at 10%.
Pages: 15
Related papers
50 in total
  • [21] Airflow Direction Control of Air Conditioners Using Deep Reinforcement Learning
    Sakuma, Yuiko
    Nishi, Hiroaki
    2020 SICE INTERNATIONAL SYMPOSIUM ON CONTROL SYSTEMS (SICE ISCS 2020), 2020, : 61 - 68
  • [22] Neural Malware Control with Deep Reinforcement Learning
    Wang, Yu
    Stokes, Jack W.
    Marinescu, Mady
    MILCOM 2019 - 2019 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2019,
  • [23] Deep Neural Network-Based Surrogate Model for Optimal Component Sizing of Power Converters Using Deep Reinforcement Learning
    Bui, Van-Hai
    Chang, Fangyuan
    Su, Wencong
    Wang, Mengqi
    Murphey, Yi Lu
    Da Silva, Felipe Leno
    Huang, Can
    Xue, Lingxiao
    Glatt, Ruben
    IEEE ACCESS, 2022, 10 : 78702 - 78712
  • [24] Optimal load distribution control for airport terminal chiller units based on deep reinforcement learning
    Chen, Bochao
    Zeng, Wenhao
    Nie, Haowen
    Deng, Ziyou
    Yang, Wansheng
    Yan, Biao
    JOURNAL OF BUILDING ENGINEERING, 2024, 97
  • [25] Towards optimal control of HPV model using safe reinforcement learning with actor-critic neural networks
    Amirabadi, Roya Khalili
    Fard, Omid S.
    Farimani, Mohsen Jalaeian
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [26] Accelerating the Deep Reinforcement Learning with Neural Network Compression
    Zhang, Hongjie
    He, Zhuocheng
    Li, Jing
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [27] Adaptive neural network control of robot manipulator using reinforcement learning
    Tang, Li
    Liu, Yan-Jun
    JOURNAL OF VIBRATION AND CONTROL, 2014, 20 (14) : 2162 - 2171
  • [28] IMPROVING DEEP REINFORCEMENT LEARNING FOR FINANCIAL TRADING USING NEURAL NETWORK DISTILLATION
    Tsantekidis, Avraam
    Passalis, Nikolaos
    Tefas, Anastasios
    PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [29] Robust flow control and optimal sensor placement using deep reinforcement learning
    Paris, Romain
    Beneddine, Samir
    Dandois, Julien
    JOURNAL OF FLUID MECHANICS, 2021, 913
  • [30] Deep Reinforcement Learning for Time Optimal Velocity Control using Prior Knowledge
    Hartmann, Gabriel
    Shiller, Zvi
    Azaria, Amos
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 186 - 193