Autonomous Building Control Using Offline Reinforcement Learning

被引:4
|
作者
Schepers, Jorren [1 ]
Eyckerman, Reinout [1 ]
Elmaz, Furkan [1 ]
Casteels, Wim [1 ]
Latre, Steven [1 ]
Hellinckx, Peter [1 ]
机构
[1] Univ Antwerp, Fac Appl Engn, IMEC, IDLab, Sint Pietersvliet 7, B-2000 Antwerp, Belgium
关键词
D O I
10.1007/978-3-030-89899-1_25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Artificial Intelligence (AI) powered building control allows deriving policies that are more flexible and energy efficient than standard control. However, there are challenges: environment interaction is used to train Reinforcement Learning (RL) agents but for building control it is often not possible to use a physical environment, and creating high fidelity simulators is a difficult task. With offline RL an agent can be trained without environment interaction, it is a data-driven approach to RL. In this paper, Conservative Q-Learning (CQL), an offline RL algorithm, is used to control the temperature setpoint in a room of a university campus building. The agent is trained using only the historical data available for this room. The results show that there is potential for offline RL in the field of building control, but also that there is room for improvement and need for further research in this area.
引用
收藏
页码:246 / 255
页数:10
相关论文
共 50 条
  • [41] Benchmarking Offline Reinforcement Learning
    Tittaferrante, Andrew
    Yassine, Abdulsalam
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 259 - 263
  • [42] Federated Offline Reinforcement Learning
    Zhou, Doudou
    Zhang, Yufeng
    Sonabend-W, Aaron
    Wang, Zhaoran
    Lu, Junwei
    Cai, Tianxi
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (548) : 3152 - 3163
  • [43] Distributed Offline Reinforcement Learning
    Heredia, Paulo
    George, Jemin
    Mou, Shaoshuai
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 4621 - 4626
  • [44] Learning conditional policies for crystal design using offline reinforcement learning
    Govindarajan, Prashant
    Miret, Santiago
    Rector-Brooks, Jarrid
    Phielipp, Mariano
    Rajendran, Janarthanan
    Chandar, Sarath
    DIGITAL DISCOVERY, 2024, 3 (04): : 769 - 785
  • [45] Safety-Aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving
    Lin, Haohong
    Ding, Wenhao
    Liu, Zuxin
    Niu, Yaru
    Zhu, Jiacheng
    Niu, Yuming
    Zhao, Ding
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4639 - 4646
  • [46] Research on obstacle avoidance of underactuated autonomous underwater vehicle based on offline reinforcement learning
    Liu, Tao
    Huang, Junhao
    Zhao, Jintao
    ROBOTICA, 2024,
  • [47] Hybrid offline-online reinforcement learning for obstacle avoidance in autonomous underwater vehicles
    Zhao, Jintao
    Liu, Tao
    Huang, Junhao
    SHIPS AND OFFSHORE STRUCTURES, 2024,
  • [48] Autonomous Control of Redundant Hydraulic Manipulator Using Reinforcement Learning with Action Feedback
    Dhakate, Rohit
    Brommer, Christian
    Boehm, Christoph
    Gietler, Harald
    Weiss, Stephan
    Steinbrener, Jan
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 7036 - 7043
  • [49] Autonomous underwater vehicle control using reinforcement learning policy search methods
    El-Fakdi, A
    Carreras, M
    Palomeras, N
    Ridao, P
    OCEANS 2005 - EUROPE, VOLS 1 AND 2, 2005, : 793 - 798
  • [50] Low-level autonomous control and tracking of quadrotor using reinforcement learning
    Pi, Chen-Huan
    Hu, Kai-Chun
    Cheng, Stone
    Wu, I-Chen
    CONTROL ENGINEERING PRACTICE, 2020, 95 (95)