Data-driven building load prediction and large language models: Comprehensive overview

被引:0
|
作者
Zhang, Yake [1 ]
Wang, Dijun [2 ]
Wang, Guansong [3 ]
Xu, Peng [1 ]
Zhu, Yihao [2 ]
机构
[1] Tongji Univ, Sch Mech Engn, Shanghai 200092, Peoples R China
[2] Guangzhou Metro Design & Res Inst Co Ltd, Guangzhou 510030, Peoples R China
[3] Tongji Univ, Sch Software Engn, Shanghai 200092, Peoples R China
基金
中国国家自然科学基金;
关键词
Data-driven approach; Building load prediction; Machine learning; Large language models; Feature engineering; Data preparation; Room-scale load prediction; Retrieval augmented generation; ARTIFICIAL NEURAL-NETWORKS; ENERGY-CONSUMPTION; DATA FUSION; RANDOM FOREST; ELECTRICITY CONSUMPTION; FORECASTING TECHNIQUES; KNOWLEDGE DISCOVERY; DEMAND; TIME; REGRESSION;
D O I
10.1016/j.enbuild.2024.115001
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Building load forecasting is essential for optimizing the architectural design and managing energy efficiently, enhancing the performance of Heating, Ventilation, and Air Conditioning systems, and enhancing occupant comfort. With advancements in data science and machine learning, the focus on predicting building loads through data analysis has significantly intensified as a research domain. However, previous studies have typically faced challenges such as data scarcity, improper feature extraction methods, and weak model generalization capabilities. To gain a deeper understanding of these issues, a comprehensive review of data processing, feature selection, and model selection methods in previous research is conducted from the perspective of the entire load forecasting process. The aim is to identify the most suitable methods for each step of load forecasting to enhance prediction accuracy. This review surveys the research progress of statistical learning methods, traditional machine learning methods, deep learning methods, and hybrid methods in different application scenarios of building load prediction. Then, it emphasized the critical role of data preprocessing and focused on techniques like data fusion and transfer learning to overcome data shortages and bolster the models' ability to generalize. Moreover, the obtainment of significant features from building characteristics, weather data, and operational statistics to boost prediction accuracy is explored. A notable contribution of this review is the proposed technical framework for EnergyPlus model generation using LLM-based Retrieval Augmented Generation (RAG) technology and room- level load prediction with Spatio-Temporal Graph Neural Networks. This framework utilize architectural design drawings to achieve an "end-to-end" prediction process, aiming to reduce the professional threshold of load prediction and provide technical support for fine-grained regulation of building operation. Exploratory experiment is conducted using a single-zone building model to verify the feasibility of LLMgenerated EnergyPlus models, with IDF simulation file generation taking only 196 s. Room-level load forecasting with LLMs remains to be explored further. It is reasonable to believe that the methods proposed in this review hold promise for advancing data-driven building load forecasting technologies.
引用
收藏
页数:28
相关论文
共 50 条
  • [31] Flood forecasting in large rivers with data-driven models
    Phuoc Khac-Tien Nguyen
    Lloyd Hock-Chye Chua
    Lam Hung Son
    Natural Hazards, 2014, 71 : 767 - 784
  • [32] Flood forecasting in large rivers with data-driven models
    Phuoc Khac-Tien Nguyen
    Chua, Lloyd Hock-Chye
    Son, Lam Hung
    NATURAL HAZARDS, 2014, 71 (01) : 767 - 784
  • [33] Data-driven urban building energy models for the platform of Toronto
    Vecchi, Francesca
    Berardi, Umberto
    Mutani, Guglielmina
    ENERGY EFFICIENCY, 2023, 16 (04)
  • [34] Data-driven urban building energy models for the platform of Toronto
    Francesca Vecchi
    Umberto Berardi
    Guglielmina Mutani
    Energy Efficiency, 2023, 16
  • [35] Data-Driven Analysis of Patients’ Body Language in Healthcare: A Comprehensive Review
    Turaev, Sherzod
    Babu, Aiswarya
    Al-Dabet, Saja
    Rustamov, Jaloliddin
    Rustamov, Zahiriddin
    Zaki, Nazar
    Mohamad, Mohd Saberi
    Loo, Chu Kiong
    IEEE ACCESS, 2024, 12 : 16514 - 16548
  • [36] PersonaCraft: Leveraging language models for data-driven persona development
    Jung, Soon-Gyo
    Salminen, Joni
    Aldous, Kholoud Khalil
    Jansen, Bernard J.
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2025, 197
  • [37] Novel Data-Driven Machine Learning Models for Heating Load Prediction: Single and Optimized Naive Bayes
    Li, Fangyuan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 657 - 668
  • [38] Large-scale comparison and demonstration of continual learning for adaptive data-driven building energy prediction
    Li, Ao
    Zhang, Chong
    Xiao, Fu
    Fan, Cheng
    Deng, Yang
    Wang, Dan
    APPLIED ENERGY, 2023, 347
  • [39] A comprehensive overview of the applications of kernel functions and data-driven models in regression and classification tasks in the context of software sensors
    Ngu, Joyce Chen Yen
    Yeo, Wan Sieng
    Thien, Teck Fu
    Nandong, Jobrun
    APPLIED SOFT COMPUTING, 2024, 164
  • [40] A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction
    Shui, Ruihao
    Cao, Yixin
    Xiang, Wang
    Chua, Tat-Seng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 7337 - 7348