Real vs. simulated: Questions on the capability of simulated datasets on building fault detection for energy efficiency from a data-driven perspective

Cited by: 15
Authors
Huang, Jiajing [1]
Wen, Jin [2]
Yoon, Hyunsoo [3]
Pradhan, Ojas [2]
Wu, Teresa [1]
O'Neill, Zheng [4]
Candan, Kasim Selcuk [1]
Affiliations
[1] Arizona State Univ, Sch Comp & Augmented Intelligence, Tempe, AZ 85281 USA
[2] Drexel Univ, Dept Civil Architectural & Environm Engn, Philadelphia, PA 19104 USA
[3] Yonsei Univ, Dept Ind Engn, Seoul 03722, South Korea
[4] Texas A&M Univ, J Mike Walker 66 Dept Mech Engn, College Stn, TX 77843 USA
Funding
US National Science Foundation;
Keywords
Building AFDD; Machine learning; Simulated; Real; Similarity; NEURAL-NETWORK; HVAC SYSTEMS; DIAGNOSIS; PROGNOSTICS; STRATEGY; SENSORS; WAVELET;
DOI
10.1016/j.enbuild.2022.111872
Chinese Library Classification
TU [Building Science];
Discipline classification code
0813;
Abstract
Literature on building Automatic Fault Detection and Diagnosis (AFDD) mainly focuses on simulated system data because real building data are expensive and difficult to obtain and analyze. What is lacking is validation of how the performance and scalability of data-driven AFDD approaches on simulated data compare with those on real building data. In this study, we conduct two sets of experiments to address this question. We first evaluate data-driven fault detection strategies on real and simulated building data separately. We observe that fault detection performance is not affected by the fault detection strategy, the size of the training data, or the number of cross-validation folds when training and blind test data come from the same source, namely simulated or real building data. Next, we conduct a cross-dataset study in which the model is developed using simulated data and tested on real building data. The results indicate that a model trained on simulated data does not generalize to fault detection on real building data. A Kolmogorov-Smirnov test confirms that statistical differences exist between the simulated and real building data and identifies a subset of features that are similar across the two datasets. Using this subset of features, cross-dataset experiments show improved fault detection in most fault cases. We conclude that even if the system produces simulated data with the same fault symptoms from a physical analysis perspective, not all features from simulated datasets are beneficial for AFDD; only a subset of features contains valuable information from a machine learning perspective. (c) 2022 Elsevier B.V. All rights reserved.
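The feature-screening step mentioned in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the feature names, the synthetic Gaussian data, the 0.05 significance level, and the helper names `ks_two_sample` and `similar_features` are all assumptions made for illustration. The p-value uses the standard asymptotic approximation of the Kolmogorov distribution rather than an exact computation.

```python
import math
import random
from bisect import bisect_right


def ks_two_sample(a, b):
    """Return (statistic, approximate p-value) for a two-sample KS test."""
    a, b = sorted(a), sorted(b)
    n, m = len(a), len(b)
    # The largest gap between the two empirical CDFs occurs at an observed value.
    d = max(
        abs(bisect_right(a, x) / n - bisect_right(b, x) / m)
        for x in a + b
    )
    # Asymptotic Kolmogorov distribution with a common small-sample correction.
    en = math.sqrt(n * m / (n + m))
    lam = (en + 0.12 + 0.11 / en) * d
    if lam < 0.3:
        # The series converges poorly here; the p-value is essentially 1.
        return d, 1.0
    p = 2.0 * sum(
        (-1) ** (k - 1) * math.exp(-2.0 * k * k * lam * lam)
        for k in range(1, 101)
    )
    return d, min(max(p, 0.0), 1.0)


def similar_features(simulated, real, alpha=0.05):
    """Keep features whose simulated and real samples are not rejected as different."""
    kept = []
    for name in simulated:
        _, p = ks_two_sample(simulated[name], real[name])
        if p >= alpha:  # alpha is an assumed threshold, not taken from the paper
            kept.append(name)
    return kept


rng = random.Random(0)
# Hypothetical features: one drawn from the same distribution in both sources,
# one whose simulated values are shifted relative to the real ones.
simulated = {
    "supply_air_temp": [rng.gauss(13.0, 1.0) for _ in range(500)],
    "fan_power": [rng.gauss(5.0, 1.0) for _ in range(500)],
}
real = {
    "supply_air_temp": [rng.gauss(13.0, 1.0) for _ in range(500)],
    "fan_power": [rng.gauss(8.0, 1.0) for _ in range(500)],
}
selected = similar_features(simulated, real)
print(selected)
```

In this synthetic setup the strongly shifted feature is rejected, while whether a matching feature is kept depends on the random draw, since the test rejects a true match with probability of about alpha. A model for the cross-dataset experiment would then be trained only on the columns named in `selected`.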
Pages: 10
Related papers
8 in total
  • [1] Real vs. simulated: Questions on the capability of simulated datasets on building fault detection for energy efficiency from a data-driven perspective
    Huang, Jiajing
    Wen, Jin
    Yoon, Hyunsoo
    Pradhan, Ojas
    Wu, Teresa
    O'Neill, Zheng
    Candan, Kasim Selcuk
    Energy and Buildings, 2022, 259
  • [2] Detection of spoofed AIS: Simulated tracks vs. real maritime data
    Pohontu, Alexandru
    Vertan, Constantin
    Ciocioi, Iancu
    Popa, Ciprian
    ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2025, 35 (01): 37-50
  • [3] Data-Driven Fault-Tolerant Control for Energy Efficiency in a Multi-Zone Building
    Jain, Tushar
    Yame, Joseph J.
    Sauter, Dominique
    2016 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2016
  • [4] From Model, Signal to Knowledge: A Data-Driven Perspective of Fault Detection and Diagnosis
    Dai, Xuewu
    Gao, Zhiwei
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2013, 9 (04): 2226-2238
  • [5] High Impedance Fault Detection Method Efficiency: Simulation vs. Real-World Data Acquisition
    Ghaderi, Amin
    Mohammadpour, Hossein Ali
    Ginn, Herbert
    2015 IEEE POWER AND ENERGY CONFERENCE AT ILLINOIS (PECI), 2015
  • [6] Adaptation of Dynamic Data-Driven Models for Real-Time Applications: From Simulated to Real Batch Distillation Trajectories by Transfer Learning
    Rihm, Gerardo Brand
    Schueler, Merlin
    Nentwich, Corina
    Esche, Erik
    Repke, Jens-Uwe
    CHEMIE INGENIEUR TECHNIK, 2023, 95 (07): 1125-1133
  • [7] Enhancing energy efficiency in supermarkets: A data-driven approach for fault detection and diagnosis in CO2 refrigeration systems
    Farahani, Masoud Kishani
    Yazdi, Mohammad Hossein
    Talaei, Mohammad
    Ghahnavieh, Abbas Rajabi
    APPLIED ENERGY, 2025, 377
  • [8] Fault data seasonal imbalance and insufficiency impacts on data-driven heating, ventilation and air-conditioning fault detection and diagnosis performances for energy-efficient building operations
    Zhong, Fangliang
    Calautit, John Kaiser
    Wu, Yupeng
    ENERGY, 2023, 282