Augmenting Explainable Data-Driven Models in Energy Systems: A Python']Python Framework for Feature Engineering

被引:0
|
作者
Wilfling, Sandra [1 ]
机构
[1] Graz Univ Technol, Inst Software Technol, Graz, Austria
关键词
Energy systems modeling; Data-driven modeling; Feature engineering; FEATURE-SELECTION; PREDICTION;
D O I
10.1007/978-3-031-47062-2_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data-driven modeling is an approach in energy systems modeling that has been gaining popularity. In data-driven modeling, machine learning methods such as linear regression, neural networks or decision-tree based methods are applied. While these methods do not require domain knowledge, they are sensitive to data quality. Therefore, improving data quality in a dataset is beneficial for creating machine learning-based models. The improvement of data quality can be implemented through preprocessing methods. A selected type of preprocessing is feature engineering, which focuses on evaluating and improving the quality of certain features inside the dataset. Feature engineering includes methods such as feature creation, feature expansion, or feature selection. In this work, a Python framework containing different feature engineering methods is presented. This framework contains different methods for feature creation, expansion and selection; in addition, methods for transforming or filtering data are implemented. The implementation of the framework is based on the Python library scikit-learn. The framework is demonstrated on a use case from energy demand prediction. A data-driven model is created including selected feature engineering methods. The results show an improvement in prediction accuracy through the engineered features.
引用
收藏
页码:121 / 129
页数:9
相关论文
共 50 条
  • [1] B-AMA: A Python']Python-coded protocol to enhance the application of data-driven models in hydrology
    Amaranto, Alessandro
    Mazzoleni, Maurizio
    ENVIRONMENTAL MODELLING & SOFTWARE, 2023, 160
  • [2] Using web2py Python']Python framework for creating data-driven web applications in the academic library
    Miles, Mathew
    LIBRARY HI TECH, 2016, 34 (01) : 164 - 171
  • [3] Python']Python Data Driven framework for acceleration of Phase-Field simulations
    Fetni, Seifallah
    Delahaye, Jocelyn
    Habraken, Anne Marie
    SOFTWARE IMPACTS, 2023, 17
  • [4] A Python']Python Toolbox for Data-Driven Aerodynamic Modeling Using Sparse Gaussian Processes
    Valayer, Hugo
    Bartoli, Nathalie
    Castano-Aguirre, Mauricio
    Lafage, Remi
    Lefebvre, Thierry
    Lopez-Lopera, Andres F.
    Mouton, Sylvain
    AEROSPACE, 2024, 11 (04)
  • [5] PABLO: Helping Novices Debug Python']Python Code Through Data-Driven Fault Localization
    Cosman, Benjamin
    Endres, Madeline
    Sakkas, Georgios
    Medvinsky, Leon
    Yang, Yao-Yuan
    Jhala, Ranjit
    Chaudhuri, Kamalika
    Weimer, Westley
    SIGCSE 2020: PROCEEDINGS OF THE 51ST ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, 2020, : 1047 - 1053
  • [6] GReNaDIne: A Data-Driven Python']Python Library to Infer Gene Regulatory Networks from Gene Expression Data
    Schmitt, Pauline
    Sorin, Baptiste
    Froute, Timothee
    Parisot, Nicolas
    Calevro, Federica
    Peignier, Sergio
    GENES, 2023, 14 (02)
  • [7] fuzzycreator: A Python']Python-Based Toolkit for Automatically Generating and Analysing Data-Driven Fuzzy Sets
    McCulloch, Josie
    2017 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2017,
  • [8] A modular Python']Python framework for rapid development of advanced control algorithms for energy systems
    Eser, Steffen
    Storek, Thomas
    Wuellhorst, Fabian
    Daehling, Stefan
    Gall, Jan
    Stoffel, Phillip
    Mueller, Dirk
    APPLIED ENERGY, 2025, 385
  • [9] pyvrft: A Python']Python package for the Virtual Reference Feedback Tuning, a direct data-driven control method
    Boeira, Emerson
    Eckhard, Diego
    SOFTWAREX, 2020, 11
  • [10] Interpretable and explainable predictive machine learning models for data-driven protein engineering
    Medina-Ortiz, David
    Khalifeh, Ashkan
    Anvari-Kazemabad, Hoda
    Davari, Mehdi D.
    BIOTECHNOLOGY ADVANCES, 2025, 79