Symbolic regression as a feature engineering method for machine and deep learning regression tasks

被引:2
|
作者
Shmuel, Assaf [1 ]
Glickman, Oren [1 ]
Lazebnik, Teddy [2 ,3 ]
机构
[1] Bar Ilan Univ, Dept Comp Sci, Ramat Gan, Israel
[2] Ariel Univ, Dept Math, Ariel, Israel
[3] UCL, Canc Inst, Dept Canc Biol, London, England
来源
关键词
symbolic regression; neural network; data-driven physics; feature engineering; data science; FEATURE-SELECTION; BIG DATA; MODEL;
D O I
10.1088/2632-2153/ad513a
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the realm of machine and deep learning (DL) regression tasks, the role of effective feature engineering (FE) is pivotal in enhancing model performance. Traditional approaches of FE often rely on domain expertise to manually design features for machine learning (ML) models. In the context of DL models, the FE is embedded in the neural network's architecture, making it hard for interpretation. In this study, we propose to integrate symbolic regression (SR) as an FE process before a ML model to improve its performance. We show, through extensive experimentation on synthetic and 21 real-world datasets, that the incorporation of SR-derived features significantly enhances the predictive capabilities of both machine and DL regression models with 34%-86% root mean square error (RMSE) improvement in synthetic datasets and 4%-11.5% improvement in real-world datasets. In an additional realistic use case, we show the proposed method improves the ML performance in predicting superconducting critical temperatures based on Eliashberg theory by more than 20% in terms of RMSE. These results outline the potential of SR as an FE component in data-driven models, improving them in terms of performance and interpretability.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Symbolic Regression Based Extreme Learning Machine Models for System Identification
    Başak Esin Köktürk-Güzel
    Selami Beyhan
    Neural Processing Letters, 2021, 53 : 1565 - 1578
  • [22] Feature selection of generalized extreme learning machine for regression problems
    Zhao, Yong-Ping
    Pan, Ying-Ting
    Song, Fang-Quan
    Sun, Liguo
    Chen, Ting-Hao
    NEUROCOMPUTING, 2018, 275 : 2810 - 2823
  • [23] Symbolic Regression Enhanced Decision Trees for Classification Tasks
    Sen Fong, Kei
    Motani, Mehul
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12033 - 12042
  • [24] Symbolic Regression Methods for Reinforcement Learning
    Kubalik, Jiri
    Derner, Erik
    Zegklitz, Jan
    Babuska, Robert
    IEEE ACCESS, 2021, 9 : 139697 - 139711
  • [25] Improving Genetic Programming Based Symbolic Regression Using Deterministic Machine Learning
    Icke, Ilknur
    Bongard, Joshua C.
    2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 1763 - 1770
  • [26] Solving the Control Synthesis Problem Through Supervised Machine Learning of Symbolic Regression
    Diveev, Askhat
    Sofronova, Elena
    Konyrbaev, Nurbek
    MATHEMATICS, 2024, 12 (22)
  • [27] Machine Learning Feedback Control Approach Based on Symbolic Regression for Robotic Systems
    Diveev, Askhat
    Shmalko, Elizaveta
    MATHEMATICS, 2022, 10 (21)
  • [28] Deep Differentiable Symbolic Regression Neural Network
    Lu, Qiang
    Luo, Yuanzhen
    Li, Haoyang
    Luo, Jake
    Wang, Zhiguang
    NEUROCOMPUTING, 2025, 629
  • [29] Rediscovering the Mullins effect with deep symbolic regression
    Abdusalamov, Rasul
    Weise, Jendrik
    Itskov, Mikhail
    INTERNATIONAL JOURNAL OF PLASTICITY, 2024, 179
  • [30] An Interactive Visualization Platform for Deep Symbolic Regression
    Kim, Joanne T.
    Kim, Sookyung
    Petersen, Brenden K.
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 5261 - 5263