Modular Multitree Genetic Programming for Evolutionary Feature Construction for Regression

被引:2
|
作者
Zhang, Hengzhe [1 ,2 ]
Chen, Qi [1 ,2 ]
Xue, Bing [1 ,2 ]
Banzhaf, Wolfgang [3 ]
Zhang, Mengjie [1 ,2 ]
机构
[1] Victoria Univ Wellington, Ctr Data Sci & Artificial Intelligence, Wellington 6140, New Zealand
[2] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6140, New Zealand
[3] Michigan State Univ, Dept Comp Sci & Engn, E Lansing 48824, MI USA
关键词
Genetic programming; Task analysis; Semantics; Random forests; Machine learning algorithms; Computational modeling; Contracts; Evolutionary feature construction; evolutionary forest; genetic programming (GP); modularity; random forest; SELECTION; OPERATOR;
D O I
10.1109/TEVC.2023.3318638
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Evolutionary feature construction is a key technique in evolutionary machine learning, with the aim of constructing high-level features that enhance performance of a learning algorithm. In real-world applications, engineers typically construct complex features based on a combination of basic features, reusing those features as modules. However, modularity in evolutionary feature construction is still an open research topic. This article tries to fill that gap by proposing a modular and hierarchical multitree genetic programming (GP) algorithm that allows trees to use the output values of other trees, thereby representing expressive features in a compact form. Based on this new representation, we propose a macro parent-repair strategy to reduce redundant and irrelevant features, a macro crossover operator to preserve interactive features, and an adaptive control strategy for crossover and mutation rates to dynamically balance the tradeoff between exploration and exploitation. A comparison with seven bloat control methods on 98 regression datasets shows that the proposed modular representation achieves significantly better results in terms of test performance and smaller model size. Experimental results on the state-of-the-art acrlong SRBench demonstrate that the proposed symbolic regression method outperforms 22 existing symbolic regression and machine learning algorithms, providing empirical evidence for the superiority of the modularized evolutionary feature construction method.
引用
收藏
页码:1455 / 1469
页数:15
相关论文
共 50 条
  • [21] Genetic Programming based Feature Construction for Classification with Incomplete Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 1033 - 1040
  • [22] Hierarchical Feature Construction for Image Classification Using Genetic Programming
    Suganuma, Masanori
    Tsuchiya, Daiki
    Shirakawa, Shinichi
    Nagao, Tomoharu
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 1423 - 1428
  • [23] Classification of EEG Signals using Genetic Programming for Feature Construction
    Miranda, Icaro Marcelino
    Aranha, Claus
    Ladeira, Marcelo
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19), 2019, : 1275 - 1283
  • [24] Evolutionary feature selection approaches for insolvency business prediction with genetic programming
    Ángel Beade
    Manuel Rodríguez
    José Santos
    Natural Computing, 2023, 22 : 705 - 722
  • [25] Evolutionary feature selection approaches for insolvency business prediction with genetic programming
    Beade, Angel
    Rodriguez, Manuel
    Santos, Jose
    NATURAL COMPUTING, 2023, 22 (04) : 705 - 722
  • [26] Evolutionary Image Descriptor: A Dynamic Genetic Programming Representation for Feature Extraction
    Al-Sahaf, Harith
    Zhang, Mengjie
    Johnston, Mark
    GECCO'15: PROCEEDINGS OF THE 2015 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2015, : 975 - 982
  • [27] Multi-Tree Genetic Programming for Feature Construction-Based Domain Adaptation in Symbolic Regression with Incomplete Data
    Al-Helali, Baligh
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    GECCO'20: PROCEEDINGS OF THE 2020 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2020, : 913 - 921
  • [28] A novel approach to dynamic portfolio trading system using multitree genetic programming
    Mousavi, Somayeh
    Esfahanipour, Akbar
    Zarandi, Mohammad Hossein Fazel
    KNOWLEDGE-BASED SYSTEMS, 2014, 66 : 68 - 81
  • [29] Genetic Programming for Region Detection, Feature Extraction, Feature Construction and Classification in Image Data
    Lensen, Andrew
    Al-Sahaf, Harith
    Zhang, Mengjie
    Xue, Bing
    GENETIC PROGRAMMING, EUROGP 2016, 2016, 9594 : 51 - 67
  • [30] A geometric semantic macro-crossover operator for evolutionary feature construction in regression
    Zhang, Hengzhe
    Chen, Qi
    Xue, Bing
    Banzhaf, Wolfgang
    Zhang, Mengjie
    GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2024, 25 (01)