Efficient evolution of decision trees via fully matrix-based fitness evaluation

被引:6
|
作者
Costa, Vinicius G. [1 ]
Salcedo-Sanz, Sancho [2 ]
Pedreira, Carlos E. [1 ]
机构
[1] Univ Fed Rio de Janeiro, Syst Engn & Comp Sci Dept, Rio De Janeiro, Brazil
[2] Univ Alcala, Dept Signal Proc & Commun, Madrid 28805, Spain
关键词
Decision trees; Interpretability; Problem encoding; Coral reef optimization; Evolutionary computation; CORAL-REEFS OPTIMIZATION; ALGORITHMS; SELECTION; DESIGN;
D O I
10.1016/j.asoc.2023.111045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decision Trees (DTs) are a class of supervised learning models that are widely used for both classification and regression applications. They are well-known for their interpretability and robustness, which have led them to remain popular even 60 years after they were first proposed. However, because traditional tree algorithms use greedy methods that are prone to suboptimality, several works have explored the usage of evolutionary algorithms instead. Although these algorithms are often reported to outperform the traditional greedy approach, their computational cost is much higher, since the evolutionary component requires a large number (millions or billions) of function evaluations in order to produce a single tree. Aiming to reduce this computational cost, in this work we propose an encoding that allows the training and evaluation of DTs using only matrix operations. The proposed procedure is shown to be much faster than the traditional tree implementation for complete trees with depths ranging from 2 to 6, and for datasets ranging in size from 100 to 100,000 observations. In particular, the results show speedups of nearly up to 20 times, especially when the dataset is large and the desired tree is small enough to be interpretable. The proposed procedure also benefits from GPU parallelization, although it is still highly performing without it. Furthermore, we propose an evolutionary algorithm, called Coral Reef Optimization for Decision Trees (CRO-DT), that integrates this encoding with a pre-existing ensemble algorithm to evolve better univariate trees. The results obtained show that the proposed CRO-DT is competitive with traditional and modern tree algorithms, consistently producing models of good quality across 14 tested UCI Datasets. We conclude that for most relevant situations, the proposed matrix encoding provides significant speedups over the traditional implementation, and also may serve as a basis for high quality evolutionary DT algorithms.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Feature Selection using Compact Discernibility Matrix-based Approach in Dynamic Incomplete Decision System
    Qian, Wenbin
    Shu, Wenhao
    Xie, Yonghong
    Yang, Bingru
    Yang, Jun
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2015, 31 (02) : 509 - 527
  • [42] Efficient Privacy-Preserving Matrix Factorization via Fully Homomorphic Encryption
    Kim, Sungwook
    Kim, Jinsu
    Koo, Dongyoung
    Kim, Yuna
    Yoon, Hyunsoo
    Shin, Junbum
    ASIA CCS'16: PROCEEDINGS OF THE 11TH ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, : 617 - 628
  • [43] Transfer precipitation learning via patterns of dependency matrix-based machine learning approaches
    Altunkaynak, Abdusselam
    Kullahci, Kubra
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24): : 22177 - 22196
  • [44] Transfer precipitation learning via patterns of dependency matrix-based machine learning approaches
    Abdüsselam Altunkaynak
    Kübra Küllahcı
    Neural Computing and Applications, 2022, 34 : 22177 - 22196
  • [45] Feature selection using compact discernibility matrix-based approach in dynamic incomplete decision system
    School of Software, Jiangxi Agriculture University, Nanchang, 330045, China
    不详
    不详
    J. Inf. Sci. Eng., 2 (509-527):
  • [46] Design and evaluation of matrix-based controlled release tablets of diclofenac sodium and chondroitin sulphate
    Amelia Avachat
    Vikram Kotwal
    AAPS PharmSciTech, 8
  • [47] An efficient and secure feature location approach in source code using Jacobian matrix-based clustering
    Balaji, N.
    Lakshmi, S.
    Anand, M.
    Anbarasan, M.
    Mathiyalagan, P.
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (12): : 7235 - 7247
  • [48] An Efficient Traffic Forecasting System Based on Spatial Data and Decision Trees
    Prasad, Kalli Srinivasa
    Ramakrishna, Seelam
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2014, 11 (02) : 186 - 194
  • [49] Design and evaluation of matrix-based controlled release tablets of diclofenac sodium and chondroitin sulphate
    Avachat, Amelia
    Kotwal, Vikram
    AAPS PHARMSCITECH, 2007, 8 (04)
  • [50] COMPUTATIONALLY EFFICIENT GRADIENTS FOR RELAXATION MATRIX-BASED STRUCTURE REFINEMENT INCLUDING THE ACCOMMODATION OF INTERNAL MOTIONS
    DELLWO, MJ
    WAND, J
    JOURNAL OF BIOMOLECULAR NMR, 1993, 3 (02) : 205 - 214