Helping predictive analytics interpretation using regression trees and clustering perturbation

被引:0
|
作者
Parisot, Olivier [1 ]
Didry, Yoanne [1 ]
Tamisier, Thomas [1 ]
Otjacques, Benoit [1 ]
机构
[1] Publ Res Ctr, Gabriel Lippmann 41,Rue Brill, L-4422 Belvaux, Luxembourg
关键词
regression trees; clustering perturbation; predictive analytics;
D O I
10.1080/12460125.2015.994331
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Regression trees are helpful tools for decision support and predictive analytics, due to their simple structure and the ease with which they can be obtained from data. Nonetheless, when applied to non-trivial datasets, they tend to grow according to the complexity of the data, becoming difficult to interpret. This difficulty can be overcome by clustering the dataset and representing the regression tree of each cluster independently. In order to help create predictive models that are more comprehensible, we propose in this work a clustering perturbation method to reduce the size of the regression tree obtained from each cluster. A prototype has been developed and tested on several regression datasets.
引用
收藏
页码:55 / 72
页数:18
相关论文
共 50 条
  • [1] Helping decision support with regression trees and clustering perturbation
    Parisot, Olivier
    Didry, Yoann
    Tamisier, Thomas
    DSS 2.0 - SUPPORTING DECISION MAKING WITH NEW TECHNOLOGIES, 2014, 261 : 104 - 109
  • [2] Network Regression with Predictive Clustering Trees
    Stojanova, Daniela
    Ceci, Michelangelo
    Appice, Annalisa
    Dzeroski, Saso
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2011, 6913 : 333 - 348
  • [3] Network regression with predictive clustering trees
    Daniela Stojanova
    Michelangelo Ceci
    Annalisa Appice
    Sašo Džeroski
    Data Mining and Knowledge Discovery, 2012, 25 : 378 - 413
  • [4] Network regression with predictive clustering trees
    Stojanova, Daniela
    Ceci, Michelangelo
    Appice, Annalisa
    Dzeroski, Saso
    DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 25 (02) : 378 - 413
  • [5] Option Predictive Clustering Trees for Multi-target Regression
    Osojnik, Aljaz
    Dzeroski, Saso
    Kocev, Dragi
    DISCOVERY SCIENCE, (DS 2016), 2016, 9956 : 118 - 133
  • [6] Option predictive clustering trees for multi-target regression
    Stepisnik, Tomaz
    Osojnik, Aljaz
    Dzeroski, Saso
    Kocev, Dragi
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2020, 17 (02) : 459 - 486
  • [7] Predictive Clustering Trees for Hierarchical Multi-Target Regression
    Mileski, Vanja
    DZeroski, Saso
    Kocev, Dragi
    ADVANCES IN INTELLIGENT DATA ANALYSIS XVI, IDA 2017, 2017, 10584 : 223 - 234
  • [8] Predictive analytics of insurance claims using multivariate decision trees
    Quan, Zhiyu
    Valdez, Emiliano A.
    DEPENDENCE MODELING, 2018, 6 (01): : 377 - 407
  • [9] Additive regression trees and smoothing splines - Predictive modeling and interpretation in data mining
    Li, Bin
    Goel, Prem K.
    PREDICTION AND DISCOVERY, 2007, 443 : 83 - 101
  • [10] Predictive analytics: Parametric models for regression and classification using R
    Huang, Yu-Jyun
    BIOMETRICS, 2022, 78 (02) : 816 - 817