Helping predictive analytics interpretation using regression trees and clustering perturbation

被引:0
|
作者
Parisot, Olivier [1 ]
Didry, Yoanne [1 ]
Tamisier, Thomas [1 ]
Otjacques, Benoit [1 ]
机构
[1] Publ Res Ctr, Gabriel Lippmann 41,Rue Brill, L-4422 Belvaux, Luxembourg
关键词
regression trees; clustering perturbation; predictive analytics;
D O I
10.1080/12460125.2015.994331
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Regression trees are helpful tools for decision support and predictive analytics, due to their simple structure and the ease with which they can be obtained from data. Nonetheless, when applied to non-trivial datasets, they tend to grow according to the complexity of the data, becoming difficult to interpret. This difficulty can be overcome by clustering the dataset and representing the regression tree of each cluster independently. In order to help create predictive models that are more comprehensible, we propose in this work a clustering perturbation method to reduce the size of the regression tree obtained from each cluster. A prototype has been developed and tested on several regression datasets.
引用
收藏
页码:55 / 72
页数:18
相关论文
共 50 条
  • [21] Incremental predictive clustering trees for online semi-supervised multi-target regression
    Osojnik, Aljaz
    Panov, Pance
    Dzeroski, Saso
    MACHINE LEARNING, 2020, 109 (11) : 2121 - 2139
  • [22] Hierarchical classification of diatom images using ensembles of predictive clustering trees
    Dimitrovski, Ivica
    Kocev, Dragi
    Loskovska, Suzana
    Dzeroski, Saso
    ECOLOGICAL INFORMATICS, 2012, 7 (01) : 19 - 29
  • [23] Pose clustering guided by short interpretation trees
    Olson, CF
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 149 - 152
  • [24] Building predictive models of counterinsurgent deaths using robust clustering and regression
    King, Marvin L.
    Hering, Amanda S.
    Aguilar, Oscar M.
    JOURNAL OF DEFENSE MODELING AND SIMULATION-APPLICATIONS METHODOLOGY TECHNOLOGY-JDMS, 2016, 13 (04): : 449 - 465
  • [25] Option Predictive Clustering Trees for Multilabel Classification
    Stepisnik, Tomaz
    Kocev, Dragi
    Dzeroski, Saso
    ACTA POLYTECHNICA HUNGARICA, 2020, 17 (10) : 109 - 128
  • [26] Analysis of time series data on agroecosystem vegetation using predictive clustering trees
    Debeljak, Marko
    Squire, Geoffrey R.
    Kocev, Dragi
    Hawes, Cathy
    Young, Mark W.
    Dzeroski, Saso
    ECOLOGICAL MODELLING, 2011, 222 (14) : 2524 - 2529
  • [27] Using predictive analytics in the library
    Massis, Bruce E.
    NEW LIBRARY WORLD, 2012, 113 (9-10) : 491 - 494
  • [28] Survival analysis as semi-supervised multi-target regression for time-to-employment prediction using oblique predictive clustering trees
    Andonovikj, Viktor
    Boskoski, Pavle
    Dzeroski, Saso
    Boshkoska, Biljana Mileva
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [29] Interpretation of dam deformation and leakage with boosted regression trees
    Salazar, Fernando
    Toledo, Miguel A.
    Onate, Eugenio
    Suarez, Benjamin
    ENGINEERING STRUCTURES, 2016, 119 : 230 - 251
  • [30] Incorporating Association Patterns into Manifold Clustering for Enabling Predictive Analytics
    Sy, Bon
    Chen, Jin
    Horowitz, Rebecca
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 1300 - 1305