Beam search induction and similarity constraints for predictive clustering trees

被引:0
|
作者
Kocev, Dragi [1 ]
Struyf, Jan [2 ]
Dzeroski, Saso [1 ]
机构
[1] Jozef Stefan Inst, Dept Knowledge Technol, Jamova 39, Ljubljana 1000, Slovenia
[2] Katholieke Univ Leuven, Dept Comp Sci, B-3001 Heverlee, Belgium
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Much research on inductive databases (IDBs) focuses on local models, such as item sets and association rules. In this work, we investigate how IDBs can support global models, such as decision trees. Our focus is on predictive clustering trees (PCTs). PCTs generalize decision trees and can be used for prediction and clustering, two of the most common data mining tasks. Regular PCT induction builds PCTs top-down, using a greedy algorithm, similar to that of C4.5. We propose a new induction algorithm for PCTs based on beam search. This has three advantages over the regular method: (a) it returns a set of PCTs satisfying the user constraints instead of just one PCT; (b) it better allows for pushing of user constraints into the induction algorithm; and (c) it is less susceptible to myopia. In addition, we propose similarity constraints for PCTs, which improve the diversity of the resulting PCT set.
引用
收藏
页码:134 / +
页数:2
相关论文
共 50 条
  • [1] Ranking with predictive clustering trees
    Todorovski, L
    Blockeel, H
    Dzeroski, S
    MACHINE LEARNING: ECML 2002, 2002, 2430 : 444 - 455
  • [2] Oblique predictive clustering trees
    Stepisnik, Tomaz
    Kocev, Dragi
    KNOWLEDGE-BASED SYSTEMS, 2021, 227
  • [3] Adaptive similarity search in metric trees
    Yousri, Noha A.
    Ismail, Mohammed A.
    Kamel, Mohamed S.
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 2663 - +
  • [4] Clustering trees with instance level constraints
    Struyf, Jan
    Dzeroski, Saso
    MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 359 - +
  • [5] Network Regression with Predictive Clustering Trees
    Stojanova, Daniela
    Ceci, Michelangelo
    Appice, Annalisa
    Dzeroski, Saso
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2011, 6913 : 333 - 348
  • [6] Multivariate Predictive Clustering Trees for Classification
    Stepisnik, Tomaz
    Kocev, Dragi
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2020), 2020, 12117 : 331 - 341
  • [7] Network regression with predictive clustering trees
    Daniela Stojanova
    Michelangelo Ceci
    Annalisa Appice
    Sašo Džeroski
    Data Mining and Knowledge Discovery, 2012, 25 : 378 - 413
  • [8] Network regression with predictive clustering trees
    Stojanova, Daniela
    Ceci, Michelangelo
    Appice, Annalisa
    Dzeroski, Saso
    DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 25 (02) : 378 - 413
  • [9] Optimal Decision Trees For Interpretable Clustering with Constraints
    Shati, Pouya
    Cohen, Eldan
    McIlraith, Sheila
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 2022 - 2030
  • [10] Option Predictive Clustering Trees for Multilabel Classification
    Stepisnik, Tomaz
    Kocev, Dragi
    Dzeroski, Saso
    ACTA POLYTECHNICA HUNGARICA, 2020, 17 (10) : 109 - 128