Beam search induction and similarity constraints for predictive clustering trees

被引:0
|
作者
Kocev, Dragi [1 ]
Struyf, Jan [2 ]
Dzeroski, Saso [1 ]
机构
[1] Jozef Stefan Inst, Dept Knowledge Technol, Jamova 39, Ljubljana 1000, Slovenia
[2] Katholieke Univ Leuven, Dept Comp Sci, B-3001 Heverlee, Belgium
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Much research on inductive databases (IDBs) focuses on local models, such as item sets and association rules. In this work, we investigate how IDBs can support global models, such as decision trees. Our focus is on predictive clustering trees (PCTs). PCTs generalize decision trees and can be used for prediction and clustering, two of the most common data mining tasks. Regular PCT induction builds PCTs top-down, using a greedy algorithm, similar to that of C4.5. We propose a new induction algorithm for PCTs based on beam search. This has three advantages over the regular method: (a) it returns a set of PCTs satisfying the user constraints instead of just one PCT; (b) it better allows for pushing of user constraints into the induction algorithm; and (c) it is less susceptible to myopia. In addition, we propose similarity constraints for PCTs, which improve the diversity of the resulting PCT set.
引用
收藏
页码:134 / +
页数:2
相关论文
共 50 条
  • [41] Ensembles of extremely randomized predictive clustering trees for predicting structured outputs
    Dragi Kocev
    Michelangelo Ceci
    Tomaž Stepišnik
    Machine Learning, 2020, 109 : 2213 - 2241
  • [42] Hierarchical multi-classification with predictive clustering trees in functional genomics
    Struyf, J
    Dzeroski, S
    Blockeel, H
    Clare, A
    PROGRESS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, 3808 : 272 - 283
  • [43] Efficient Top-k Graph Similarity Search With GED Constraints
    Kim, Jongik
    IEEE ACCESS, 2022, 10 : 79180 - 79191
  • [44] Induction of decision multi-trees using Levin search
    Ferri-Ramírez, C
    Hernández-Orallo, J
    Ramírez-Quintana, MJ
    COMPUTATIONAL SCIENCE-ICCS 2002, PT I, PROCEEDINGS, 2002, 2329 : 166 - 175
  • [45] Deep semi-supervised clustering based on pairwise constraints and sample similarity
    Qin, Xiao
    Yuan, Changan
    Jiang, Jianhui
    Chen, Long
    PATTERN RECOGNITION LETTERS, 2024, 178 : 1 - 6
  • [46] Cascaded Model Predictive Position Control of Induction Motor with Constraints
    Gan, Lu
    Wang, Liuping
    39TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2013), 2013, : 2656 - 2661
  • [47] A fuzzy beam-search rule induction algorithm
    Fertig, CS
    Freitas, AA
    Arruda, LVR
    Kaestner, C
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 1704 : 341 - 347
  • [48] Scene Labeling Using Beam Search Under Mutex Constraints
    Roy, Anirban
    Todorovic, Sinisa
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1178 - 1185
  • [49] A novel bit level time series representation with implication of similarity search and clustering
    Ratanamahatana, C
    Keogh, E
    Bagnal, AJ
    Lonardi, S
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 771 - 777
  • [50] Clustering-based similarity search in metric spaces with sparse spatial centers
    Brisaboa, Nieves
    Pedreira, Oscar
    Seco, Diego
    Solar, Roberto
    Uribe, Roberto
    SOFSEM 2008: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2008, 4910 : 186 - +