Interpretable Clustering via Soft Clustering Trees

Authors
Cohen, Eldan [1]
Affiliations
[1] Univ Toronto, Toronto, ON, Canada
Keywords
DECISION TREE;
DOI
10.1007/978-3-031-33271-5_19
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Clustering is a popular unsupervised learning task that consists of finding a partition of the data points that groups similar points together. Despite its popularity, most state-of-the-art algorithms do not provide any explanation of the obtained partition, making it hard to interpret. In recent years, several works have considered using decision trees to construct clusters that are inherently interpretable. However, these approaches do not scale to large datasets, do not account for uncertainty in results, and do not support advanced clustering objectives such as spectral clustering. In this work, we present soft clustering trees, an interpretable clustering approach that is based on soft decision trees that provide probabilistic cluster membership. We model soft clustering trees as a continuous optimization problem that is amenable to efficient optimization techniques. Our approach is designed to output highly sparse decision trees to increase interpretability and to support tree-based spectral clustering. Extensive experiments show that our approach can produce clustering trees of significantly higher quality than the state of the art and can scale to large datasets.
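To make the phrase "soft decision trees that provide probabilistic cluster membership" concrete, here is a minimal sketch. It is not the paper's formulation: the sigmoid routing at each internal node, the heap-ordered node layout, the function names, the example weights, and the leaf-to-cluster assignment are all assumptions chosen for illustration. Each internal node sends a point right with some probability, a leaf's probability is the product of the routing probabilities along its path, and a cluster's membership is the sum over the leaves assigned to it.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def leaf_probabilities(x, nodes, depth):
    """Probability of reaching each leaf of a complete binary soft tree.

    nodes: (weights, bias) pairs for the internal nodes, in heap order
    (root first, then each level left to right). Hypothetical layout.
    """
    probs = [1.0]  # probability mass at each node of the current level
    first = 0      # index in `nodes` of the first node on this level
    for _ in range(depth):
        next_probs = []
        for i, p in enumerate(probs):
            w, b = nodes[first + i]
            go_right = sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)
            next_probs.append(p * (1.0 - go_right))  # left child
            next_probs.append(p * go_right)          # right child
        first += len(probs)
        probs = next_probs
    return probs


def cluster_membership(x, nodes, depth, leaf_to_cluster, n_clusters):
    """Sum leaf probabilities per cluster to get a soft assignment."""
    membership = [0.0] * n_clusters
    for p, c in zip(leaf_probabilities(x, nodes, depth), leaf_to_cluster):
        membership[c] += p
    return membership


# Illustrative depth-2 tree over two features. Each node uses a single
# nonzero weight, mimicking a sparse (axis-aligned) split.
nodes = [
    ([4.0, 0.0], -2.0),  # root: soft split on feature 0 near 0.5
    ([0.0, 4.0], -2.0),  # left child: soft split on feature 1 near 0.5
    ([0.0, 4.0], -2.0),  # right child: soft split on feature 1 near 0.5
]
leaf_to_cluster = [0, 1, 1, 2]  # assumed mapping of the four leaves
membership = cluster_membership([0.9, 0.1], nodes, 2, leaf_to_cluster, 3)
print(membership)
```

Because the two routing probabilities at every node sum to one, the leaf probabilities (and hence the cluster memberships) always sum to one, which is what lets the output be read as a probabilistic assignment.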
Pages: 281 - 298
Number of pages: 18
Related papers
50 in total
  • [31] Deep soft clustering: simultaneous deep embedding and soft-partition clustering
    Kang Li
    Tongguang Ni
    Jing Xue
    Yizhang Jiang
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 5581 - 5593
  • [32] A clustering algorithm for huge trees
    Auber, D
    Delest, M
    ADVANCES IN APPLIED MATHEMATICS, 2003, 31 (01) : 46 - 60
  • [33] Improving decision trees by clustering
    Chalasani, V
    IC-AI '04 & MLMTA'04, VOL 1 AND 2, PROCEEDINGS, 2004 : 959 - 963
  • [34] Ranking with predictive clustering trees
    Todorovski, L
    Blockeel, H
    Dzeroski, S
    MACHINE LEARNING: ECML 2002, 2002, 2430 : 444 - 455
  • [35] CLUSTERING WITH MINIMUM SPANNING TREES
    Zhou, Yan
    Grygorash, Oleksandr
    Hain, Thomas F.
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2011, 20 (01) : 139 - 177
  • [36] Clustering rooted ordered trees
    Chehreghani, Mostafa Haghir
    Rahgozar, Masoud
    Lucas, Caro
    Chehreghani, Morteza Haghir
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007 : 450 - 455
  • [37] SPANNING TREES AND ASPECTS OF CLUSTERING
    HUBERT, L
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 1974, 27 (MAY) : 14 - 28
  • [38] Recovering Trees with Convex Clustering
    Chi, Eric C.
    Steinerberger, Stefan
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2019, 1 (03) : 383 - 407
  • [39] Oblique predictive clustering trees
    Stepisnik, Tomaz
    Kocev, Dragi
    KNOWLEDGE-BASED SYSTEMS, 2021, 227
  • [40] VORONOI TREES AND CLUSTERING PROBLEMS
    DEHNE, F
    NOLTEMEIER, H
    INFORMATION SYSTEMS, 1987, 12 (02) : 171 - 175