Interpretable Clustering via Soft Clustering Trees

被引:0
|
作者
Cohen, Eldan [1 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
关键词
DECISION TREE;
D O I
10.1007/978-3-031-33271-5_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is a popular unsupervised learning task that consists of finding a partition of the data points that groups similar points together. Despite its popularity, most state-of-the-art algorithms do not provide any explanation of the obtained partition, making it hard to interpret. In recent years, several works have considered using decision trees to construct clusters that are inherently interpretable. However, these approaches do not scale to large datasets, do not account for uncertainty in results, and do not support advanced clustering objectives such as spectral clustering. In this work, we present soft clustering trees, an interpretable clustering approach that is based on soft decision trees that provide probabilistic cluster membership. We model soft clustering trees as continuous optimization problem that is amenable to efficient optimization techniques. Our approach is designed to output highly sparse decision trees to increase interpretability and to support tree-based spectral clustering. Extensive experiments show that our approach can produce clustering trees of significantly higher quality compared to the state-of-the-art and scale to large datasets.
引用
收藏
页码:281 / 298
页数:18
相关论文
共 50 条
  • [1] Interpretable clustering using unsupervised binary trees
    Ricardo Fraiman
    Badih Ghattas
    Marcela Svarc
    Advances in Data Analysis and Classification, 2013, 7 : 125 - 145
  • [2] Optimal Decision Trees For Interpretable Clustering with Constraints
    Shati, Pouya
    Cohen, Eldan
    McIlraith, Sheila
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 2022 - 2030
  • [3] Interpretable clustering using unsupervised binary trees
    Fraiman, Ricardo
    Ghattas, Badih
    Svarc, Marcela
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2013, 7 (02) : 125 - 145
  • [4] Using Decision Trees for Interpretable Supervised Clustering
    Kokash N.
    Makhnist L.
    SN Computer Science, 5 (2)
  • [5] Optimal Interpretable Clustering Using Oblique Decision Trees
    Gabidolla, Magzhan
    Carreira-Perpinan, Miguel A.
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 400 - 410
  • [6] Interpretable fuzzy clustering using unsupervised fuzzy decision trees
    Jiao, Lianmeng
    Yang, Haoyu
    Liu, Zhun-ga
    Pan, Quan
    INFORMATION SCIENCES, 2022, 611 : 540 - 563
  • [7] Interpretable categorical data clustering via hypothesis testing
    Hu, Lianyu
    Jiang, Mudi
    Dong, Junjie
    Liu, Xinying
    He, Zengyou
    PATTERN RECOGNITION, 2025, 162
  • [8] Fast and interpretable consensus clustering via minipatch learning
    Gan, Luqin
    Allen, Genevera, I
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (10)
  • [9] Interpretable Clustering via Multi-Polytope Machines
    Lawless, Connor
    Kalagnanam, Jayant
    Nguyen, Lam M.
    Dzung Phan
    Reddy, Chandra
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7309 - 7316
  • [10] Interpretable Clustering via Discriminative Rectangle Mixture Model
    Chen, Junxiang
    Chang, Yale
    Hobbs, Brian
    Castaldi, Peter
    Cho, Michael
    Silverman, Edwin
    Dy, Jennifer
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 823 - 828