Interpretable Clustering via Soft Clustering Trees

Authors
Cohen, Eldan [1]
Affiliations
[1] Univ Toronto, Toronto, ON, Canada
Keywords
Decision tree
DOI
10.1007/978-3-031-33271-5_19
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Clustering is a popular unsupervised learning task that consists of finding a partition of the data points that groups similar points together. Despite its popularity, most state-of-the-art algorithms do not provide any explanation of the obtained partition, making it hard to interpret. In recent years, several works have considered using decision trees to construct clusters that are inherently interpretable. However, these approaches do not scale to large datasets, do not account for uncertainty in results, and do not support advanced clustering objectives such as spectral clustering. In this work, we present soft clustering trees, an interpretable clustering approach based on soft decision trees, which provide probabilistic cluster membership. We model soft clustering trees as a continuous optimization problem that is amenable to efficient optimization techniques. Our approach is designed to output highly sparse decision trees to increase interpretability and to support tree-based spectral clustering. Extensive experiments show that our approach can produce clustering trees of significantly higher quality than the state of the art and that it scales to large datasets.
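
To make "probabilistic cluster membership" concrete, here is a minimal sketch of how a generic soft decision tree turns a data point into a probability for each leaf. It is not the paper's model or training procedure, which the abstract does not specify. The sigmoid gating, the heap-ordered node layout, the function name leaf_probabilities, and the parameter values are all assumptions made for illustration, and each leaf is treated as one cluster.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def leaf_probabilities(x, weights, biases, depth):
    """Return the probability of reaching each leaf of a complete soft binary tree.

    Assumed layout (illustrative only): internal nodes are stored in heap
    order, with the root at index 0 and the children of node i at 2*i + 1
    and 2*i + 2. Node i sends x to its right child with probability
    sigmoid(weights[i] . x + biases[i]) and to its left child otherwise.
    """
    probs = [1.0]  # probability mass per node on the current level (root only)
    first = 0      # heap index of the first node on the current level
    for _ in range(depth):
        next_probs = []
        for offset, mass in enumerate(probs):
            node = first + offset
            score = sum(w * xi for w, xi in zip(weights[node], x)) + biases[node]
            go_right = sigmoid(score)
            next_probs.append(mass * (1.0 - go_right))  # left child
            next_probs.append(mass * go_right)          # right child
        first = 2 * first + 1
        probs = next_probs
    return probs


# Hypothetical depth-2 tree over two features. Each node uses a single
# feature (one non-zero weight), mimicking a sparse, readable split.
weights = [[4.0, 0.0], [0.0, 4.0], [0.0, 4.0]]
biases = [-2.0, -2.0, -2.0]

membership = leaf_probabilities([0.9, 0.1], weights, biases, depth=2)
most_likely_leaf = max(range(len(membership)), key=lambda i: membership[i])
```

Because each node splits its incoming probability mass between its two children, the leaf probabilities always sum to 1, so they can be read directly as soft cluster memberships. A hard assignment, if needed, is the most probable leaf.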
Pages: 281-298
Number of pages: 18