Scalable Spectral Clustering Using Random Binning Features

被引:28
|
作者
Wu, Lingfei [1 ]
Chen, Pin-Yu [1 ]
Yen, Ian En-Hsu [2 ]
Xu, Fangli [3 ]
Xia, Yinglong [4 ]
Aggarwal, Charu [1 ]
机构
[1] IBM Res AI, Armonk, NY 10504 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Coll William & Mary, Williamsburg, VA 23187 USA
[4] Huawei Res, Shenzhen, Peoples R China
来源
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2018年
关键词
Spectral clustering; Graph Construction; Random Binning Features; Eigendecomposition of Graph; PRIMME;
D O I
10.1145/3219819.3220090
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spectral clustering is one of the most effective clustering approaches that capture hidden cluster structures in the data. However, it does not scale well to large-scale problems due to its quadratic complexity in constructing similarity graphs and computing subsequent eigendecomposition. Although a number of methods have been proposed to accelerate spectral clustering, most of them compromise considerable information loss in the original data for reducing computational bottlenecks. In this paper, we present a novel scalable spectral clustering method using Random Binning features (RB) to simultaneously accelerate both similarity graph construction and the eigendecomposition. Specifically, we implicitly approximate the graph similarity (kernel) matrix by the inner product of a large sparse feature matrix generated by RB. Then we introduce a state-of-the-art SVD solver to effectively compute eigenvectors of this large matrix for spectral clustering. Using these two building blocks, we reduce the computational cost from quadratic to linear in the number of data points while achieving similar accuracy. Our theoretical analysis shows that spectral clustering via RB converges faster to the exact spectral clustering than the standard Random Feature approximation. Extensive experiments on 8 benchmarks show that the proposed method either outperforms or matches the state-of-the-art methods in both accuracy and runtime. Moreover, our method exhibits linear scalability in both the number of data samples and the number of RB features.
引用
收藏
页码:2506 / 2515
页数:10
相关论文
共 50 条
  • [31] Scalable Multi-view Spectral Clustering Based on Spectral Perturbation Theory
    Lin, Xiang
    Liang, Weixuan
    Liu, Jiyuan
    PROCEEDINGS OF THE ACM TURING AWARD CELEBRATION CONFERENCE-CHINA 2024, ACM-TURC 2024, 2024, : 92 - 99
  • [32] Fast Spectral Clustering with Random Projection and Sampling
    Sakai, Tomoya
    Imiya, Atsushi
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, 2009, 5632 : 372 - 384
  • [33] Lossless source coding using tree structured random binning
    Haghighat, Javad
    Hamouda, Walaa
    Soleymani, M. Reza
    2006 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1-6, PROCEEDINGS, 2006, : 2348 - +
  • [34] Statistical Physics of Random Binning
    Merhav, Neri
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (05) : 2454 - 2464
  • [35] Statistical Physics of Random Binning
    Merhav, Neri
    2015 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2015, : 1560 - 1564
  • [36] Two-step scalable spectral clustering algorithm using landmarks and probability density estimation
    Hong, Xia
    Gao, Junbin
    Wei, Hong
    Xiao, James
    Mitchell, Richard
    NEUROCOMPUTING, 2023, 519 : 173 - 186
  • [37] Scalable Spectral Clustering With Nystrom Approximation: Practical and Theoretical Aspects
    Pourkamali-Anaraki, Farhad
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2020, 1 : 242 - 256
  • [38] A general framework for scalable spectral clustering based on document models
    Chen, Guangliang
    PATTERN RECOGNITION LETTERS, 2019, 125 : 488 - 493
  • [39] Spectral Classification of Retinal Features Using K-Means Clustering Algorithm
    Cho, Julie
    Kashani, Amir H.
    Humayun, Mark S.
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2015, 56 (07)
  • [40] Crack Detection Using Improved Spectral Clustering Considering Effective Crack Features
    Shiotsuka, Daiki
    Matsushima, Kousuke
    Takahashi, Osamu
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION ENGINEERING (RAE 2018) AND INTERNATIONAL CONFERENCE ON ADVANCED MECHANICAL AND ELECTRICAL ENGINEERING (AMEE 2018), 2018, : 181 - 185