Spectral clustering with linear embedding: A discrete clustering method for large-scale data

被引:5
|
作者
Gao, Chenhui [1 ]
Chen, Wenzhi [1 ]
Nie, Feiping [2 ]
Yu, Weizhong [2 ]
Wang, Zonghui [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310027, Peoples R China
[2] Northwestern Polytech Univ, Xian 710072, Peoples R China
关键词
Spectral clustering; Graph embedding; Unsupervised learning;
D O I
10.1016/j.patcog.2024.110396
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent decades, spectral clustering has found widespread applications in various real -world scenarios, showcasing its effectiveness. Traditional spectral clustering typically follows a two-step procedure to address the optimization problem. However, this approach may result in substantial information loss and performance decline. Furthermore, the eigenvalue decomposition, a key step in spectral clustering, entails cubic computational complexity. This paper incorporates linear embedding into the objective function of spectral clustering and proposes a direct method to solve the indicator matrix. Moreover, our method achieves a linear time complexity with respect to the input data size. Our method, referred to as Spectral Clustering with Linear Embedding (SCLE), achieves a direct and efficient solution and naturally handles out -of -sample data. SCLE initiates the process with balanced and hierarchical K -means, effectively partitioning the input data into balanced clusters. After generating anchors, we compute a similarity matrix based on the distances between the input data points and the generated anchors. In contrast to the conventional two-step spectral clustering approach, we directly solve the cluster indicator matrix at a linear time complexity. Extensive experiments across multiple datasets underscore the efficiency and effectiveness of our proposed SCLE method.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Spectral clustering based on iterative optimization for large-scale and high-dimensional data
    Zhao, Yang
    Yuan, Yuan
    Nie, Feiping
    Wang, Qi
    NEUROCOMPUTING, 2018, 318 : 227 - 235
  • [22] Privacy-preserving constrained spectral clustering algorithm for large-scale data sets
    Li, Ji
    Wei, Jianghong
    Ye, Mao
    Liu, Wenfen
    Hu, Xuexian
    IET INFORMATION SECURITY, 2020, 14 (03) : 321 - 331
  • [23] RESKM: A General Framework to Accelerate Large-Scale Spectral Clustering
    Yang, Geping
    Deng, Sucheng
    Chen, Xiang
    Chen, Can
    Yang, Yiyang
    Gong, Zhiguo
    Hao, Zhifeng
    PATTERN RECOGNITION, 2023, 137
  • [24] Fast large-scale spectral clustering by sequential shrinkage optimization
    Liu, Tie-Yan
    Yang, Huai-Yuan
    Zheng, Xin
    Qin, Tao
    Ma, Wei-Ying
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 319 - +
  • [25] Divide-and-conquer based large-scale spectral clustering
    Li, Hongmin
    Ye, Xiucai
    Imakura, Akira
    Sakurai, Tetsuya
    NEUROCOMPUTING, 2022, 501 : 664 - 678
  • [26] Randomized Spectral Clustering in Large-Scale Stochastic Block Models
    Zhang, Hai
    Guo, Xiao
    Chang, Xiangyu
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (03) : 887 - 906
  • [27] Sparse-Reduced Computation for Large-Scale Spectral Clustering
    Baumann, Philipp
    2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2016, : 1284 - 1288
  • [28] Fast Compressive Spectral Clustering for Large-Scale Sparse Graph
    Li, Ting
    Zhang, Yiming
    Liu, Hao
    Xue, Guangtao
    Liu, Ling
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (01) : 193 - 202
  • [29] A Novel Clustering Algorithm on Large-Scale Graph Data
    Zhang, Hao
    Zhou, Wei
    Wan, Xiaoyu
    Fu, Ge
    Xu, Zhiyong
    Han, Jizhong
    2014 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2014, : 47 - 54
  • [30] Large-scale clustering of CAGE tag expression data
    Shimokawa, Kazuro
    Okamura-Oho, Yuko
    Kurita, Takio
    Frith, Martin C.
    Kawai, Jun
    Carninci, Piero
    Hayashizaki, Yoshihide
    BMC BIOINFORMATICS, 2007, 8 (1)