Spectral clustering with linear embedding: A discrete clustering method for large-scale data

被引:5
|
作者
Gao, Chenhui [1 ]
Chen, Wenzhi [1 ]
Nie, Feiping [2 ]
Yu, Weizhong [2 ]
Wang, Zonghui [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310027, Peoples R China
[2] Northwestern Polytech Univ, Xian 710072, Peoples R China
关键词
Spectral clustering; Graph embedding; Unsupervised learning;
D O I
10.1016/j.patcog.2024.110396
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent decades, spectral clustering has found widespread applications in various real -world scenarios, showcasing its effectiveness. Traditional spectral clustering typically follows a two-step procedure to address the optimization problem. However, this approach may result in substantial information loss and performance decline. Furthermore, the eigenvalue decomposition, a key step in spectral clustering, entails cubic computational complexity. This paper incorporates linear embedding into the objective function of spectral clustering and proposes a direct method to solve the indicator matrix. Moreover, our method achieves a linear time complexity with respect to the input data size. Our method, referred to as Spectral Clustering with Linear Embedding (SCLE), achieves a direct and efficient solution and naturally handles out -of -sample data. SCLE initiates the process with balanced and hierarchical K -means, effectively partitioning the input data into balanced clusters. After generating anchors, we compute a similarity matrix based on the distances between the input data points and the generated anchors. In contrast to the conventional two-step spectral clustering approach, we directly solve the cluster indicator matrix at a linear time complexity. Extensive experiments across multiple datasets underscore the efficiency and effectiveness of our proposed SCLE method.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Granular-ball-based Fast Spectral Embedding Clustering Algorithm for Large-Scale Data
    Liu, Shushu
    Cheng, Dongdong
    Xie, Jiang
    2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024, 2024, : 16 - 20
  • [2] Granular-ball-based Fast Spectral Embedding Clustering Algorithm for Large-Scale Data
    Liu, Shushu
    Cheng, Dongdong
    Xie, Jiang
    ACM International Conference Proceeding Series, : 16 - 20
  • [3] Large-Scale Clustering through Functional Embedding
    Ratle, Frederic
    Weston, Jason
    Miller, Matthew L.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 266 - +
  • [4] An improved spectral clustering method for large-scale sparse networks
    Ding, Yi
    Deng, Jiayi
    Zhang, Bo
    STATISTICS AND ITS INTERFACE, 2025, 18 (02) : 257 - 266
  • [5] LSEC: Large-scale spectral ensemble clustering
    Li, Hongmin
    Ye, Xiucai
    Imakura, Akira
    Sakurai, Tetsuya
    INTELLIGENT DATA ANALYSIS, 2023, 27 (01) : 59 - 77
  • [6] Large-scale parallel data clustering
    Judd, D
    McKinley, PK
    Jain, AK
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (08) : 871 - 876
  • [7] A study of large-scale data clustering based on fuzzy clustering
    Li, Yangyang
    Yang, Guoli
    He, Haiyang
    Jiao, Licheng
    Shang, Ronghua
    SOFT COMPUTING, 2016, 20 (08) : 3231 - 3242
  • [8] A study of large-scale data clustering based on fuzzy clustering
    Yangyang Li
    Guoli Yang
    Haiyang He
    Licheng Jiao
    Ronghua Shang
    Soft Computing, 2016, 20 : 3231 - 3242
  • [9] Compressed constrained spectral clustering framework for large-scale data sets
    Liu, Wenfen
    Ye, Mao
    Wei, Jianghong
    Hu, Xuexian
    KNOWLEDGE-BASED SYSTEMS, 2017, 135 : 77 - 88
  • [10] Spectral Clustering of Large-scale Data by Directly Solving Normalized Cut
    Chen, Xiaojun
    Hong, Weijun
    Nie, Feiping
    He, Dan
    Yang, Min
    Huang, Joshua Zhexue
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1206 - 1215