Spectral clustering with linear embedding: A discrete clustering method for large-scale data

被引:5
|
作者
Gao, Chenhui [1 ]
Chen, Wenzhi [1 ]
Nie, Feiping [2 ]
Yu, Weizhong [2 ]
Wang, Zonghui [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310027, Peoples R China
[2] Northwestern Polytech Univ, Xian 710072, Peoples R China
关键词
Spectral clustering; Graph embedding; Unsupervised learning;
D O I
10.1016/j.patcog.2024.110396
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent decades, spectral clustering has found widespread applications in various real -world scenarios, showcasing its effectiveness. Traditional spectral clustering typically follows a two-step procedure to address the optimization problem. However, this approach may result in substantial information loss and performance decline. Furthermore, the eigenvalue decomposition, a key step in spectral clustering, entails cubic computational complexity. This paper incorporates linear embedding into the objective function of spectral clustering and proposes a direct method to solve the indicator matrix. Moreover, our method achieves a linear time complexity with respect to the input data size. Our method, referred to as Spectral Clustering with Linear Embedding (SCLE), achieves a direct and efficient solution and naturally handles out -of -sample data. SCLE initiates the process with balanced and hierarchical K -means, effectively partitioning the input data into balanced clusters. After generating anchors, we compute a similarity matrix based on the distances between the input data points and the generated anchors. In contrast to the conventional two-step spectral clustering approach, we directly solve the cluster indicator matrix at a linear time complexity. Extensive experiments across multiple datasets underscore the efficiency and effectiveness of our proposed SCLE method.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Large-scale clustering of CAGE tag expression data
    Kazuro Shimokawa
    Yuko Okamura-Oho
    Takio Kurita
    Martin C Frith
    Jun Kawai
    Piero Carninci
    Yoshihide Hayashizaki
    BMC Bioinformatics, 8
  • [32] Large-scale clustering of cDNA-fingerprinting data
    Herwig, R
    Poustka, AJ
    Müller, C
    Bull, C
    Lehrach, H
    O'Brien, J
    GENOME RESEARCH, 1999, 9 (11) : 1093 - 1105
  • [33] An improved clustering method for large-scale data based on artificial immune system
    Li, Zhonghua
    Tan, Hongzhou
    Yan, Xiaoke
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13 : 920 - 924
  • [34] A heuristic method for clustering a large-scale sensor network
    Furuta, Takehiro
    Miyazawa, Hajime
    Ishizaki, Fumio
    Sasaki, Mihiro
    Suzuki, Atsuo
    2007 WIRELESS TELECOMMUNICATIONS SYMPOSIUM, 2007, : 234 - 239
  • [35] A New Clustering Method Suitable for Large Scale Data
    Xu Yin
    Hong Xingyong
    Zhou Wenjiang
    Wang Lunwen
    Zhang Ling
    Tan Ying
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 6277 - +
  • [36] Nonnegative Spectral Clustering for Large-Scale Semi-supervised Learning
    Hu, Weibo
    Chen, Chuan
    Ye, Fanghua
    Zheng, Zibin
    Ling, Guohui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 287 - 291
  • [37] Scalable Spectral Clustering for Overlapping Community Detection in Large-Scale Networks
    Van Lierde, Hadrien
    Chow, Tommy W. S.
    Chen, Guanrong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (04) : 754 - 767
  • [38] Subsampling spectral clustering for stochastic block models in large-scale networks
    Deng, Jiayi
    Huang, Danyang
    Ding, Yi
    Zhu, Yingqiu
    Jing, Bingyi
    Zhang, Bo
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 189
  • [39] Fast Large-Scale Spectral Clustering via Explicit Feature Mapping
    He, Li
    Ray, Nilanjan
    Guan, Yisheng
    Zhang, Hong
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (03) : 1058 - 1071
  • [40] Spectral Clustering of Large-scale Communities via Random Sketching and Validation
    Traganitis, Panagiotis A.
    Slavakis, Konstantinos
    Giannakis, Georgios B.
    2015 49TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2015,