Scalable spectral clustering with cosine similarity

被引:0
|
作者
Chen, Guangliang [1 ]
机构
[1] San Jose State Univ, Dept Math & Stat, San Jose, CA 95192 USA
关键词
DATA SETS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a unified scalable computing framework for three versions of spectral clustering - Normalized Cut (Shi and Malik, 2000), the Ng-Jordan-Weiss (NJW) algorithm (2001), and Diffusion Maps (Coifman and Lafon, 2006), in the setting of cosine similarity. We assume that the input data is either sparse (e.g., as a document-term frequency matrix) or of only a few hundred dimensions (e.g., for small images or data obtained through PCA). We show that in such cases, spectral clustering can be implemented solely based on efficient operations on the data matrix such as elementwise manipulation, matrix-vector multiplication and low-rank SVD, thus entirely avoiding the weight matrix. Our algorithm is simple to implement, fast to run, accurate and robust to outliers. We demonstrate its superior performance through extensive experiments which compare our scalable algorithm with the plain implementation on several benchmark data sets.
引用
收藏
页码:314 / 319
页数:6
相关论文
共 50 条
  • [41] CAR Spectral Clustering on Manifolds with Statistical and Geometrical Similarity
    Cheng, Yong
    Tong, Qiang
    ADVANCES IN NEURAL NETWORKS - ISNN 2010, PT 1, PROCEEDINGS, 2010, 6063 : 422 - +
  • [42] Federated Spectral Clustering via Secure Similarity Reconstruction
    Qiao, Dong
    Ding, Chris
    Fan, Jicong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [43] Spectral clustering and fuzzy similarity measure for images segmentation
    Rodriguez-Fernandez, Juan
    Lizarazo-Chilama, Pablo
    Munoz-Espana, Elena
    Florez-Marulanda, Juan
    UIS INGENIERIAS, 2022, 21 (03): : 9 - 20
  • [44] Local density adaptive similarity measurement for spectral clustering
    Zhang, Xianchao
    Li, Jingwei
    Yu, Hong
    PATTERN RECOGNITION LETTERS, 2011, 32 (02) : 352 - 358
  • [45] Improved spectral clustering algorithm based on similarity measure
    Cheng, Debo, 1600, Springer Verlag (8933):
  • [46] Fuzzy partition based similarity measure for spectral clustering
    1600, Science and Engineering Research Support Society (09):
  • [47] Scalable Multi-view Spectral Clustering Based on Spectral Perturbation Theory
    Lin, Xiang
    Liang, Weixuan
    Liu, Jiyuan
    PROCEEDINGS OF THE ACM TURING AWARD CELEBRATION CONFERENCE-CHINA 2024, ACM-TURC 2024, 2024, : 92 - 99
  • [48] A novel Based-Approach Composed of Clustering Algorithm & Cosine Similarity for Products Recommendation
    Al-Hagery, Mohammed Abdullah
    INTERNATIONAL JOURNAL OF EDUCATION AND INFORMATION TECHNOLOGIES, 2020, 14 : 133 - 141
  • [49] Scalable Spectral Clustering With Nystrom Approximation: Practical and Theoretical Aspects
    Pourkamali-Anaraki, Farhad
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2020, 1 : 242 - 256
  • [50] A general framework for scalable spectral clustering based on document models
    Chen, Guangliang
    PATTERN RECOGNITION LETTERS, 2019, 125 : 488 - 493