Scalable spectral clustering with cosine similarity

被引:0
|
作者
Chen, Guangliang [1 ]
机构
[1] San Jose State Univ, Dept Math & Stat, San Jose, CA 95192 USA
关键词
DATA SETS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a unified scalable computing framework for three versions of spectral clustering - Normalized Cut (Shi and Malik, 2000), the Ng-Jordan-Weiss (NJW) algorithm (2001), and Diffusion Maps (Coifman and Lafon, 2006), in the setting of cosine similarity. We assume that the input data is either sparse (e.g., as a document-term frequency matrix) or of only a few hundred dimensions (e.g., for small images or data obtained through PCA). We show that in such cases, spectral clustering can be implemented solely based on efficient operations on the data matrix such as elementwise manipulation, matrix-vector multiplication and low-rank SVD, thus entirely avoiding the weight matrix. Our algorithm is simple to implement, fast to run, accurate and robust to outliers. We demonstrate its superior performance through extensive experiments which compare our scalable algorithm with the plain implementation on several benchmark data sets.
引用
收藏
页码:314 / 319
页数:6
相关论文
共 50 条
  • [21] EVALUATION OF POLSAR SIMILARITY MEASURES WITH SPECTRAL CLUSTERING
    Hu, Jingliang
    Wang, Yuanyuan
    Ghamisi, Pedram
    Zhu, Xiao Xiang
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 3254 - 3257
  • [22] Spectral clustering based on learning similarity matrix
    Park, Seyoung
    Zhao, Hongyu
    BIOINFORMATICS, 2018, 34 (12) : 2069 - 2076
  • [23] HYBRID ATTRIBUTES SIMILARITY MEASUREMENT FOR SPECTRAL CLUSTERING
    Guan, Ya-Yong
    Wu, Tao
    Ning, Jin
    Cai, Hong-Bin
    2014 11TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2014, : 16 - 20
  • [24] Spectral Clustering Using Friendship Path Similarity
    Rodriguez, Mario
    Medrano, Carlos
    Herrero, Elias
    Orrite, Carlos
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 319 - 326
  • [25] Spectral clustering based on similarity and dissimilarity criterion
    Bangjun Wang
    Li Zhang
    Caili Wu
    Fan-zhang Li
    Zhao Zhang
    Pattern Analysis and Applications, 2017, 20 : 495 - 506
  • [26] Spectral clustering with density sensitive similarity function
    Yang, Peng
    Zhu, Qingsheng
    Huang, Biao
    KNOWLEDGE-BASED SYSTEMS, 2011, 24 (05) : 621 - 628
  • [27] Spectral clustering based on similarity and dissimilarity criterion
    Wang, Bangjun
    Zhang, Li
    Wu, Caili
    Li, Fan-zhang
    Zhang, Zhao
    PATTERN ANALYSIS AND APPLICATIONS, 2017, 20 (02) : 495 - 506
  • [28] An Empirical Analysis of Similarity Matrix for Spectral Clustering
    Zhang, Sheng
    He, Xiaoqi
    Liu, Yangguang
    Huang, Qichun
    ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING II, PTS 1-3, 2013, 433-435 : 725 - +
  • [29] Medical image compression by discrete cosine transform spectral similarity strategy
    Wu, YG
    Tai, SC
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2001, 5 (03): : 236 - 243
  • [30] ShVEEGc: EEG Clustering With Improved Cosine Similarity-Transformed Shapley Value
    Li, Guanghui
    Shen, Jiahua
    Dai, Chenglong
    Wu, Hia
    Becker, Stefanie, I
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (01): : 222 - 236