Scalable Sequential Spectral Clustering

Authors
Li, Yeqing [1]
Huang, Junzhou [1]
Liu, Wei [2]
Affiliations
[1] Univ Texas Arlington, Arlington, TX 76019 USA
[2] Didi Res, Beijing, Peoples R China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Over the past decades, Spectral Clustering (SC) has become one of the most effective clustering approaches. Although it is widely used, a significant drawback of SC is its high computational cost. Many efforts have been devoted to accelerating SC algorithms, and promising results have been achieved. However, most existing algorithms assume that the data can be stored in main memory. When the data cannot fit in memory, these algorithms suffer severe performance degradation. To overcome this issue, we propose a novel sequential SC algorithm for tackling large-scale clustering with limited computational resources, e.g., memory. We begin by investigating an effective way of approximating the graph affinity matrix by leveraging a bipartite graph. We then choose a graph construction and optimization strategy that avoids random access to data. These efforts lead to an efficient SC algorithm whose memory usage is independent of the number of input data points. Extensive experiments on large datasets demonstrate that the proposed sequential SC algorithm is up to a thousand times faster than state-of-the-art methods.
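The bipartite-graph idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm; it is a generic anchor-based approximation written under several assumptions: a Gaussian kernel, randomly sampled anchor points, and the hypothetical helper names `batch_affinity` and `sequential_spectral_embedding`. It reads the data in two sequential passes and keeps only matrices whose size depends on the number of anchors, not on the number of data points.

```python
import numpy as np

def batch_affinity(batch, anchors, sigma):
    # Gaussian affinities between a block of points and the anchors
    # (kernel choice is an assumption), row-normalized to sum to 1.
    d2 = ((batch[:, None, :] - anchors[None, :, :]) ** 2).sum(axis=2)
    # Subtracting the row minimum only improves numerical stability;
    # the factor cancels in the row normalization.
    z = np.exp(-(d2 - d2.min(axis=1, keepdims=True)) / (2.0 * sigma ** 2))
    return z / z.sum(axis=1, keepdims=True)

def sequential_spectral_embedding(batches, anchors, k, sigma=1.0):
    """Yield blocks of a k-dimensional spectral embedding.

    `batches` is a callable returning a fresh iterator over row blocks,
    so the data is only ever read front to back.
    """
    m = anchors.shape[0]
    gram = np.zeros((m, m))   # accumulates Z^T Z, size m x m
    col_sum = np.zeros(m)     # accumulates anchor-side degrees
    # Pass 1: stream the data once to build the small m x m problem.
    for batch in batches():
        z = batch_affinity(batch, anchors, sigma)
        gram += z.T @ z
        col_sum += z.sum(axis=0)
    scale = 1.0 / np.sqrt(np.maximum(col_sum, 1e-12))
    r = gram * scale[:, None] * scale[None, :]
    vals, vecs = np.linalg.eigh(r)          # eigenvalues in ascending order
    top = np.argsort(vals)[::-1][:k]        # indices of the k largest
    proj = (scale[:, None] * vecs[:, top]) / np.sqrt(np.maximum(vals[top], 1e-12))
    # Pass 2: stream again and project each block onto the top-k directions.
    for batch in batches():
        yield batch_affinity(batch, anchors, sigma) @ proj

# Toy usage: two well-separated blobs, 20 randomly chosen anchors.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.3, (200, 2)), rng.normal(5.0, 0.3, (200, 2))])
anchors = X[rng.choice(len(X), size=20, replace=False)]

def batches():
    return (X[i:i + 64] for i in range(0, len(X), 64))

U = np.vstack(list(sequential_spectral_embedding(batches, anchors, k=2)))
```

In this sketch the rows of `U` would normally be passed to k-means to obtain cluster labels. Only the `m x m` Gram matrix and one block of affinities are held at a time, which is one way to obtain memory usage that does not grow with the number of input points.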
Pages: 1809-1815
Page count: 7