Scalable Sequential Spectral Clustering

被引:0
|
作者
Li, Yeqing [1 ]
Huang, Junzhou [1 ]
Liu, Wei [2 ]
机构
[1] Univ Texas Arlington, Arlington, TX 76019 USA
[2] Didi Res, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the past decades, Spectral Clustering (SC) has become one of the most effective clustering approaches. Although it has been widely used, one significant drawback of SC is its expensive computation cost. Many efforts have been devoted to accelerating SC algorithms and promising results have been achieved. However, most of the existing algorithms rely on the assumption that data can be stored in the computer memory. When data cannot fit in the memory, these algorithms will suffer severe performance degradations. In order to overcome this issue, we propose a novel sequential SC algorithm for tackling large-scale clustering with limited computational resources, e.g., memory. We begin with investigating an effective way of approximating the graph affinity matrix via leveraging a bipartite graph. Then we choose a smart graph construction and optimization strategy to avoid random access to data. These efforts lead to an efficient SC algorithm whose memory usage is independent of the number of input data points. Extensive experiments carried out on large datasets demonstrate that the proposed sequential SC algorithm is up to a thousand times faster than the state-of-the-arts.
引用
收藏
页码:1809 / 1815
页数:7
相关论文
共 50 条
  • [21] Fast large-scale spectral clustering by sequential shrinkage optimization
    Liu, Tie-Yan
    Yang, Huai-Yuan
    Zheng, Xin
    Qin, Tao
    Ma, Wei-Ying
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 319 - +
  • [22] Scalable Spectral Clustering for Overlapping Community Detection in Large-Scale Networks
    Van Lierde, Hadrien
    Chow, Tommy W. S.
    Chen, Guanrong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (04) : 754 - 767
  • [23] A Scalable Spectral Clustering Algorithm Based on Landmark-Embedding and Cosine Similarity
    Chen, Guangliang
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2018, 2018, 11004 : 52 - 62
  • [24] Sequential spectral clustering of hyperspectral remote sensing image over bipartite graph
    Hassanzadeh, Aidin
    Kaarna, Arto
    Kauranne, Tuomo
    APPLIED SOFT COMPUTING, 2018, 73 : 727 - 734
  • [25] Scalable spectral ensemble clustering via building representative co-association matrix
    Liang, Yinian
    Ren, Zhigang
    Wu, Zongze
    Zeng, Deyu
    Li, Jianzhong
    NEUROCOMPUTING, 2020, 390 : 158 - 167
  • [26] Scalable probabilistic clustering
    Bradley, PS
    Fayyad, UM
    Reina, CA
    COMPLEMENTARITY: APPLICATIONS, ALGORITHMS AND EXTENSIONS, 2001, 50 : 43 - 65
  • [27] Scalable Clustering and Applications
    Shahid, K., I
    Chaudhury, Santanu
    TENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2016), 2016,
  • [28] Scalable clustering with smoka
    Kogan, Jacob
    ICCTA 2007: INTERNATIONAL CONFERENCE ON COMPUTING: THEORY AND APPLICATIONS, PROCEEDINGS, 2007, : 299 - 303
  • [29] Scalable Fair Clustering
    Backurs, Arturs
    Indyk, Piotr
    Onak, Krzysztof
    Schieber, Baruch
    Vakilian, Ali
    Wagner, Tal
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [30] Two-step scalable spectral clustering algorithm using landmarks and probability density estimation
    Hong, Xia
    Gao, Junbin
    Wei, Hong
    Xiao, James
    Mitchell, Richard
    NEUROCOMPUTING, 2023, 519 : 173 - 186