Scalable Sequential Spectral Clustering

被引:0
|
作者
Li, Yeqing [1 ]
Huang, Junzhou [1 ]
Liu, Wei [2 ]
机构
[1] Univ Texas Arlington, Arlington, TX 76019 USA
[2] Didi Res, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the past decades, Spectral Clustering (SC) has become one of the most effective clustering approaches. Although it has been widely used, one significant drawback of SC is its expensive computation cost. Many efforts have been devoted to accelerating SC algorithms and promising results have been achieved. However, most of the existing algorithms rely on the assumption that data can be stored in the computer memory. When data cannot fit in the memory, these algorithms will suffer severe performance degradations. In order to overcome this issue, we propose a novel sequential SC algorithm for tackling large-scale clustering with limited computational resources, e.g., memory. We begin with investigating an effective way of approximating the graph affinity matrix via leveraging a bipartite graph. Then we choose a smart graph construction and optimization strategy to avoid random access to data. These efforts lead to an efficient SC algorithm whose memory usage is independent of the number of input data points. Extensive experiments carried out on large datasets demonstrate that the proposed sequential SC algorithm is up to a thousand times faster than the state-of-the-arts.
引用
收藏
页码:1809 / 1815
页数:7
相关论文
共 50 条
  • [31] Spectral Sparsification in Spectral Clustering
    Chakeri, Alireza
    Farhidzadeh, Hamidreza
    Hall, Lawrence O.
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2301 - 2306
  • [32] Scalable Distributed Subtrajectory Clustering
    Tampakis, Panagiotis
    Pelekis, Nikos
    Doulkeridis, Christos
    Theodoridis, Yannis
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 950 - 959
  • [33] Scalable Hierarchical Agglomerative Clustering
    Monath, Nicholas
    Dubey, Kumar Avinava
    Guruganesh, Guru
    Zaheer, Manzil
    Ahmed, Amr
    McCallum, Andrew
    Mergen, Gokhan
    Najork, Marc
    Terzihan, Mert
    Tjanaka, Bryon
    Wang, Yuan
    Wu, Yuchen
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1245 - 1255
  • [34] Scalable clustering: A distributed approach
    Hore, P
    Hall, LO
    2004 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, PROCEEDINGS, 2004, : 143 - 148
  • [35] Scalable Sparse Subspace Clustering
    Peng, Xi
    Zhang, Lei
    Yi, Zhang
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 430 - 437
  • [36] Scalable fuzzy clustering algorithms
    Hall, Lawrence O.
    2008 ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1 AND 2, 2008, : 852 - 853
  • [37] Parallel and Scalable Precise Clustering
    Byma, Stuart
    Dhasade, Akash
    Altenhoff, Adrian
    Dessimoz, Christophe
    Larus, James R.
    PACT '20: PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2020, : 217 - 228
  • [38] Evaluating Scalable Fuzzy Clustering
    Gu, Yuhua
    Hall, Lawrence O.
    Goldgof, Dmitry B.
    2010 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [39] SCALABLE ALGORITHMS FOR CONVEX CLUSTERING
    Zhou, Weilian
    Yi, Haidong
    Mishne, Gal
    Chi, Eric
    2021 IEEE DATA SCIENCE AND LEARNING WORKSHOP (DSLW), 2021,
  • [40] Scalable adaptive hierarchical clustering
    Mathy, L
    Canonico, R
    Simpson, S
    Hutchison, D
    IEEE COMMUNICATIONS LETTERS, 2002, 6 (03) : 117 - 119