Understanding the Generalization Performance of Spectral Clustering Algorithms

被引:0
|
作者
Li, Shaojie
Ouyang, Sheng
Liu, Yong [1 ]
机构
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
CONSISTENCY; CONVERGENCE; CUTS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The theoretical analysis of spectral clustering is mainly devoted to consistency, while there is little research on its generalization performance. In this paper, we study the excess risk bounds of the popular spectral clustering algorithms: relaxed RatioCut and relaxed NCut. Our analysis follows the two practical steps of spectral clustering algorithms: continuous solution and discrete solution. Firstly, we provide the convergence rate of the excess risk bounds between the empirical continuous optimal solution and the population-level continuous optimal solution. Secondly, we show the fundamental quantity influencing the excess risk between the empirical discrete optimal solution and the population-level discrete optimal solution. At the empirical level, algorithms can be designed to reduce this quantity. Based on our theoretical analysis, we propose two novel algorithms that can penalize this quantity and, additionally, can cluster the out-of-sample data without re-eigendecomposition on the overall samples. Numerical experiments on toy and real datasets confirm the effectiveness of our proposed algorithms.
引用
收藏
页码:8614 / 8621
页数:8
相关论文
共 50 条
  • [21] Understanding Generalization and Optimization Performance of Deep CNNs
    Zhou, Pan
    Feng, Jiashi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [22] General Framework for Understanding Compressed Subspace Clustering Algorithms
    Meng, Linghang
    Li, Gen
    Yan, Jingkai
    Gu, Yuantao
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (06) : 1504 - 1519
  • [23] Understanding Regularized Spectral Clustering via Graph Conductance
    Zhang, Yilin
    Rohe, Karl
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [24] Performance evaluation of line simplification algorithms for vector generalization
    Shi, WZ
    Cheung, CK
    CARTOGRAPHIC JOURNAL, 2006, 43 (01): : 27 - 44
  • [25] Generalization performance of bipartite ranking algorithms with convex losses
    He, Fangchao
    Chen, Hong
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2013, 404 (02) : 528 - 536
  • [26] Performance comparisons for spectral unmixing algorithms
    Keshava, N
    IMAGING SPECTROMETRY VII, 2001, 4480 : 40 - 48
  • [27] Effects of Normalization on Spectral Unmixing and Clustering Algorithms in Raman Imaging
    Hedegaard, Martin
    Matthaeus, Christian
    Hassing, Soren
    Krafft, Christoph
    Diem, Max
    Popp, Juergen
    XXII INTERNATIONAL CONFERENCE ON RAMAN SPECTROSCOPY, 2010, 1267 : 350 - +
  • [28] Water Distribution Network Clustering: Graph Partitioning or Spectral Algorithms?
    Di Nardo, A.
    Di Natale, M.
    Giudicianni, C.
    Greco, R.
    Santonastaso, G. F.
    COMPLEX NETWORKS & THEIR APPLICATIONS VI, 2018, 689 : 1197 - 1209
  • [29] Spectral density-based clustering algorithms for complex networks
    Ramos, Taiane Coelho
    Mourao-Miranda, Janaina
    Fujita, Andre
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [30] Fermat Distances: Metric Approximation, Spectral Convergence, and Clustering Algorithms
    Trillos, Nicolas Garcia
    Little, Anna
    McKenzie, Daniel
    Murphy, James M.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25