Understanding the Generalization Performance of Spectral Clustering Algorithms

Cited by: 0
Authors
Li, Shaojie
Ouyang, Sheng
Liu, Yong [1]
Affiliations
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation;
Keywords
CONSISTENCY; CONVERGENCE; CUTS;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The theoretical analysis of spectral clustering is mainly devoted to consistency, while there is little research on its generalization performance. In this paper, we study the excess risk bounds of the popular spectral clustering algorithms: relaxed RatioCut and relaxed NCut. Our analysis follows the two practical steps of spectral clustering algorithms: continuous solution and discrete solution. Firstly, we provide the convergence rate of the excess risk bounds between the empirical continuous optimal solution and the population-level continuous optimal solution. Secondly, we show the fundamental quantity influencing the excess risk between the empirical discrete optimal solution and the population-level discrete optimal solution. At the empirical level, algorithms can be designed to reduce this quantity. Based on our theoretical analysis, we propose two novel algorithms that can penalize this quantity and, additionally, can cluster the out-of-sample data without re-eigendecomposition on the overall samples. Numerical experiments on toy and real datasets confirm the effectiveness of our proposed algorithms.
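The two steps named in the abstract (continuous solution, then discrete solution) can be illustrated with a minimal sketch of the standard relaxed RatioCut pipeline. This is not the authors' proposed algorithm: the function name `relaxed_ratiocut`, the Gaussian kernel with bandwidth `sigma`, and the farthest-point k-means initialization are all assumptions made for illustration.

```python
import numpy as np

def relaxed_ratiocut(X, k, sigma=1.0, n_iter=50):
    """Illustrative sketch: relaxed RatioCut in two steps (hypothetical helper)."""
    # Gaussian similarity matrix W (the kernel choice is an assumption).
    sq = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    W = np.exp(-sq / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    # Unnormalized graph Laplacian L = D - W.
    L = np.diag(W.sum(axis=1)) - W
    # Continuous solution: eigenvectors of the k smallest eigenvalues of L.
    # np.linalg.eigh returns eigenvalues in ascending order.
    _, vecs = np.linalg.eigh(L)
    U = vecs[:, :k]
    # Discrete solution: Lloyd's k-means on the rows of U,
    # with a deterministic farthest-point initialization (an assumption).
    centers = [U[0]]
    for _ in range(1, k):
        d = np.min([np.sum((U - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(U[d.argmax()])
    centers = np.array(centers)
    labels = np.zeros(len(U), dtype=int)
    for _ in range(n_iter):
        d = np.sum((U[:, None, :] - centers[None, :, :]) ** 2, axis=-1)
        labels = d.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = U[labels == j].mean(axis=0)
    return labels
```

Replacing `L` with the normalized Laplacian would give the corresponding relaxed NCut sketch. Note that this version recomputes the eigendecomposition on all samples whenever new points arrive, which is the cost the abstract says the proposed algorithms avoid for out-of-sample data.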
Pages: 8614-8621
Number of pages: 8