A Unified Framework for Representation-Based Subspace Clustering of Out-of-Sample and Large-Scale Data

被引:121
|
作者
Peng, Xi [1 ]
Tang, Huajin [2 ]
Zhang, Lei [2 ]
Yi, Zhang [2 ]
Xiao, Shijie [3 ]
机构
[1] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore 138632, Singapore
[2] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
基金
中国国家自然科学基金;
关键词
Error bound analysis; least square regression (LSR); low-rank representation (LRR); out-of-sample problem; scalable subspace clustering; sparse subspace clustering (SSC); SPARSE REPRESENTATION; COLLABORATIVE REPRESENTATION; RANK REPRESENTATION; FACE RECOGNITION; SPECTRAL METHODS; SEGMENTATION; KERNEL;
D O I
10.1109/TNNLS.2015.2490080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Under the framework of spectral clustering, the key of subspace clustering is building a similarity graph, which describes the neighborhood relations among data points. Some recent works build the graph using sparse, low-rank, and l(2)-norm-based representation, and have achieved the state-of-the-art performance. However, these methods have suffered from the following two limitations. First, the time complexities of these methods are at least proportional to the cube of the data size, which make those methods inefficient for solving the large-scale problems. Second, they cannot cope with the out-of-sample data that are not used to construct the similarity graph. To cluster each out-of-sample datum, the methods have to recalculate the similarity graph and the cluster membership of the whole data set. In this paper, we propose a unified framework that makes the representation-based subspace clustering algorithms feasible to cluster both the out-of-sample and the large-scale data. Under our framework, the large-scale problem is tackled by converting it as the out-of-sample problem in the manner of sampling, clustering, coding, and classifying. Furthermore, we give an estimation for the error bounds by treating each subspace as a point in a hyperspace. Extensive experimental results on various benchmark data sets show that our methods outperform several recently proposed scalable methods in clustering a large-scale data set.
引用
收藏
页码:2499 / 2512
页数:14
相关论文
共 50 条
  • [21] Compressed constrained spectral clustering framework for large-scale data sets
    Liu, Wenfen
    Ye, Mao
    Wei, Jianghong
    Hu, Xuexian
    KNOWLEDGE-BASED SYSTEMS, 2017, 135 : 77 - 88
  • [22] Latent Feature Representation-Based Low Rank Subspace Clustering for Hyperspectral Band Selection
    Shang, Xiaodi
    Zhao, Xin
    Guo, Yujie
    Sun, Xudong
    PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 416 - 429
  • [23] Large-Scale Subspace Clustering Using Random Sketching and Validation
    Traganitis, Panagiotis A.
    Slavakis, Konstantinos
    Giannakis, Georgios B.
    2015 49TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2015, : 107 - 111
  • [24] Large-Scale Subspace Clustering by Independent Distributed and Parallel Coding
    Li, Jun
    Tao, Zhiqiang
    Wu, Yue
    Zhong, Bineng
    Fu, Yun
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 9090 - 9100
  • [25] Large-Scale Subspace Clustering via k-Factorization
    Fan, Jicong
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 342 - 352
  • [26] SKETCHED SPARSE SUBSPACE CLUSTERING FOR LARGE-SCALE HYPERSPECTRAL IMAGES
    Huang, Shaoguang
    Zhang, Hongyan
    Pizurica, Aleksandra
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1766 - 1770
  • [27] Large-scale parallel data clustering
    Judd, D
    McKinley, PK
    Jain, AK
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (08) : 871 - 876
  • [28] A fast anchor-based graph-regularized low-rank representation approach for large-scale subspace clustering
    Lili Fan
    Guifu Lu
    Ganyi Tang
    Yong Wang
    Machine Vision and Applications, 2024, 35
  • [29] A fast anchor-based graph-regularized low-rank representation approach for large-scale subspace clustering
    Fan, Lili
    Lu, Guifu
    Tang, Ganyi
    Wang, Yong
    MACHINE VISION AND APPLICATIONS, 2024, 35 (01)
  • [30] Adaptive Unified Framework with Global Anchor Graph for Large-Scale Multi-view Clustering
    Shi, Lin
    Chen, Wangjie
    Liu, Yi
    Zhuang, Lihua
    Jiang, Guangqi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT 1, 2025, 15031 : 537 - 550