A Unified Framework for Representation-Based Subspace Clustering of Out-of-Sample and Large-Scale Data

被引:121
|
作者
Peng, Xi [1 ]
Tang, Huajin [2 ]
Zhang, Lei [2 ]
Yi, Zhang [2 ]
Xiao, Shijie [3 ]
机构
[1] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore 138632, Singapore
[2] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
基金
中国国家自然科学基金;
关键词
Error bound analysis; least square regression (LSR); low-rank representation (LRR); out-of-sample problem; scalable subspace clustering; sparse subspace clustering (SSC); SPARSE REPRESENTATION; COLLABORATIVE REPRESENTATION; RANK REPRESENTATION; FACE RECOGNITION; SPECTRAL METHODS; SEGMENTATION; KERNEL;
D O I
10.1109/TNNLS.2015.2490080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Under the framework of spectral clustering, the key of subspace clustering is building a similarity graph, which describes the neighborhood relations among data points. Some recent works build the graph using sparse, low-rank, and l(2)-norm-based representation, and have achieved the state-of-the-art performance. However, these methods have suffered from the following two limitations. First, the time complexities of these methods are at least proportional to the cube of the data size, which make those methods inefficient for solving the large-scale problems. Second, they cannot cope with the out-of-sample data that are not used to construct the similarity graph. To cluster each out-of-sample datum, the methods have to recalculate the similarity graph and the cluster membership of the whole data set. In this paper, we propose a unified framework that makes the representation-based subspace clustering algorithms feasible to cluster both the out-of-sample and the large-scale data. Under our framework, the large-scale problem is tackled by converting it as the out-of-sample problem in the manner of sampling, clustering, coding, and classifying. Furthermore, we give an estimation for the error bounds by treating each subspace as a point in a hyperspace. Extensive experimental results on various benchmark data sets show that our methods outperform several recently proposed scalable methods in clustering a large-scale data set.
引用
收藏
页码:2499 / 2512
页数:14
相关论文
共 50 条
  • [1] An Efficient Representation-Based Subspace Clustering Framework for Polarized Hyperspectral Images
    Chen, Zhengyi
    Zhang, Chunmin
    Mu, Tingkui
    Yan, Tingyu
    Chen, Zeyu
    Wang, Yanqiang
    REMOTE SENSING, 2019, 11 (13)
  • [2] Spectral Embedded Clustering: A Framework for In-Sample and Out-of-Sample Spectral Clustering
    Nie, Feiping
    Zeng, Zinan
    Tsang, Ivor W.
    Xu, Dong
    Zhang, Changshui
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (11): : 1796 - 1808
  • [3] A Divide-and-Conquer Framework for Large-Scale Subspace Clustering
    You, Chong
    Donnat, Claire
    Robinson, Daniel P.
    Vidal, Rene
    2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 1014 - 1018
  • [4] Efficient Subspace Clustering of Large-scale Data Streams with Misses
    Traganitis, Panagiotis A.
    Giannakis, Georgios B.
    2016 ANNUAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (CISS), 2016,
  • [5] A simple rapid sample-based clustering for large-scale data
    Chen, Yewang
    Yang, Yuanyuan
    Pei, Songwen
    Chen, Yi
    Du, Jixiang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [6] A Randomized Approach to Large-Scale Subspace Clustering
    Traganitis, Panagiotis A.
    Giannakis, Georgios B.
    2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 1019 - 1023
  • [7] Accelerated low-rank representation for subspace clustering and semi-supervised classification on large-scale data
    Fan, Jicong
    Tian, Zhaoyang
    Zhao, Mingbo
    Chow, Tommy W. S.
    NEURAL NETWORKS, 2018, 100 : 39 - 48
  • [8] Large-Scale Subspace Clustering Based on Purity Kernel Tensor Learning
    Zheng, Yilu
    Zhao, Shuai
    Zhang, Xiaoqian
    Xu, Yinlong
    Peng, Lifan
    ELECTRONICS, 2024, 13 (01)
  • [9] A study of large-scale data clustering based on fuzzy clustering
    Li, Yangyang
    Yang, Guoli
    He, Haiyang
    Jiao, Licheng
    Shang, Ronghua
    SOFT COMPUTING, 2016, 20 (08) : 3231 - 3242
  • [10] A study of large-scale data clustering based on fuzzy clustering
    Yangyang Li
    Guoli Yang
    Haiyang He
    Licheng Jiao
    Ronghua Shang
    Soft Computing, 2016, 20 : 3231 - 3242