Spectral clustering based on iterative optimization for large-scale and high-dimensional data

被引:29
|
作者
Zhao, Yang [1 ,2 ]
Yuan, Yuan [1 ]
Nie, Feiping [3 ,4 ]
Wang, Qi [3 ,4 ,5 ]
机构
[1] Chinese Acad Sci, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 049, Peoples R China
[3] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[4] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710072, Shaanxi, Peoples R China
[5] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Manifold learning; Spectral clustering; Multi-task learning; CUTS;
D O I
10.1016/j.neucom.2018.08.059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spectral graph theoretic methods have been a fundamental and important topic in the field of manifold learning and it has become a vital tool in data clustering. However, spectral clustering approaches are limited by their computational demands. It would be too expensive to provide an optimal approximation for spectral decomposition in dealing with large-scale and high-dimensional data sets. On the other hand, the rapid development of data on the Web has posed many rising challenges to the traditional single-task clustering, while the multi-task clustering provides many new thoughts for real-world applications such as video segmentation. In this paper, we will study a Spectral Clustering based on Iterative Optimization (SCIO), which solves the spectral decomposition problem of large-scale and high-dimensional data sets and it well performs on multi-task clustering. Extensive experiments on various synthetic data sets and real-world data sets demonstrate that the proposed method provides an efficient solution for spectral clustering. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:227 / 235
页数:9
相关论文
共 50 条
  • [41] Distributed Methods for High-dimensional and Large-scale Tensor Factorization
    Shin, Kijung
    Kang, U.
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 989 - 994
  • [42] Model based clustering of high-dimensional binary data
    Tang, Yang
    Browne, Ryan P.
    Mc Nicholas, Paul D.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 87 : 84 - 101
  • [43] High-dimensional optimization of large-scale steel truss structures using guided stochastic search
    Azad, Saeid Kazemzadeh
    Aminbakhsh, Saman
    STRUCTURES, 2021, 33 : 1439 - 1456
  • [44] An efficient cell-based clustering method for handling large, high-dimensional data
    Chang, JW
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2003, 2637 : 295 - 300
  • [45] A Valid Clustering Algorithm for High-dimensional Large Data Sets Based on Distributed Method
    Guo Xian e
    Yan Junmei
    PROCEEDINGS OF 2009 INTERNATIONAL WORKSHOP ON INFORMATION SECURITY AND APPLICATION, 2009, : 1 - 6
  • [46] Divide-and-conquer based large-scale spectral clustering
    Li, Hongmin
    Ye, Xiucai
    Imakura, Akira
    Sakurai, Tetsuya
    NEUROCOMPUTING, 2022, 501 : 664 - 678
  • [47] Particle swarm optimization-based automatic parameter selection for deep neural networks and its applications in large-scale and high-dimensional data
    Ye, Fei
    PLOS ONE, 2017, 12 (12):
  • [48] SPECTRAL CLUSTERING AND THE HIGH-DIMENSIONAL STOCHASTIC BLOCKMODEL
    Rohe, Karl
    Chatterjee, Sourav
    Yu, Bin
    ANNALS OF STATISTICS, 2011, 39 (04): : 1878 - 1915
  • [49] Spectral Clustering of Large-scale Data by Directly Solving Normalized Cut
    Chen, Xiaojun
    Hong, Weijun
    Nie, Feiping
    He, Dan
    Yang, Min
    Huang, Joshua Zhexue
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1206 - 1215
  • [50] Compressed constrained spectral clustering framework for large-scale data sets
    Liu, Wenfen
    Ye, Mao
    Wei, Jianghong
    Hu, Xuexian
    KNOWLEDGE-BASED SYSTEMS, 2017, 135 : 77 - 88