Spectral clustering based on iterative optimization for large-scale and high-dimensional data

被引:29
|
作者
Zhao, Yang [1 ,2 ]
Yuan, Yuan [1 ]
Nie, Feiping [3 ,4 ]
Wang, Qi [3 ,4 ,5 ]
机构
[1] Chinese Acad Sci, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 049, Peoples R China
[3] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[4] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710072, Shaanxi, Peoples R China
[5] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Manifold learning; Spectral clustering; Multi-task learning; CUTS;
D O I
10.1016/j.neucom.2018.08.059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spectral graph theoretic methods have been a fundamental and important topic in the field of manifold learning and it has become a vital tool in data clustering. However, spectral clustering approaches are limited by their computational demands. It would be too expensive to provide an optimal approximation for spectral decomposition in dealing with large-scale and high-dimensional data sets. On the other hand, the rapid development of data on the Web has posed many rising challenges to the traditional single-task clustering, while the multi-task clustering provides many new thoughts for real-world applications such as video segmentation. In this paper, we will study a Spectral Clustering based on Iterative Optimization (SCIO), which solves the spectral decomposition problem of large-scale and high-dimensional data sets and it well performs on multi-task clustering. Extensive experiments on various synthetic data sets and real-world data sets demonstrate that the proposed method provides an efficient solution for spectral clustering. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:227 / 235
页数:9
相关论文
共 50 条
  • [1] LARGE-SCALE HIGH-DIMENSIONAL CLUSTERING WITH FAST SKETCHING
    Chatalic, Antoine
    Gribonval, Remi
    Keriven, Nicolas
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4714 - 4718
  • [2] Visualizing Large-scale and High-dimensional Data
    Tang, Jian
    Liu, Jingzhou
    Zhang, Ming
    Mei, Qiaozhu
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 287 - 297
  • [3] Parallel algorithms for clustering high-dimensional large-scale datasets
    Nagesh, H
    Goil, S
    Choudhary, A
    DATA MINING FOR SCIENTIFIC AND ENGINEERING APPLICATIONS, 2001, 2 : 335 - 356
  • [4] Batched Large-scale Bayesian Optimization in High-dimensional Spaces
    Wang, Zi
    Gehring, Clement
    Kohli, Pushmeet
    Jegelka, Stefanie
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [5] An Interactive Visual Testbed System for Dimension Reduction and Clustering of Large-scale High-dimensional Data
    Choo, Jaegul
    Lee, Hanseung
    Liu, Zhicheng
    Stasko, John
    Park, Haesun
    VISUALIZATION AND DATA ANALYSIS 2013, 2013, 8654
  • [6] RECURSIVE REDUCTION NET FOR LARGE-SCALE HIGH-DIMENSIONAL DATA
    Ke, Tsung-Wei
    Liu, Tyng-Luh
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1903 - 1907
  • [7] A Supervised Learning Model for High-Dimensional and Large-Scale Data
    Peng, Chong
    Cheng, Jie
    Cheng, Qiang
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2017, 8 (02)
  • [8] Feature screening with large-scale and high-dimensional survival data
    Yi, Grace Y.
    He, Wenqing
    Carroll, Raymond. J.
    BIOMETRICS, 2022, 78 (03) : 894 - 907
  • [9] Iterative random projections for high-dimensional data clustering
    Cardoso, Angelo
    Wichert, Andreas
    PATTERN RECOGNITION LETTERS, 2012, 33 (13) : 1749 - 1755
  • [10] Scalable clustering for large high-dimensional data based on data summarization
    Lai, Ying
    Orlandic, Ratko
    Yee, Wai Gen
    Kulkarni, Sachin
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 456 - 461