MVStream: Multiview Data Stream Clustering

被引:37
|
作者
Huang, Ling [1 ,2 ,3 ]
Wang, Chang-Dong [1 ,2 ,3 ]
Chao, Hong-Yang [1 ,3 ]
Yu, Philip S. [4 ,5 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Guangdong Prov Key Lab Computat Sci, Guangzhou 510006, Peoples R China
[3] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou 510006, Peoples R China
[4] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
[5] Tsinghua Univ, Inst Data Sci, Beijing 100084, Peoples R China
关键词
Clustering algorithms; Shape; Task analysis; Support vector machines; Indexes; Data models; Computer science; Clustering; clusters of arbitrary shapes; data stream; multiview; support vector (SV); ALGORITHM;
D O I
10.1109/TNNLS.2019.2944851
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article studies a new problem of data stream clustering, namely, multiview data stream (MVStream) clustering. Although many data stream clustering algorithms have been developed, they are restricted to the single-view streaming data, and clustering MVStreams still remains largely unsolved. In addition to the many issues encountered by the conventional single-view data stream clustering, such as capturing cluster evolution and discovering clusters of arbitrary shapes under the limited computational resources, the main challenge of MVStream clustering lies in integrating information from multiple views in a streaming manner and abstracting summary statistics from the integrated features simultaneously. In this article, we propose a novel MVStream clustering algorithm for the first time. The main idea is to design a multiview support vector domain description (MVSVDD) model, by which the information from multiple insufficient views can be integrated, and the outputting support vectors (SVs) are utilized to abstract the summary statistics of the historical multiview data objects. Based on the MVSVDD model, a new multiview cluster labeling method is designed, whereby clusters of arbitrary shapes can be discovered for each view. By tracking the cluster labels of SVs in each view, the cluster evolution associated with concept drift can be captured. Since the SVs occupy only a small portion of data objects, the proposed MVStream algorithm is quite efficient with the limited computational resources. Extensive experiments are conducted to demonstrate the effectiveness and efficiency of the proposed method.
引用
收藏
页码:3482 / 3496
页数:15
相关论文
共 50 条
  • [31] Clustering Large Datasets Using Data Stream Clustering Techniques
    Bolanos, Matthew
    Forrest, John
    Hahsler, Michael
    DATA ANALYSIS, MACHINE LEARNING AND KNOWLEDGE DISCOVERY, 2014, : 135 - 143
  • [32] Iterative Multiview Subspace Learning for Unpaired Multiview Clustering
    Yang, Wanqi
    Xin, Like
    Wang, Lei
    Yang, Ming
    Yan, Wenzhu
    Gao, Yang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 14848 - 14862
  • [33] Incomplete Data Meets Uncoupled Case: A Challenging Task of Multiview Clustering
    Lin, Jia-Qi
    Li, Xiang-Long
    Chen, Man-Sheng
    Wang, Chang-Dong
    Zhang, Haizhang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 8097 - 8110
  • [34] Consensus Kernel &ITK&IT-Means Clustering for Incomplete Multiview Data
    Ye, Yongkai
    Liu, Xinwang
    Liu, Qiang
    Yin, Jianping
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2017, 2017
  • [35] A Deep Fusion Gaussian Mixture Model for Multiview Land Data Clustering
    Li, Peng
    Chen, Zhikui
    Gao, Jing
    Zhang, Jianing
    Jin, Shan
    Zhao, Wenhan
    Xia, Feng
    Wang, Lu
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020 (2020):
  • [36] Clustering Ensemble Based on Hybrid Multiview Clustering
    Yu, Zhiwen
    Wang, Daxing
    Meng, Xian-Bing
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 6518 - 6530
  • [37] An intelligent clustering algorithm for high-dimensional multiview data in big data applications
    Tao, Qian
    Gu, Chunqin
    Wang, Zhenyu
    Jiang, Daoning
    NEUROCOMPUTING, 2020, 393 : 234 - 244
  • [38] Iterative Deep Structural Graph Contrast Clustering for Multiview Raw Data
    Dong, Zhibin
    Jin, Jiaqi
    Xiao, Yuyang
    Wang, Siwei
    Zhu, Xinzhong
    Liu, Xinwang
    Zhu, En
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 18272 - 18284
  • [39] Low-Rank Tensor Regularized Fuzzy Clustering for Multiview Data
    Wei, Huiqin
    Chen, Long
    Ruan, Keyu
    Li, Lingxi
    Chen, Long
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (12) : 3087 - 3099
  • [40] Joint Representation Learning and Clustering: A Framework for Grouping Partial Multiview Data
    Zhuge, Wenzhang
    Tao, Hong
    Luo, Tingjin
    Zeng, Ling-Li
    Hou, Chenping
    Yi, Dongyun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3826 - 3840