MVStream: Multiview Data Stream Clustering

Cited by: 37
Authors
Huang, Ling [1, 2, 3]
Wang, Chang-Dong [1, 2, 3]
Chao, Hong-Yang [1, 3]
Yu, Philip S. [4, 5]
Affiliations
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Guangdong Prov Key Lab Computat Sci, Guangzhou 510006, Peoples R China
[3] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou 510006, Peoples R China
[4] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
[5] Tsinghua Univ, Inst Data Sci, Beijing 100084, Peoples R China
Keywords
Clustering algorithms; Shape; Task analysis; Support vector machines; Indexes; Data models; Computer science; Clustering; clusters of arbitrary shapes; data stream; multiview; support vector (SV); ALGORITHM;
DOI
10.1109/TNNLS.2019.2944851
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This article studies a new problem of data stream clustering, namely, multiview data stream (MVStream) clustering. Although many data stream clustering algorithms have been developed, they are restricted to single-view streaming data, and clustering MVStreams remains largely unsolved. In addition to the many issues encountered in conventional single-view data stream clustering, such as capturing cluster evolution and discovering clusters of arbitrary shapes under limited computational resources, the main challenge of MVStream clustering lies in integrating information from multiple views in a streaming manner while simultaneously abstracting summary statistics from the integrated features. In this article, we propose, for the first time, an MVStream clustering algorithm. The main idea is to design a multiview support vector domain description (MVSVDD) model, by which the information from multiple insufficient views can be integrated, and the resulting support vectors (SVs) are used to abstract the summary statistics of the historical multiview data objects. Based on the MVSVDD model, a new multiview cluster labeling method is designed, whereby clusters of arbitrary shapes can be discovered for each view. By tracking the cluster labels of SVs in each view, the cluster evolution associated with concept drift can be captured. Since the SVs account for only a small portion of the data objects, the proposed MVStream algorithm is efficient under limited computational resources. Extensive experiments are conducted to demonstrate the effectiveness and efficiency of the proposed method.
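The abstract gives no formulas, so the following is only a minimal sketch of the classical single-view support vector domain description that the MVSVDD name refers to. It is not the authors' multiview model. It assumes a Gaussian kernel and no outlier slack, in which case the dual reduces to minimising a^T K a over the probability simplex, and it solves that with a plain Frank-Wolfe loop. The function names, the kernel width `q`, the iteration count, and the threshold `tol` are all hypothetical choices made for illustration.

```python
import math
import random


def rbf(a, b, q=1.0):
    """Gaussian kernel exp(-q * ||a - b||^2); q is an assumed width parameter."""
    return math.exp(-q * sum((u - v) ** 2 for u, v in zip(a, b)))


def svdd_support_vectors(points, q=1.0, iters=500, tol=1e-3):
    """Approximate single-view SVDD (Gaussian kernel, no slack) with Frank-Wolfe.

    Because K(x, x) = 1 for the Gaussian kernel, the dual reduces to
    minimising a^T K a subject to sum(a) = 1 and a >= 0.
    Returns (alpha, indices of points whose weight exceeds tol).
    """
    n = len(points)
    K = [[rbf(p, r, q) for r in points] for p in points]
    alpha = [1.0 / n] * n  # any feasible start works
    for t in range(iters):
        # Gradient of a^T K a is 2 K a.
        grad = [2.0 * sum(K[i][j] * alpha[j] for j in range(n)) for i in range(n)]
        # The linear minimiser over the simplex is the vertex with smallest gradient.
        s = min(range(n), key=grad.__getitem__)
        step = 2.0 / (t + 2.0)
        alpha = [(1.0 - step) * a for a in alpha]
        alpha[s] += step
    support = [i for i, a in enumerate(alpha) if a > tol]
    return alpha, support


# Toy single view: a 2-D Gaussian blob (synthetic data, for illustration only).
random.seed(0)
view = [(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(60)]
alpha, sv = svdd_support_vectors(view)
print(len(sv), "of", len(view), "points kept as the summary")
```

In this sketch the points with non-negligible weight play the role of the support vectors that the abstract describes as a compact summary of historical data. How the views are integrated and how cluster labels are assigned and tracked is specific to the paper and is not reproduced here.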
Pages: 3482-3496
Page count: 15