Data stream clustering: a review

被引:0
|
作者
Alaettin Zubaroğlu
Volkan Atalay
机构
[1] Middle East Technical University,Department of Computer Engineering
来源
关键词
Data streams; Data stream clustering; Real-time clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Number of connected devices is steadily increasing and these devices continuously generate data streams. Real-time processing of data streams is arousing interest despite many challenges. Clustering is one of the most suitable methods for real-time data stream processing, because it can be applied with less prior information about the data and it does not need labeled instances. However, data stream clustering differs from traditional clustering in many aspects and it has several challenging issues. Here, we provide information regarding the concepts and common characteristics of data streams, such as concept drift, data structures for data streams, time window models and outlier detection. We comprehensively review recent data stream clustering algorithms and analyze them in terms of the base clustering technique, computational complexity and clustering accuracy. A comparison of these algorithms is given along with still open problems. We indicate popular data stream repositories and datasets, stream processing tools and platforms. Open problems about data stream clustering are also discussed.
引用
收藏
页码:1201 / 1236
页数:35
相关论文
共 50 条
  • [1] Data stream clustering: a review
    Zubaroglu, Alaettin
    Atalay, Volkan
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (02) : 1201 - 1236
  • [2] A Review of Uncertain Data Stream Clustering Algorithms
    Yang, Yue
    Liu, Zhuo
    Xing, Zhidan
    2015 EIGHTH INTERNATIONAL CONFERENCE ON INTERNET COMPUTING FOR SCIENCE AND ENGINEERING (ICICSE), 2015, : 111 - 116
  • [3] Data Stream Clustering: A Survey
    Silva, Jonathan A.
    Faria, Elaine R.
    Barros, Rodrigo C.
    Hruschka, Eduardo R.
    de Carvalho, Andre C. P. L. F.
    Gama, Joao
    ACM COMPUTING SURVEYS, 2013, 46 (01)
  • [4] A Critical Review of Density-based Data Stream Clustering Techniques
    Toor, Affan Ahmad
    Usman, Muhammad
    Ahmed, Waseem
    2016 ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM 2016), 2016, : 51 - 61
  • [5] A survey on data stream clustering and classification
    Hai-Long Nguyen
    Woon, Yew-Kwong
    Ng, Wee-Keong
    KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 45 (03) : 535 - 569
  • [6] Clustering data stream: A survey of algorithms
    Mahdiraji, Alireza
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2009, 13 (02) : 39 - 44
  • [7] An evaluation of data stream clustering algorithms
    Mansalis, Stratos
    Ntoutsi, Eirini
    Pelekis, Nikos
    Theodoridis, Yannis
    STATISTICAL ANALYSIS AND DATA MINING, 2018, 11 (04) : 167 - 187
  • [8] MVStream: Multiview Data Stream Clustering
    Huang, Ling
    Wang, Chang-Dong
    Chao, Hong-Yang
    Yu, Philip S.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (09) : 3482 - 3496
  • [9] Data Stream Clustering with Affinity Propagation
    Zhang, Xiangliang
    Furtlehner, Cyril
    Germain-Renaud, Cecile
    Sebag, Michele
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (07) : 1644 - 1656
  • [10] An improved data stream algorithm for clustering
    Kim, Sang-Sub
    Ahn, Hee-Kap
    COMPUTATIONAL GEOMETRY-THEORY AND APPLICATIONS, 2015, 48 (09): : 635 - 645