Data stream clustering: a review

被引:0
|
作者
Alaettin Zubaroğlu
Volkan Atalay
机构
[1] Middle East Technical University,Department of Computer Engineering
来源
关键词
Data streams; Data stream clustering; Real-time clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Number of connected devices is steadily increasing and these devices continuously generate data streams. Real-time processing of data streams is arousing interest despite many challenges. Clustering is one of the most suitable methods for real-time data stream processing, because it can be applied with less prior information about the data and it does not need labeled instances. However, data stream clustering differs from traditional clustering in many aspects and it has several challenging issues. Here, we provide information regarding the concepts and common characteristics of data streams, such as concept drift, data structures for data streams, time window models and outlier detection. We comprehensively review recent data stream clustering algorithms and analyze them in terms of the base clustering technique, computational complexity and clustering accuracy. A comparison of these algorithms is given along with still open problems. We indicate popular data stream repositories and datasets, stream processing tools and platforms. Open problems about data stream clustering are also discussed.
引用
收藏
页码:1201 / 1236
页数:35
相关论文
共 50 条
  • [21] An Adaptive Density Data Stream Clustering Algorithm
    Shifei Ding
    Jian Zhang
    Hongjie Jia
    Jun Qian
    Cognitive Computation, 2016, 8 : 30 - 38
  • [22] Feature-Based Data Stream Clustering
    Asbagh, Mohsen Jafari
    Abolhassani, Hassan
    PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 363 - 368
  • [23] Intrusion detection based on clustering a data stream
    Oh, SH
    Kang, JS
    Byun, YC
    Park, GL
    Byun, SY
    Third ACIS International Conference on Software Engineering Research, Managment and Applications, Proceedings, 2005, : 220 - 227
  • [24] Data Stream Clustering Based on Grid Coupling
    Zhang D.-Y.
    Zhou L.-H.
    Wu X.-Y.
    Zhao L.-H.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (03): : 667 - 683
  • [25] Varying density method for data stream clustering
    Department of Computer Engineering, Faculty of Engineering, Bu-Ali Sina University, Hamedan, Iran
    不详
    不详
    Appl. Soft Comput. J.,
  • [26] A Novel Algorithm for Adaptive Data Stream Clustering
    Ansarifar, Farnaz
    Ahmadi, Ali
    26TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2018), 2018, : 1542 - 1546
  • [27] An Ensemble Learning Approach for Data Stream Clustering
    Fathzadeh, Ramin
    Mokhtari, Vahid
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [28] A Weighted Fuzzy Clustering Algorithm for Data Stream
    Wan, Renxia
    Yan, Xiaoya
    Su, Xiaoke
    2008 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL 1, PROCEEDINGS, 2008, : 360 - +
  • [29] A Distributed Framework for Online Stream Data Clustering
    Ding, Jiafeng
    Fang, Junhua
    Chao, Pingfu
    Xu, Jiajie
    Zhao, PengPeng
    Zhao, Lei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT I, 2020, 12452 : 190 - 204
  • [30] Stydy of data stream clustering based on MSF
    Li, Yingmei
    Li, Min
    Shao, Jingbo
    Wang, Gaoyang
    International Journal of Database Theory and Application, 2015, 8 (01): : 55 - 62