A clustering algorithm for data stream based on grid-tree and similarity

Cited by: 0
Authors
Huang G. [1 ]
Guo W. [1 ]
Ren J. [1 ]
Chen L. [1 ]
Affiliation
[1] College of Information Science and Engineering
Keywords
Clustering; Data stream; Grid; Similarity
DOI
10.4156/ijact.vol3.issue9.3
Abstract
Algorithms based on k-means cannot find clusters of arbitrary shape and require the number of clusters to be specified in advance. Moreover, most grid-based clustering algorithms cannot handle boundary points accurately. To address these issues, a novel approach based on a density grid-tree and similarity, DGTSstream, is proposed. In DGTSstream, each new data record is mapped into the grid-tree, and sporadic grids are removed by setting an update cycle and a noise density threshold. The average density is used to design the density threshold. The algorithm repeatedly seeks the maximum-density grid that has no cluster flag and uses it as the starting point for finding a cluster with a depth-first strategy. Finally, similarity is used to handle the boundary points. Experimental results show that the algorithm can find clusters of arbitrary shapes and achieves better clustering accuracy and efficiency.
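The seeding and depth-first expansion described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' DGTSstream implementation. Several things are assumptions made for illustration: the function name `grid_cluster`, the `cell_size` parameter, a flat dictionary of cells in place of the grid-tree, a density threshold equal to the average density of non-empty cells, and 2-D adjacency that includes diagonals. The update cycle and the similarity-based treatment of boundary points are left out because the abstract does not define them.

```python
from collections import Counter


def grid_cluster(points, cell_size=1.0):
    """Label dense grid cells by depth-first expansion (illustrative only)."""
    # Map each 2-D record to a grid cell and count the records per cell.
    density = Counter(
        (int(x // cell_size), int(y // cell_size)) for x, y in points
    )
    if not density:
        return {}

    # Assumption: the density threshold is the average over non-empty cells.
    threshold = sum(density.values()) / len(density)
    dense = {cell for cell, count in density.items() if count >= threshold}

    labels = {}
    next_label = 0
    # Visit dense cells from highest to lowest density; skipping cells that
    # already carry a label is equivalent to repeatedly picking the densest
    # cell without a cluster flag.
    for seed in sorted(dense, key=lambda c: (-density[c], c)):
        if seed in labels:
            continue
        labels[seed] = next_label
        stack = [seed]
        while stack:  # depth-first expansion over adjacent dense cells
            cx, cy = stack.pop()
            for dx in (-1, 0, 1):
                for dy in (-1, 0, 1):
                    neighbour = (cx + dx, cy + dy)
                    if neighbour in dense and neighbour not in labels:
                        labels[neighbour] = next_label
                        stack.append(neighbour)
        next_label += 1
    return labels
```

The function returns a mapping from cell coordinates to cluster labels. Cells below the threshold receive no label, which stands in loosely for the removal of sporadic grids.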
Pages: 17-24
Number of pages: 7
Related papers
50 in total
  • [21] Grid-based clustering over an evolving data stream
    Wan, Renxia
    Chen, Jingchao
    Wang, Lixin
    Su, Xiaoke
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2009, 1 (04) : 393 - 410
  • [22] Grid-based data stream clustering for intrusion detection
    Quan, Q. (qqian@shu.edu.cn), 1600, Femto Technique Co., Ltd. (15):
  • [23] A Similarity-Based Clustering Algorithm for Fuzzy Data
    Hung, Wen-Liang
    Yang, Miin-Shen
    2010 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2010), 2010,
  • [24] Research on Data Stream Clustering Based on FCM Algorithm
    Gao, Tiancheng
    Li, Aihua
    Meng, Fan
    5TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2017, 2017, 122 : 595 - 602
  • [25] IMPROVED DENSITY BASED ALGORITHM FOR DATA STREAM CLUSTERING
    Mousavi, Maryam
    Abu Bakar, Azuraliza
    JURNAL TEKNOLOGI, 2015, 77 (18): 73 - 77
  • [26] THE CLUSTERING ALGORITHM OF EVOLUTIONAL DATA STREAM BASED ON DENSITY
    Meng, Yuyu
    Zheng, Liying
    3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE (ITCS 2011), PROCEEDINGS, 2011, : 473 - 477
  • [27] Drifted Data Stream Clustering Based on ClusTree Algorithm
    Zgraja, Jakub
    Wozniak, Michal
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018), 2018, 10870 : 338 - 349
  • [28] A Data Stream Outlier Detection Algorithm Based on Grid
    Yu Xiang
    Lei Guohua
    Xu Xiandong
    Lin Liandong
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 4136 - 4141
  • [29] Clustering Algorithm Based on Time Series Similarity to Web Data Clustering
    Yang Yan
    Yao Hua-Xiong
    Li Rong
    PROCEEDINGS OF THE 2015 4TH NATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING ( NCEECE 2015), 2016, 47 : 1373 - 1377
  • [30] Intrusion detection algorithm based on SSC-tree stream clustering
    Cheng, Chun-Ling
    Yu, Zhi-Hu
    Zhang, Deng-Yin
    Xu, Xiao-Long
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2012, 34 (03): 625 - 630