Clustering Heterogeneous Data Streams with Uncertainty over Sliding Window

Cited by: 0
Authors
Hentech, Houda [1 ]
Gouider, Mohammed Salah [1 ]
Farhat, Amine [1 ]
Affiliations
[1] Univ Tunis, Inst Super Gest Tunis, BESTMOD, Cite Bouchoucha 2000, Le Bardo, Tunisia
Source
MODEL AND DATA ENGINEERING, MEDI 2013 | 2013 / Vol. 8216
Keywords
Data streams; uncertainty; clustering; similarity measure; sliding window model;
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Existing methods for clustering uncertain data streams over sliding windows do not handle categorical attributes. However, uncertain mixed data are ubiquitous. This paper investigates the problem of clustering heterogeneous data streams pervaded by uncertainty over sliding windows, referred to as SWHU-Clustering. A Heterogeneous Uncertain Temporal Cluster Feature (HUTCF) is introduced to monitor the distribution statistics of mixed data points. Based on this structure, the Exponential Histogram of Heterogeneous Uncertain Cluster Feature (EHHUCF) is presented as a collection of HUTCFs. This structure helps handle in-cluster evolution and detect temporal changes in the cluster distribution. The approach has several advantages over existing methods: 1) higher execution efficiency, because its design avoids the effects of old data on the final results; 2) k-NN is incorporated into the clustering process to reduce the complexity of the algorithm; 3) memory consumption can be managed efficiently by limiting the number of HUTCFs in each EHHUCF. Simulations on real databases show the feasibility of SWHU-Clustering, and a comparison with the UMicro algorithm shows its effectiveness.
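The abstract does not give the definitions of HUTCF or EHHUCF, so the following is only a minimal sketch of the general idea, not the authors' structures. It assumes an additive summary for mixed points (sums of expected values, squared values and error variances for numeric attributes, value counts for categorical attributes, plus the latest timestamp) and the standard exponential-histogram rule of merging the two oldest buckets of equal size. The names `Feature`, `FeatureHistogram`, `window` and `per_size` are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass


def _add(xs, ys):
    """Element-wise sum of two equal-length lists (numbers or Counters)."""
    return [a + b for a, b in zip(xs, ys)]


@dataclass
class Feature:
    """Additive summary of mixed points; an illustrative stand-in, not HUTCF."""
    n: int            # number of points summarized
    lin_sum: list     # per numeric attribute: sum of expected values
    sq_sum: list      # per numeric attribute: sum of squared expected values
    err_sum: list     # per numeric attribute: sum of error variances
    cat_counts: list  # per categorical attribute: Counter of observed values
    last_t: int       # timestamp of the most recent point absorbed

    @classmethod
    def from_point(cls, numeric, errors, categorical, t):
        return cls(1, list(numeric), [x * x for x in numeric],
                   list(errors), [Counter([c]) for c in categorical], t)

    def merge(self, other):
        return Feature(self.n + other.n,
                       _add(self.lin_sum, other.lin_sum),
                       _add(self.sq_sum, other.sq_sum),
                       _add(self.err_sum, other.err_sum),
                       _add(self.cat_counts, other.cat_counts),
                       max(self.last_t, other.last_t))


class FeatureHistogram:
    """Buckets of sizes 1, 2, 4, ... kept oldest first (assumed layout)."""

    def __init__(self, window, per_size=2):
        self.window = window      # sliding window length in time units
        self.per_size = per_size  # max buckets allowed per bucket size
        self.buckets = []

    def insert(self, point_feature, now):
        # Expire buckets whose newest point lies outside (now - window, now].
        self.buckets = [b for b in self.buckets
                        if b.last_t > now - self.window]
        self.buckets.append(point_feature)
        size = 1
        while True:
            idx = [i for i, b in enumerate(self.buckets) if b.n == size]
            if len(idx) <= self.per_size:
                break
            i, j = idx[0], idx[1]  # the two oldest buckets of this size
            self.buckets[j] = self.buckets[i].merge(self.buckets[j])
            del self.buckets[i]
            size *= 2

    def summary(self):
        """Merge all live buckets into one feature, or None if empty."""
        if not self.buckets:
            return None
        total = self.buckets[0]
        for b in self.buckets[1:]:
            total = total.merge(b)
        return total


hist = FeatureHistogram(window=4, per_size=2)
for t in range(1, 7):
    label = "a" if t % 2 else "b"
    hist.insert(Feature.from_point([float(t)], [0.1], [label], t), now=t)
print(hist.summary().n, len(hist.buckets))
```

In this sketch, old data stops influencing the summary once a bucket's latest timestamp leaves the window, and the `per_size` cap bounds how many buckets each histogram holds. How the paper itself defines expiry, merging and the k-NN step is not stated in the abstract.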
Pages: 162 - 175
Page count: 14
Related papers
50 in total
  • [11] Simultaneous sliding window join approach over multiple data streams
    Qian, Jiangbo
    Xu, Hongbing
    Wang, Yongli
    Liu, Xuejun
    Dong, Yisheng
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2005, 42 (10): : 1771 - 1778
  • [12] GDSW: A General Framework for Distributed Sliding Window over Data Streams
    Chen, Huan
    Wang, Yijie
    Wang, Yuan
    Ma, Xingkong
    2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 729 - 736
  • [13] A performance evaluation of data streams sampling algorithms over a sliding window
    El Sibai, Rayane
    Chabchoub, Yousra
    Demerjian, Jacques
    Chiky, Raja
    Barbar, Kablan
    2018 IEEE MIDDLE EAST AND NORTH AFRICA COMMUNICATIONS CONFERENCE (MENACOMM), 2018, : 211 - 216
  • [14] Research on sliding window join semantics and join algorithm in heterogeneous data streams
    Du, Wei
    Zou, Xianxia
    Open Cybernetics and Systemics Journal, 2015, 9 : 556 - 564
  • [15] Sliding Window Top-K Monitoring over Distributed Data Streams
    Lv, Zhijin
    Chen, Ben
    Yu, Xiaohui
    WEB AND BIG DATA, APWEB-WAIM 2017, PT I, 2017, 10366 : 527 - 540
  • [16] Sliding Window Top-K Monitoring over Distributed Data Streams
    Chen B.
    Lv Z.
    Yu X.
    Liu Y.
    Data Science and Engineering, 2017, 2 (4) : 289 - 300
  • [17] Mining weighted frequent itemsets using window sliding over data streams
    Kim, Younghee
    Kim, Wonyoung
    Ryu, Joonsuk
    Kim, Ungmo
    ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 708 - 713
  • [18] A dynamic layout of sliding window for frequent itemset mining over data streams
    Deypir, Mahmood
    Sadreddini, Mohammad Hadi
    JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (03) : 746 - 759
  • [19] Mining Discriminative Itemsets Over Data Streams Using Efficient Sliding Window
    Seyfi M.
    Nayak R.
    Xu Y.
    SN Computer Science, 4 (5)
  • [20] Sliding window-based frequent pattern mining over data streams
    Tanbeer, Syed Khairuzzaman
    Ahmed, Chowdhury Farhan
    Jeong, Byeong-Soo
    Lee, Young-Koo
    INFORMATION SCIENCES, 2009, 179 (22) : 3843 - 3865