Clustering Heterogeneous Data Streams with Uncertainty over Sliding Window

被引:0
|
作者
Hentech, Houda [1 ]
Gouider, Mohammed Salah [1 ]
Farhat, Amine [1 ]
机构
[1] Univ Tunis, Inst Super Gest Tunis, BESTMOD, Cite Bouchoucha 2000, Le Bardo, Tunisia
来源
关键词
Data streams; uncertainty; clustering; similarity measure; sliding window model;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Existing methods for clustering uncertain data streams over sliding windows do not treat the categorical attributes. However, uncertain mixed data are ubiquitous. This paper investigates the problem of clustering heterogeneous data streams pervaded by uncertainty over sliding windows, so-called SWHU-Clustering. A Heterogeneous Uncertain Temporal Cluster Feature (HUTCF) is introduced to monitor the distribution statistics of mixed data points. Based on this structure, Exponential Histogram of Heterogeneous Uncertain Cluster Feature (EHHUCF) is presented as a collection of HUTCF. This structure may help to handle the in-cluster evolution, and detects the temporal change of the cluster distribution. Our approach has several advantages over existing method: 1) the higher execution efficiency benefits from its good design as it avoids the effects of old data on the final results. 2) We incorporated the k-NN into the clustering process in order to reduce the complexity of the algorithm. 3) Memory consumption can be managed efficiently by limiting the number of HUTCF in each EHHUCF. Simulations on real databases show the feasibility of SWHU-Clustering as well as its effectiveness by comparing it with UMicro algorithm.
引用
收藏
页码:162 / 175
页数:14
相关论文
共 50 条
  • [1] HCLUWIN: AN ALGORITHM FOR CLUSTERING HETEROGENEOUS DATA STREAMS OVER SLIDING WINDOWS
    Ren, Jiadong
    Hu, Changzhen
    Ma, Ruiqing
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (05): : 2171 - 2179
  • [2] Density and sliding window-based clustering over evolving data streams
    Yu, Yanwei
    Zhao, Jindong
    Zhang, Yonggang
    Wen, Changci
    ICIC Express Letters, Part B: Applications, 2015, 6 (08): : 2275 - 2283
  • [3] Clustering Data Streams over Sliding Windows by DCA
    Ta Minh Thuy
    Le Thi Hoai An
    Boudjeloud-Assala, Lydia
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2013, 479 : 65 - 75
  • [4] Extending Sliding-Window Semantics over Data Streams
    Chen, Leisong
    Lin, Guoping
    ISCSCT 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY, VOL 2, PROCEEDINGS, 2008, : 110 - +
  • [5] On concurrency control in sliding window queries over data streams
    Golab, Lukasz
    Bijay, Kumar Gaurav
    Ozsu, M. Tamer
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 608 - 626
  • [6] Incremental and Adaptive Clustering Stream Data over Sliding Window
    Dang, Xuan Hong
    Lee, Vincent C. S.
    Ng, Wee Keong
    Ong, Kok Leong
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 660 - +
  • [7] Mining frequent patterns in an arbitrary sliding window over data streams
    Li, Guohui
    Chen, Hui
    Yang, Bing
    Chen, Gang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 496 - 503
  • [8] Semantics and Implementation of Continuous Sliding Window Queries over Data Streams
    Kraemer, Juergen
    Seeger, Bernhard
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (01):
  • [9] Mining maximal frequent itemsets in a sliding window over data streams
    Mao Y.
    Li H.
    Yang L.
    Liu L.
    Gaojishu Tongxin/Chinese High Technology Letters, 2010, 20 (11): : 1142 - 1148
  • [10] Incremental evaluation of sliding-window queries over data streams
    Ghanem, Thanaa M.
    Hammad, Moustafa A.
    Mokbel, Mohamed F.
    Aref, Walid G.
    Elmagarmid, Ahmed K.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (01) : 57 - 72