Enhancing the DISSFCM Algorithm for Data Stream Classification

Cited by: 3
Authors
Casalino, Gabriella [1 ,2 ]
Castellano, Giovanna [1 ,2 ]
Fanelli, Anna Maria [1 ]
Mencar, Corrado [1 ,2 ]
Affiliations
[1] Univ Bari Aldo Moro, Comp Sci Dept, Bari, Italy
[2] INdAM Res Grp GNCS, Rome, Italy
Source
Keywords
Data stream classification; Semi-supervised fuzzy clustering; Incremental adaptive clustering;
DOI
10.1007/978-3-030-12544-8_9
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Analyzing data streams has become a new challenge in meeting the demands of real-time analytics. Conventional mining techniques cope poorly with the challenges of data streams, which include resource constraints such as memory and running time, and the need to process the data in a single scan. Most existing data stream classification methods require labeled samples, which are more difficult and expensive to obtain than unlabeled ones. Semi-supervised learning algorithms address this problem by using unlabeled samples together with a few labeled ones to build classification models. Recently we proposed DISSFCM, an algorithm for data stream classification based on incremental semi-supervised fuzzy clustering. To cope with the evolution of data, DISSFCM dynamically adapts the number of clusters by splitting large-scale clusters. While splitting is effective in improving the quality of clusters, repeated application without a counterbalance may produce many small-scale clusters. To solve this problem, in this paper we enhance DISSFCM by introducing a procedure that merges small-scale clusters. Preliminary experimental results on a real-world benchmark dataset show the effectiveness of the method.
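The merging idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' procedure: the abstract does not give the merge criterion, so the sketch assumes crisp (non-fuzzy) labels, a hypothetical size threshold `min_size`, and a "merge into the nearest larger cluster by centroid distance" rule. The function name `merge_small_clusters` and all parameters are illustrative assumptions.

```python
from math import dist


def merge_small_clusters(points, labels, centroids, min_size):
    """Merge each cluster with fewer than `min_size` points into the
    nearest cluster that is large enough (hypothetical criterion)."""
    # Count how many points each cluster currently holds.
    counts = {k: 0 for k in range(len(centroids))}
    for lab in labels:
        counts[lab] += 1

    # Clusters that survive: non-empty and at least `min_size` points.
    keep = [k for k, n in counts.items() if n > 0 and n >= min_size]
    if not keep:
        # Nothing large enough to merge into: leave the input unchanged.
        return list(labels), list(centroids)

    # Map every small cluster to its nearest surviving cluster.
    target = {k: k for k in keep}
    for k in counts:
        if k not in target:
            target[k] = min(keep, key=lambda j: dist(centroids[k], centroids[j]))

    # Relabel points and renumber surviving clusters as 0..len(keep)-1.
    new_index = {k: i for i, k in enumerate(keep)}
    new_labels = [new_index[target[lab]] for lab in labels]

    # Recompute each surviving centroid as the plain mean of its points.
    dim = len(points[0])
    sums = [[0.0] * dim for _ in keep]
    sizes = [0] * len(keep)
    for p, lab in zip(points, new_labels):
        sizes[lab] += 1
        for d in range(dim):
            sums[lab][d] += p[d]
    new_centroids = [tuple(s / n for s in row) for row, n in zip(sums, sizes)]
    return new_labels, new_centroids
```

A fuzzy variant would carry membership degrees rather than hard labels and would weight the centroid update accordingly; the sketch only conveys how merging counterbalances repeated splitting by reducing the number of clusters.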
Pages: 109 - 122
Number of pages: 14
Related papers
50 items in total
  • [41] Data stream classification with artificial endocrine system
    Li Zhao
    Lei Wang
    Qingzheng Xu
    Applied Intelligence, 2012, 37 : 390 - 404
  • [42] Enhancing privacy in remote data classification
    Piva, A.
    Orlandi, C.
    Caini, M.
    Bianchi, T.
    Barni, M.
    PROCEEDINGS OF THE IFIP TC 11/ 23RD INTERNATIONAL INFORMATION SECURITY CONFERENCE, 2008, : 33 - +
  • [43] Data stream classification with artificial endocrine system
    Zhao, Li
    Wang, Lei
    Xu, Qingzheng
    APPLIED INTELLIGENCE, 2012, 37 (03) : 390 - 404
  • [44] Data Stream Classification based on an Associative Classifier
    Lopez-Medina, Karen Pamela
    Uriarte-Arcia, Abril Valeria
    Yanez-Marquez, Cornelio
    COMPUTACION Y SISTEMAS, 2024, 28 (02): : 387 - 400
  • [45] Imbalanced Data Stream Classification: Analysis and Solution
    Anjana, Koringa
    Radhika, Kotecha
    Darshana, Patel
    INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS (ICTIS 2017) - VOL 2, 2018, 84 : 316 - 324
  • [46] Exploiting empirical variance for data stream classification
    Muhammad Zia-Ur Rehman
    Tian-rui Li
    Tao Li
    Zia-Ur Rehman, M. (moh.zia@gmail.com), 1600, Shanghai Jiao Tong University (17): : 245 - 250
  • [47] An overview of complex data stream ensemble classification
    Zhang, Xilong
    Han, Meng
    Wu, Hongxin
    Li, Muhang
    Chen, Zhiqiang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (02) : 3667 - 3695
  • [48] Transfer estimation and the applications in data stream classification
    Zhang, Zhihao
    Zhou, Jie
    MIPPR 2011: PATTERN RECOGNITION AND COMPUTER VISION, 2011, 8004
  • [49] Density estimation technique for data stream classification
    Kerdprasop, Nittaya
    Kerdprasop, Kittisak
    SEVENTEENTH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, : 662 - +
  • [50] Data Stream Classification Based on the Gamma Classifier
    Valeria Uriarte-Arcia, Abril
    Lopez-Yanez, Itzama
    Yanez-Marquez, Cornelio
    Gama, Joao
    Camacho-Nieto, Oscar
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015