A novel approach using incremental oversampling for data stream mining

Cited by: 5
Authors
Anupama, N. [1 ]
Jena, Sudarson [2 ]
Affiliations
[1] GITAM Univ, Hyderabad, India
[2] Sambalpur Univ, Inst Informat Technol, Sambalpur, India
Keywords
Knowledge discovery; Data streams; Imbalanced data; Oversampling; Increment over sampling for data streams (IOSDS); CLASSIFICATION;
DOI
10.1007/s12530-018-9249-5
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data stream mining has become very popular in recent years, as advanced electronic devices generate continuous data streams. The performance of standard learning algorithms is compromised by the class imbalance present in real-world data streams. In this paper we propose a novel algorithm, increment over sampling for data streams (IOSDS), which uses a unique oversampling technique to nearly balance the data sets and so minimize the effect of imbalance on the stream mining process. The experimental analysis is conducted on 15 data chunks of data streams with varied sizes and different imbalance ratios. The results suggest that the proposed IOSDS algorithm improves knowledge discovery over benchmark algorithms such as C4.5 and Hoeffding tree in terms of standard performance measures, namely accuracy, AUC, precision, recall, and F-measure.
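The abstract does not spell out how IOSDS generates new minority instances, so the following is only a minimal sketch of the general idea of oversampling a stream chunk by chunk. It uses plain random duplication of minority rows, which is a stand-in technique and not the authors' method. The function name `oversample_chunk`, the `target_ratio` parameter, and the `(features, label)` row layout are all assumptions made for illustration.

```python
import random
from collections import Counter


def oversample_chunk(chunk, target_ratio=0.9, seed=0):
    """Duplicate minority-class rows at random until each minority class
    holds about target_ratio times as many rows as the majority class.

    chunk is a list of (features, label) pairs. This is generic random
    oversampling, NOT the IOSDS procedure from the paper.
    """
    rng = random.Random(seed)
    counts = Counter(label for _, label in chunk)
    if len(counts) < 2:
        return list(chunk)  # only one class present, nothing to balance
    majority_label, majority_n = counts.most_common(1)[0]
    balanced = list(chunk)
    for label, n in counts.items():
        if label == majority_label:
            continue
        pool = [row for row in chunk if row[1] == label]
        needed = max(0, int(target_ratio * majority_n) - n)
        balanced.extend(rng.choice(pool) for _ in range(needed))
    return balanced


# Hypothetical stream: two chunks with different imbalance ratios.
stream = [
    [((i,), "neg") for i in range(10)] + [((i,), "pos") for i in range(2)],
    [((i,), "neg") for i in range(20)] + [((i,), "pos") for i in range(5)],
]
for chunk in stream:
    balanced = oversample_chunk(chunk)
    print(Counter(label for _, label in balanced))
```

Each chunk is rebalanced independently before it would be handed to a learner such as a C4.5 or Hoeffding tree implementation; keeping `target_ratio` below 1.0 mirrors the abstract's wording that the data are "almost" balanced rather than fully balanced.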
Pages: 351-362
Number of pages: 12
Related papers
50 in total
  • [41] Incremental clustering of data stream using real ants behavior
    Masmoudi, Nesrine
    Azzag, Hanane
    Lebbah, Mustapha
    Bertelle, Cyrille
    2014 SIXTH WORLD CONGRESS ON NATURE AND BIOLOGICALLY INSPIRED COMPUTING (NABIC), 2014, : 262 - 268
  • [42] Novel Oversampling Algorithm for Handling Imbalanced Data Classification
    More, Anjali S.
    Rana, Dipti P.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (08) : 491 - 496
  • [43] Towards a new approach for mining frequent itemsets on data stream
    Chedy Raïssi
    Pascal Poncelet
    Maguelonne Teisseire
    Journal of Intelligent Information Systems, 2007, 28 : 23 - 36
  • [44] Towards a new approach for mining frequent itemsets on data stream
    Raissi, Chedy
    Poncelet, Pascal
    Teisseire, Maguelonne
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2007, 28 (01) : 23 - 36
  • [45] Mining of Data Stream Using "DDenStream" Clustering Algorithm
    Kumar, Manoj
    Sharma, Ashish
    PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE IN MOOC, INNOVATION AND TECHNOLOGY IN EDUCATION (MITE), 2013, : 315 - 320
  • [46] Preprocessing Using Attribute Selection in Data Stream Mining
    Sangeetha, R.
    Sathappan, S.
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES 2018), 2018, : 431 - 438
  • [47] PiCo: A Novel Approach to Stream Data Analytics
    Misale, Claudia
    Drocco, Maurizio
    Tremblay, Guy
    Aldinucci, Marco
    EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 118 - 128
  • [48] A Novel Approach to Extracting Casing Status Features Using Data Mining
    Chen, Jikai
    Li, Haoyu
    Wang, Yanjun
    Xie, Ronghua
    Liu, Xingbin
    ENTROPY, 2014, 16 (01) : 389 - 404
  • [49] A Novel Approach for Upgrading Indian Education by Using Data Mining Techniques
    Banumathi, A.
    Pethalakshmi, A.
    2012 IEEE INTERNATIONAL CONFERENCE ON TECHNOLOGY ENHANCED EDUCATION (ICTEE 2012), 2012,
  • [50] A novel manufacturing defect detection method using data mining approach
    Chen, WC
    Tseng, SS
    Wang, CY
    INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2004, 3029 : 77 - 86