A novel approach using incremental oversampling for data stream mining

被引:5
|
作者
Anupama, N. [1 ]
Jena, Sudarson [2 ]
机构
[1] GITAM Univ, Hyderabad, India
[2] Sambalpur Univ, Inst Informat Technol, Sambalpur, India
关键词
Knowledge discovery; Data streams; Imbalanced data; Oversampling; Increment over sampling for data streams (IOSDS); CLASSIFICATION;
D O I
10.1007/s12530-018-9249-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream mining is very popular in recent years with advanced electronic devices generating continuous data streams. The performance of standard learning algorithms is been compromised with imbalance nature present in real world data streams. In this paper we propose a novel algorithm dubbed as increment over sampling for data streams (IOSDS) which uses an unique over sampling technique to almost balance the data sets to minimize the effect of imbalance in stream mining process. The experimental analysis is conducted on 15 data chunks of data streams with varied sizes and different imbalance ratios. The results suggests that the proposed IOSDS algorithm improves the knowledge discovery over benchmark algorithms like C4.5 and Hoeffding tree in terms of standard performance measures namely accuracy, AUC, precision, recall and F-measure.
引用
收藏
页码:351 / 362
页数:12
相关论文
共 50 条
  • [21] An oversampling approach for mining program specifications
    Deng Chen
    Yan-duo Zhang
    Wei Wei
    Rong-cun Wang
    Xiao-lin Li
    Wei Liu
    Shi-xun Wang
    Rui Zhu
    Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 737 - 754
  • [22] A Data Mining Approach to Incremental Adaptive Functional Diagnosis
    Bolchini, Cristiana
    Quintarelli, Elisa
    Salice, Fabio
    Garza, Paolo
    PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFTS), 2013, : 13 - 18
  • [23] Dynamic pattern mining: An incremental data clustering approach
    Chung, S
    McLeod, D
    JOURNAL ON DATA SEMANTICS II, 2005, 3360 : 85 - 112
  • [24] An Intensified Approach for Privacy Preservation in Incremental Data Mining
    Rajalakshmi, V.
    Mala, G. S. Anandha
    ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 3, 2013, 178 : 347 - +
  • [25] BIT STREAM ADDER FOR OVERSAMPLING CODED DATA
    OLEARY, P
    MALOBERTI, F
    ELECTRONICS LETTERS, 1990, 26 (20) : 1708 - 1709
  • [26] Batch -Incremental Classification of Stream Data Using Storage
    Ponkiya, Parita
    Srivastava, Rohit
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (04): : 95 - 99
  • [27] Batch -Incremental Classification of Stream Data Using Storage
    Ponkiya, Parita
    Srivastava, Rohit
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (05): : 91 - 95
  • [28] Efficient Incremental Itemset Tree for Approximate Frequent Itemset Mining On Data Stream
    Bai, Pavitra S.
    Kumar, Ravi G. K.
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2016, : 239 - 242
  • [29] Underwater Sonar Signals Recognition by Incremental Data Stream Mining with Conflict Analysis
    Fong, Simon
    Deb, Suash
    Wong, Raymond
    Sun, Guangmin
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2014,
  • [30] Incremental multi-dimension scaling visualization mining method for data stream
    Ni, Ping
    Liao, Jian-Xin
    Zhu, Xiao-Min
    Wan, Li
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2011, 41 (03): : 817 - 821