ElStream: An Ensemble Learning Approach for Concept Drift Detection in Dynamic Social Big Data Stream Learning

Cited by: 63
Authors
Abbasi, Ahmad [1 ]
Javed, Abdul Rehman [2 ]
Chakraborty, Chinmay [3 ]
Nebhen, Jamel [4 ]
Zehra, Wisha [1 ]
Jalil, Zunera [2 ]
Affiliations
[1] Air Univ, Fac Comp & AI, Islamabad 44000, Pakistan
[2] Air Univ, Dept Cyber Secur, Islamabad 44000, Pakistan
[3] Birla Inst Technol, Dept Elect & Commun Engn, Ranchi 835215, Bihar, India
[4] Prince Sattam Bin Abdulaziz Univ, Coll Comp Sci & Engn, Al Kharj 11942, Saudi Arabia
Keywords
Big Data; Machine learning; Light emitting diodes; Training; Data models; Standards; Licenses; Internet of Things; big data; smart concept drift; social data; online learning; ensemble learning; HETEROGENEOUS ENSEMBLE; ONLINE; CLASSIFIER;
DOI
10.1109/ACCESS.2021.3076264
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
With the rapid growth of communication technologies and smart devices, data traffic has surged. Applications, users, and devices generate a huge amount of data every second. This rapid generation of data has created a need for solutions that can analyze how data changes over time in unforeseen ways, despite resource constraints. Such unforeseeable changes in the underlying distribution of streaming data over time are known as concept drifts. This paper presents a novel approach named ElStream that detects concept drift using ensemble and conventional machine learning techniques on both real and artificial data. ElStream uses majority voting, letting only the optimum classifier vote on the decision. Experiments were conducted to evaluate the performance of the proposed approach. According to the experimental analysis, the ensemble learning approach provides consistent performance on both artificial and real-world data sets. The experiments show that ElStream improves accuracy by 12.49%, 11.98%, 10.06%, 1.2%, and 0.33% on the PokerHand, LED, Random RBF, Electricity, and SEA datasets, respectively, compared with previous state-of-the-art studies and conventional machine learning algorithms.
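The abstract does not spell out how voters are selected, so the following is only a minimal sketch of majority voting restricted to currently well-performing classifiers on a drifting stream, not ElStream's actual procedure. Everything in it is an assumption made for illustration: the `WindowedVoter` class, the sliding-window accuracy criterion, the `window` and `top_k` parameters, the threshold rules standing in for classifiers, and the deterministic toy stream with one abrupt drift.

```python
from collections import Counter, deque


def majority_vote(predictions):
    """Return the most frequent label; ties go to the label seen first."""
    return Counter(predictions).most_common(1)[0][0]


class WindowedVoter:
    """Majority vote among the top_k classifiers ranked by recent accuracy.

    Assumption for illustration: "optimum" means highest accuracy over a
    sliding window of the most recent labelled samples.
    """

    def __init__(self, classifiers, window=20, top_k=3):
        self.classifiers = classifiers
        self.top_k = top_k
        # One bounded hit/miss history per classifier.
        self.history = [deque(maxlen=window) for _ in classifiers]

    def _recent_accuracy(self, index):
        hits = self.history[index]
        return sum(hits) / len(hits) if hits else 0.0

    def predict(self, x):
        # Rank classifiers by windowed accuracy and let only the best vote.
        ranked = sorted(range(len(self.classifiers)),
                        key=self._recent_accuracy, reverse=True)
        voters = ranked[:self.top_k]
        return majority_vote([self.classifiers[i](x) for i in voters])

    def update(self, x, y):
        # Score every classifier on the newly labelled sample.
        for i, clf in enumerate(self.classifiers):
            self.history[i].append(clf(x) == y)


def run_toy_stream(n=400, drift_at=200):
    """Test-then-train on a toy stream whose concept flips at drift_at."""
    rules = [
        # Hypothetical rules that fit the concept before the drift.
        lambda x: x > 0.5, lambda x: x > 0.4, lambda x: x > 0.6,
        # Hypothetical rules that fit the concept after the drift.
        lambda x: x <= 0.5, lambda x: x <= 0.4, lambda x: x <= 0.6,
    ]
    voter = WindowedVoter(rules)
    correct = []
    for i in range(n):
        x = (i * 0.37) % 1.0  # deterministic values spread over [0, 1)
        y = (x > 0.5) if i < drift_at else (x <= 0.5)  # abrupt drift
        correct.append(voter.predict(x) == y)  # test first ...
        voter.update(x, y)                     # ... then learn from the label
    return correct


correct = run_toy_stream()
print("accuracy before drift:", sum(correct[:200]) / 200)
print("accuracy in the 50 samples after drift:", sum(correct[200:250]) / 50)
print("accuracy in the last 100 samples:", sum(correct[-100:]) / 100)
```

In this sketch the ensemble keeps voting with the old rules immediately after the drift, then recovers once the sliding window fills with post-drift samples and the ranking hands the vote to the rules that fit the new concept. A shorter window reacts faster but makes the ranking noisier.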
Pages: 66408-66419
Number of pages: 12
Related papers
50 in total
  • [21] A dynamic hierarchical incremental learning-based supervised clustering for data stream with considering concept drift
    Nikpour, Soheila
    Asadi, Shahrokh
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (6) : 2983 - 3003
  • [22] A dynamic hierarchical incremental learning-based supervised clustering for data stream with considering concept drift
    Soheila Nikpour
    Shahrokh Asadi
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 2983 - 3003
  • [23] Towards Big Data Bayesian Network Learning - an Ensemble Learning Based Approach
    Tang, Yan
    Wang, Yu
    Li, Ling
    Cooper, Kendra M. L.
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 355 - 357
  • [24] Concept Drift Detection for Evolving Stream Data
    Lee, Jeonghoon
    Lee, Yoon-Joon
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (11) : 2288 - 2292
  • [25] An ensemble method for data stream classification in the presence of concept drift
    Department of Computer Engineering, University of Zanjan, Zanjan
    45371-38791, Iran
    Front. Inf. Technol. Electr. Eng., 12 (1059-1068):
  • [26] An ensemble method for data stream classification in the presence of concept drift
    Omid Abbaszadeh
    Ali Amiri
    Ali Reza Khanteymoori
    Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 1059 - 1068
  • [27] An ensemble method for data stream classification in the presence of concept drift
    Omid ABBASZADEH
    Ali AMIRI
    Ali Reza KHANTEYMOORI
    Frontiers of Information Technology & Electronic Engineering, 2015, 16 (12) : 1059 - 1068
  • [28] A Dynamic Ensemble Learning Framework for Data Stream Analysis and Real-Time Threat Detection
    Demertzis, Konstantinos
    Iliadis, Lazaros
    Anezakis, Vardis-Dimitris
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 669 - 681
  • [29] Improving malware detection using big data and ensemble learning
    Gupta, Deepak
    Rani, Rinkle
    COMPUTERS & ELECTRICAL ENGINEERING, 2020, 86
  • [30] Intrusion detection based on ensemble learning for big data classification
    Jemili, Farah
    Meddeb, Rahma
    Korbaa, Ouajdi
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 3771 - 3798