ElStream: An Ensemble Learning Approach for Concept Drift Detection in Dynamic Social Big Data Stream Learning

被引:63
|
作者
Abbasi, Ahmad [1 ]
Javed, Abdul Rehman [2 ]
Chakraborty, Chinmay [3 ]
Nebhen, Jamel [4 ]
Zehra, Wisha [1 ]
Jalil, Zunera [2 ]
机构
[1] Air Univ, Fac Comp & AI, Islamabad 44000, Pakistan
[2] Air Univ, Dept Cyber Secur, Islamabad 44000, Pakistan
[3] Birla Inst Technol, Dept Elect & Commun Engn, Ranchi 835215, Bihar, India
[4] Prince Sattam Bin Abdulaziz Univ, Coll Comp Sci & Engn, Al Kharj 11942, Saudi Arabia
关键词
Big Data; Machine learning; Light emitting diodes; Training; Data models; Standards; Licenses; Internet of Things; big data; smart concept drift; social data; online learning; ensemble learning; HETEROGENEOUS ENSEMBLE; ONLINE; CLASSIFIER;
D O I
10.1109/ACCESS.2021.3076264
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid increase in communication technologies and smart devices, an enormous surge in data traffic has been observed. A huge amount of data gets generated every second by different applications, users, and devices. This rapid generation of data has created the need for solutions to analyze the change in data over time in unforeseen ways despite resource constraints. These unforeseeable changes in the underlying distribution of streaming data over time are identified as concept drifts. This paper presents a novel approach named ElStream that detects concept drift using ensemble and conventional machine learning techniques using both real and artificial data. ElStream utilizes the majority voting technique making only optimum classifier to vote for decision. Experiments were conducted to evaluate the performance of the proposed approach. According to experimental analysis, the ensemble learning approach provides a consistent performance for both artificial and real-world data sets. Experiments prove that the ElStream provides better accuracy of 12.49%, 11.98%, 10.06%, 1.2%, and 0.33% for PokerHand, LED, Random RBF, Electricity, and SEA dataset respectively, which is better as compared to previous state-of-the-art studies and conventional machine learning algorithms.
引用
收藏
页码:66408 / 66419
页数:12
相关论文
共 50 条
  • [41] Sparse Ensemble Learning for Concept Detection
    Tang, Sheng
    Zheng, Yan-Tao
    Wang, Yu
    Chua, Tat-Seng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (01) : 43 - 54
  • [42] Concept drift detection on stream data for revising DBSCAN
    Miyata Y.
    Ishikawa H.
    IEEJ Transactions on Electronics, Information and Systems, 2020, 140 (08) : 949 - 955
  • [43] Concept drift detection on stream data for revising DBSCAN
    Miyata, Yasushi
    Ishikawa, Hiroshi
    ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2021, 104 (01) : 87 - 94
  • [44] An Active Learning Approach for Ensemble-based Data Stream Mining
    Alabdulrahman, Rabaa
    Viktor, Herna
    Paquet, Eric
    KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 275 - 282
  • [45] An Algorithm Design of Big Data Anomaly Detection Based on Ensemble Learning
    Chen, Xiao
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON COMPUTER AND MULTIMEDIA TECHNOLOGY, ICCMT 2024, 2024, : 319 - 323
  • [46] Machine Learning & Concept Drift based Approach for Malicious Website Detection
    Singhal, Siddharth
    Chawla, Utkarsh
    Shorey, Rajeev
    2020 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2020,
  • [47] A Multiscale Concept Drift Detection Method for Learning from Data Streams
    Wang, XueSong
    Kang, Qi
    Zhou, MengChu
    Yao, SiYa
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2018, : 786 - 790
  • [48] An Augmented Learning Approach for Multiple Data Streams Under Concept Drift
    Wang, Kun
    Lu, Jie
    Liu, Anjin
    Zhang, Guangquan
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT I, 2024, 14471 : 391 - 402
  • [49] A Survey on Ensemble Learning for Data Stream Classification
    Gomes, Heitor Murilo
    Barddal, Jean Paul
    Enembreck, Fabricio
    Bifet, Albert
    ACM COMPUTING SURVEYS, 2017, 50 (02)
  • [50] Ensemble learning for data stream analysis: A survey
    Krawczyk, Bartosz
    Minku, Leandro L.
    Gama, Joao
    Stefanowski, Jerzy
    Wozniak, Michal
    INFORMATION FUSION, 2017, 37 : 132 - 156