ElStream: An Ensemble Learning Approach for Concept Drift Detection in Dynamic Social Big Data Stream Learning

Cited by: 63
Authors
Abbasi, Ahmad [1]
Javed, Abdul Rehman [2]
Chakraborty, Chinmay [3]
Nebhen, Jamel [4]
Zehra, Wisha [1]
Jalil, Zunera [2]
Affiliations
[1] Air Univ, Fac Comp & AI, Islamabad 44000, Pakistan
[2] Air Univ, Dept Cyber Secur, Islamabad 44000, Pakistan
[3] Birla Inst Technol, Dept Elect & Commun Engn, Ranchi 835215, Bihar, India
[4] Prince Sattam Bin Abdulaziz Univ, Coll Comp Sci & Engn, Al Kharj 11942, Saudi Arabia
Keywords
Big Data; Machine learning; Light emitting diodes; Training; Data models; Standards; Licenses; Internet of Things; big data; smart concept drift; social data; online learning; ensemble learning; HETEROGENEOUS ENSEMBLE; ONLINE; CLASSIFIER;
DOI
10.1109/ACCESS.2021.3076264
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
With the rapid growth of communication technologies and smart devices, an enormous surge in data traffic has been observed. A huge amount of data is generated every second by different applications, users, and devices. This rapid generation of data has created the need for solutions that analyze how data changes over time in unforeseen ways, despite resource constraints. These unforeseeable changes in the underlying distribution of streaming data over time are known as concept drifts. This paper presents a novel approach named ElStream that detects concept drift using ensemble and conventional machine learning techniques on both real and artificial data. ElStream uses majority voting, allowing only the optimum classifier to vote on the decision. Experiments were conducted to evaluate the performance of the proposed approach. According to the experimental analysis, the ensemble learning approach provides consistent performance on both artificial and real-world data sets. The experiments show that ElStream improves accuracy by 12.49%, 11.98%, 10.06%, 1.2%, and 0.33% on the PokerHand, LED, Random RBF, Electricity, and SEA datasets, respectively, compared to previous state-of-the-art studies and conventional machine learning algorithms.
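As a rough illustration of the majority voting step mentioned in the abstract, the following minimal sketch combines the labels predicted by several classifiers for one stream instance. It is not the authors' implementation: the function name `majority_vote`, the tie-breaking rule, and the example labels are assumptions made for illustration, and the abstract does not specify how ElStream selects which classifiers may vote.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label predicted by the most classifiers.

    `predictions` holds one predicted label per classifier for a single
    instance. Ties go to the label that appears first (an assumed rule).
    """
    if not predictions:
        raise ValueError("at least one prediction is required")
    # most_common orders equal counts by first occurrence (Python 3.7+)
    return Counter(predictions).most_common(1)[0][0]


# Hypothetical labels from three classifiers for one instance
label = majority_vote(["drift", "no_drift", "drift"])
```

In a streaming setting, a function like this would be called once per incoming instance, after each ensemble member has produced its prediction.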
Pages: 66408-66419
Number of pages: 12
Related papers
50 in total
  • [1] Concept Drift Detection in Data Stream: Ensemble Learning Method for Detecting Gradual Instances
    Khanh-Tung Nguyen
    Trung Tran
    Anh-Duc Nguyen
    Xuan-Hieu Phan
    Quang-Thuy Ha
    2023 ASIA MEETING ON ENVIRONMENT AND ELECTRICAL ENGINEERING, EEE-AM, 2023,
  • [2] Detection of Concept Drift for Learning from Stream Data
    Lee, Jeonghoon
    Magoules, Frederic
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 241 - 245
  • [3] An Ensemble Learning Approach for Concept Drift
    Liao, Jian-Wei
    Dai, Bi-Ru
    2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA), 2014,
  • [4] SETL: a transfer learning based dynamic ensemble classifier for concept drift detection in streaming data
    Arora, Shruti
    Rani, Rinkle
    Saxena, Nitin
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 3417 - 3432
  • [5] A Concept Drift based Ensemble Incremental Learning Approach for Intrusion Detection
    Yuan, Xiaoming
    Wang, Ran
    Zhuang, Yi
    Zhu, Kun
    Hao, Jie
    IEEE 2018 INTERNATIONAL CONGRESS ON CYBERMATICS / 2018 IEEE CONFERENCES ON INTERNET OF THINGS, GREEN COMPUTING AND COMMUNICATIONS, CYBER, PHYSICAL AND SOCIAL COMPUTING, SMART DATA, BLOCKCHAIN, COMPUTER AND INFORMATION TECHNOLOGY, 2018, : 350 - 357
  • [6] Combining active learning with concept drift detection for data stream mining
    Krawczyk, Bartosz
    Pfahringer, Bernhard
    Wozniak, Michal
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2239 - 2244
  • [7] Concept Drift Detection and Adaption in Big Imbalance Industrial IoT Data Using an Ensemble Learning Method of Offline Classifiers
    Lin, Chun-Cheng
    Deng, Der-Jiunn
    Kuo, Chin-Hung
    Chen, Linnan
    IEEE ACCESS, 2019, 7 : 56198 - 56207
  • [8] An Ensemble Learning Approach for Data Stream Clustering
    Fathzadeh, Ramin
    Mokhtari, Vahid
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [9] Incremental learning imbalanced data streams with concept drift: The dynamic updated ensemble algorithm
    Li, Zeng
    Huang, Wenchao
    Xiong, Yan
    Ren, Siqi
    Zhu, Tuanfei
    KNOWLEDGE-BASED SYSTEMS, 2020, 195
  • [10] Parameter Distribution Ensemble Learning for Sudden Concept Drift Detection
    Khanh-Tung Nguyen
    Trung Tran
    Anh-Duc Nguyen
    Xuan-Hieu Phan
    Quang-Thuy Ha
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT II, 2022, 13758 : 192 - 203