Big data classification of learning behaviour based on data reduction and ensemble learning

Cited by: 1
Authors
Wang, Taotao [1]
Wu, Xiaoxuan [2]
Affiliations
[1] Jiangxi Univ Technol, Dept Informat Engn Coll, Ganzhou 330098, Jiangxi, Peoples R China
[2] Guangxi Vocat Coll Water Resources & Elect Power, Dept Gen Educ, Nanning 530023, Peoples R China
Keywords
data reduction; ensemble learning; rough set theory; big data of learning behaviour; big data classification
DOI
10.1504/IJCEELL.2023.132418
Chinese Library Classification number
G40 [Education]
Subject classification codes
040101; 120403
Abstract
To overcome the low classification accuracy, long classification time, and high miss rate of traditional methods, a big data classification method for learning behaviour based on data reduction and ensemble learning is proposed. The big data of learning behaviour is first cleaned and transformed and its attributes are discretised; a data reduction algorithm then simplifies the attribute set. An ensemble learning method linearly combines several weak classifiers, and the ensemble classifier is trained according to the Choquet integral. The trained classifier is then used to classify the reduced learning behaviour data. The experimental results show that when the volume of learning behaviour data reaches 5,000 GB, the proposed method achieves an average classification accuracy of 92%, a classification time of 29 s, and a classification failure rate of 0.32%.
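The abstract does not say how the Choquet integral is applied or which fuzzy measure is used, so the following is only a minimal sketch of the standard discrete Choquet integral used to fuse the scores of several classifiers. The function names, the example scores, and the weights are all hypothetical and are not taken from the paper. With an additive measure the integral reduces to a plain weighted (linear) combination of the classifier scores.

```python
from itertools import combinations
from typing import Dict, FrozenSet, Sequence

Measure = Dict[FrozenSet[int], float]


def additive_measure(weights: Sequence[float]) -> Measure:
    """Build an additive fuzzy measure: mu(A) is the sum of the weights in A.

    Assumption for illustration only; a learned measure need not be additive.
    """
    n = len(weights)
    return {
        frozenset(subset): sum(weights[i] for i in subset)
        for size in range(1, n + 1)
        for subset in combinations(range(n), size)
    }


def choquet_integral(scores: Sequence[float], measure: Measure) -> float:
    """Discrete Choquet integral of non-negative scores w.r.t. a fuzzy measure.

    scores[i] is the confidence that classifier i assigns to one class.
    measure maps every non-empty subset of classifier indices to [0, 1].
    """
    # Visit classifiers from the lowest score to the highest.
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    total = 0.0
    previous = 0.0
    for rank, idx in enumerate(order):
        # Coalition of classifiers whose score is at least scores[idx].
        coalition = frozenset(order[rank:])
        total += (scores[idx] - previous) * measure[coalition]
        previous = scores[idx]
    return total


if __name__ == "__main__":
    # Hypothetical scores from three weak classifiers for a single class.
    scores = [0.9, 0.6, 0.3]
    mu = additive_measure([0.5, 0.3, 0.2])
    print(choquet_integral(scores, mu))
```

A non-additive measure lets the fused score reward or penalise particular groups of classifiers agreeing, which a fixed linear combination cannot express; how such a measure is fitted in the proposed method is not described in this record.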
Pages: 496 - 510
Number of pages: 16
Related papers
50 in total
  • [1] Intrusion detection based on ensemble learning for big data classification
    Jemili, Farah
    Meddeb, Rahma
    Korbaa, Ouajdi
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): 3771 - 3798
  • [2] Imbalanced Data Classification Method Based on Ensemble Learning
    Xiang, Yu
    Xie, Yongping
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 18 - 24
  • [3] Towards Big Data Bayesian Network Learning - an Ensemble Learning Based Approach
    Tang, Yan
    Wang, Yu
    Li, Ling
    Cooper, Kendra M. L.
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014: 355 - 357
  • [4] A dynamic ensemble learning based data mining framework for medical imbalanced big data
    Rithani, M.
    Kumar, R. Prasanna
    Ali, Altalbe
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [5] Unbalanced data sentiment classification method based on ensemble learning
    Duan, Jidong
    Ma, Kun
    Sun, Runyuan
    PROCEEDINGS OF 2019 2ND INTERNATIONAL CONFERENCE ON BIG DATA TECHNOLOGIES (ICBDT 2019), 2019: 34 - 38
  • [6] Spark-based ensemble learning for imbalanced data classification
    Ding J.
    Wang S.
    Jia L.
    You J.
    Jiang Y.
    International Journal of Performability Engineering, 2018, 14 (05) : 945 - 964
  • [7] Power data classification method based on selective ensemble learning
    Zhang, Yi-Ying
    Liu, Fei
    Pang, Hao-Yuan
    Zhang, Bo
    Wang, Yang
    Journal of Computers (Taiwan), 2020, 31 (01) : 253 - 260
  • [8] An Algorithm Design of Big Data Anomaly Detection Based on Ensemble Learning
    Chen, Xiao
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON COMPUTER AND MULTIMEDIA TECHNOLOGY, ICCMT 2024, 2024: 319 - 323
  • [9] BigRC-EML: big-data based ransomware classification using ensemble machine learning
    Aurangzeb, Sana
    Anwar, Haris
    Naeem, Muhammad Asif
    Aleem, Muhammad
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (05): 3405 - 3422
  • [10] BigRC-EML: big-data based ransomware classification using ensemble machine learning
    Sana Aurangzeb
    Haris Anwar
    Muhammad Asif Naeem
    Muhammad Aleem
    Cluster Computing, 2022, 25 : 3405 - 3422