Big data classification of learning behaviour based on data reduction and ensemble learning

被引:1
|
作者
Wang, Taotao [1 ]
Wu, Xiaoxuan [2 ]
机构
[1] Jiangxi Univ Technol, Dept Informat Engn Coll, Ganzhou 330098, Jiangxi, Peoples R China
[2] Guangxi Vocat Coll Water Resources & Elect Power, Dept Gen Educ, Nanning 530023, Peoples R China
关键词
data reduction; ensemble learning; rough set theory; big data of learning behaviour; big data classification;
D O I
10.1504/IJCEELL.2023.132418
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
In order to overcome the problems of low classification accuracy, long time, and high missing ratio of traditional methods, a big data classification method of learning behaviour based on data reduction and ensemble learning was proposed. By cleaning and transforming the big data of learning behaviour and discretising the attributes of big data of learning behaviour, the data reduction algorithm is used to simplify the attributes of big data of learning behaviour. The ensemble learning method is used to linearly combine several weak classifiers, and the ensemble classifier is trained according to Choquet integral. The trained classifier is used to classify the big data of learning behaviour after simplified processing. The experimental results show that when the amount of big data on learning behaviour reaches 5,000 GB, the average classification accuracy of the proposed method is 92%, the classification time is 29 s, and the failure rate of classification is 0.32%.
引用
收藏
页码:496 / 510
页数:16
相关论文
共 50 条
  • [21] A synthetic neighborhood generation based ensemble learning for the imbalanced data classification
    Chen, Zhi
    Lin, Tao
    Xia, Xin
    Xu, Hongyan
    Ding, Sha
    APPLIED INTELLIGENCE, 2018, 48 (08) : 2441 - 2457
  • [22] Robust ensemble learning for cancer diagnosis based on microarray data classification
    Peng, YH
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 564 - 574
  • [23] Ensemble Deep Learning with Chimp Optimization Based Medical Data Classification
    Dutta, Ashit Kumar
    Albagory, Yasser
    Alsanea, Majed
    Almohammed, Hamdan I.
    Sait, Abdul Rahaman Wahab
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 35 (02): : 1643 - 1655
  • [24] A synthetic neighborhood generation based ensemble learning for the imbalanced data classification
    Zhi Chen
    Tao Lin
    Xin Xia
    Hongyan Xu
    Sha Ding
    Applied Intelligence, 2018, 48 : 2441 - 2457
  • [25] A Genetic-Based Ensemble Learning Applied to Imbalanced Data Classification
    Klikowski, Jakub
    Ksieniewicz, Pawel
    Wozniak, Michal
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING (IDEAL 2019), PT II, 2019, 11872 : 340 - 352
  • [26] Metaheuristic Based Clustering with Deep Learning Model for Big Data Classification
    Krishnaswamy, R.
    Subramaniam, Kamalraj
    Nandini, V
    Vijayalakshmi, K.
    Kadry, Seifedine
    Nam, Yunyoung
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (01): : 391 - 406
  • [27] Image Classification Based on Deep Learning for Big Data of Power Grid
    Yin, Jun
    Zhu, Yongxin
    Shi, Weiwei
    Qiu, Yunru
    Liu, Xingying
    Sheng, Gehao
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND AUTOMATIC CONTROL, 2016, 367 : 1233 - 1241
  • [28] Ensemble learning and hierarchical data representation for microarray classification
    Bosio, Mattia
    Bellot, Pau
    Salembier, Philippe
    Oliveras Verges, Albert
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2013,
  • [29] An Ensemble Extreme Learning Machine for Data Stream Classification
    Yang, Rui
    Xu, Shuliang
    Feng, Lin
    ALGORITHMS, 2018, 11 (07)
  • [30] Multiple Imputation and Ensemble Learning for Classification with Incomplete Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    Lam Thu Bui
    INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES 2016, 2017, 8 : 401 - 415