Systematic ensemble model selection approach for educational data mining

被引:75
|
作者
Injadat, MohammadNoor [1 ]
Moubayed, Abdallah [1 ]
Nassif, Ali Bou [1 ,2 ]
Shami, Abdallah [1 ]
机构
[1] Univ Western Ontario, Elect & Comp Engn Dept, London, ON, Canada
[2] Univ Sharjah, Comp Engn Dept, Sharjah, U Arab Emirates
关键词
e-learning; Student performance prediction; Educational data mining; Ensemble learning model selection; Gini index; p-value; PREDICTING ACADEMIC-PERFORMANCE;
D O I
10.1016/j.knosys.2020.105992
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A plethora of research has been done in the past focusing on predicting student's performance in order to support their development. Many institutions are focused on improving the performance and the education quality; and this can be achieved by utilizing data mining techniques to analyze and predict students' performance and to determine possible factors that may affect their final marks. To address this issue, this work starts by thoroughly exploring and analyzing two different datasets at two separate stages of course delivery (20% and 50% respectively) using multiple graphical, statistical, and quantitative techniques. The feature analysis provides insights into the nature of the different features considered and helps in the choice of the machine learning algorithms and their parameters. Furthermore, this work proposes a systematic approach based on Gini index and p-value to select a suitable ensemble learner from a combination of six potential machine learning algorithms. Experimental results show that the proposed ensemble models achieve high accuracy and low false positive rate at all stages for both datasets. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Text Classification Using Ensemble Features Selection and Data Mining Techniques
    Shravankumar, B.
    Ravi, Vadlamani
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, SEMCCO 2014, 2015, 8947 : 176 - 186
  • [22] A dynamic classifier ensemble selection approach for noise data
    Xiao, Jin
    He, Changzheng
    Jiang, Xiaoyi
    Liu, Dunhu
    INFORMATION SCIENCES, 2010, 180 (18) : 3402 - 3421
  • [23] An Ensemble Approach for Gene Selection in Gene Expression Data
    Castellanos-Garzon, Jose A.
    Ramos, Juan
    Lopez-Sanchez, Daniel
    de Paz, Juan F.
    11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 237 - 247
  • [24] Effective educational process: A data-mining approach
    Ranjan, Jayanthi
    Malik, Kamna
    VINE, 2007, 37 (04): : 502 - 515
  • [25] An Intelligent Prediction System for Educational Data Mining Based on Ensemble and Filtering approaches
    Ashraf, Mudasir
    Zaman, Majid
    Ahmed, Muheet
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1471 - 1483
  • [26] A Systematic Review of Deep Learning Approaches to Educational Data Mining
    Hernandez-Blanco, Antonio
    Herrera-Flores, Boris
    Tomas, David
    Navarro-Colorado, Borja
    COMPLEXITY, 2019, 2019
  • [27] Stacking ensemble approach in data mining methods for landslide prediction
    Solmaz Abdollahizad
    Mohammad Ali Balafar
    Bakhtiar Feizizadeh
    Amin Babazadeh Sangar
    Karim Samadzamini
    The Journal of Supercomputing, 2023, 79 : 8583 - 8610
  • [28] Stacking ensemble approach in data mining methods for landslide prediction
    Abdollahizad, Solmaz
    Balafar, Mohammad Ali
    Feizizadeh, Bakhtiar
    Sangar, Amin Babazadeh
    Samadzamini, Karim
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (08): : 8583 - 8610
  • [29] Educational data mining: a systematic review of research and emerging trends
    Du, Xu
    Yang, Juan
    Hung, Jui-Long
    Shelton, Brett
    INFORMATION DISCOVERY AND DELIVERY, 2020, 48 (04) : 225 - 236
  • [30] An ensemble approach for conflict detection in Free Flight by data mining
    Alam, Sameer
    Shafi, Kamran
    Abbass, Hussein A.
    Barlow, Michael
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2009, 17 (03) : 298 - 317