Systematic ensemble model selection approach for educational data mining

被引:75
|
作者
Injadat, MohammadNoor [1 ]
Moubayed, Abdallah [1 ]
Nassif, Ali Bou [1 ,2 ]
Shami, Abdallah [1 ]
机构
[1] Univ Western Ontario, Elect & Comp Engn Dept, London, ON, Canada
[2] Univ Sharjah, Comp Engn Dept, Sharjah, U Arab Emirates
关键词
e-learning; Student performance prediction; Educational data mining; Ensemble learning model selection; Gini index; p-value; PREDICTING ACADEMIC-PERFORMANCE;
D O I
10.1016/j.knosys.2020.105992
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A plethora of research has been done in the past focusing on predicting student's performance in order to support their development. Many institutions are focused on improving the performance and the education quality; and this can be achieved by utilizing data mining techniques to analyze and predict students' performance and to determine possible factors that may affect their final marks. To address this issue, this work starts by thoroughly exploring and analyzing two different datasets at two separate stages of course delivery (20% and 50% respectively) using multiple graphical, statistical, and quantitative techniques. The feature analysis provides insights into the nature of the different features considered and helps in the choice of the machine learning algorithms and their parameters. Furthermore, this work proposes a systematic approach based on Gini index and p-value to select a suitable ensemble learner from a combination of six potential machine learning algorithms. Experimental results show that the proposed ensemble models achieve high accuracy and low false positive rate at all stages for both datasets. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] An Ensemble-based Decision Tree Approach for Educational Data Mining
    Abdar, Moloud
    Zomorodi-Moghadam, Mariam
    Zhou, Xujuan
    2018 5TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC, AND SOCIO-CULTURAL COMPUTING (BESC), 2018, : 126 - 129
  • [2] Multi-split optimized bagging ensemble model selection for multi-class educational data mining
    Injadat, MohammadNoor
    Moubayed, Abdallah
    Nassif, Ali Bou
    Shami, Abdallah
    APPLIED INTELLIGENCE, 2020, 50 (12) : 4506 - 4528
  • [3] Multi-split optimized bagging ensemble model selection for multi-class educational data mining
    MohammadNoor Injadat
    Abdallah Moubayed
    Ali Bou Nassif
    Abdallah Shami
    Applied Intelligence, 2020, 50 : 4506 - 4528
  • [4] An Approach to Educational Data Mining Model Accuracy Improvement Using Histogram Discretization and Combining Classifiers into an Ensemble
    Dimic, Gabrijela
    Rancic, Dejan
    Pronie-Rancic, Olivera
    Milosevic, Danijela
    SMART EDUCATION AND E-LEARNING 2019, 2019, 144 : 267 - 280
  • [5] A Systematic Review on Educational Data Mining
    Dutti, Ashish
    Ismaili, Maizatul Akmar
    Herawani, Tutut
    IEEE ACCESS, 2017, 5 : 15991 - 16005
  • [6] A Systematic Review of Educational Data Mining
    Xu, FangYao
    Li, ZhiQiang
    Yue, JiaQi
    Qu, ShaoJie
    INTELLIGENT COMPUTING, VOL 2, 2021, 284 : 764 - 780
  • [7] Systematic Review on Educational Data Mining in Educational Gamification
    Bosnjakovic, Natalija
    Babic, Ivana Durdevic
    TECHNOLOGY KNOWLEDGE AND LEARNING, 2023, 30 (1) : 29 - 46
  • [8] A Systematic Approach for Providing Personalized Pedagogical Recommendations Based on Educational Data Mining
    Araujo Paiva, Ranilson Oscar
    Bittencourt Santa Pinto, Ig Ibert
    da Silva, Alan Pedro
    Isotani, Seiji
    Jaques, Patricia
    INTELLIGENT TUTORING SYSTEMS, ITS 2014, 2014, 8474 : 362 - 367
  • [9] Dynamic Ensemble Selection Methods for Heterogeneous Data Mining
    Ballard, Chris
    Wang, Wenjia
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 1021 - 1026
  • [10] Role of FCBF Feature Selection in Educational Data Mining
    Zaffar, Maryam
    Hashmani, Manzoor Ahmad
    Savita, K. S.
    Rizvi, Syed Sajjad Hussain
    Rehman, Mubashar
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2020, 39 (04) : 772 - 778