Systematic ensemble model selection approach for educational data mining

Cited by: 75
Authors
Injadat, MohammadNoor [1]
Moubayed, Abdallah [1]
Nassif, Ali Bou [1, 2]
Shami, Abdallah [1]
Affiliations
[1] Univ Western Ontario, Elect & Comp Engn Dept, London, ON, Canada
[2] Univ Sharjah, Comp Engn Dept, Sharjah, U Arab Emirates
Keywords
e-learning; Student performance prediction; Educational data mining; Ensemble learning model selection; Gini index; p-value; PREDICTING ACADEMIC-PERFORMANCE;
DOI
10.1016/j.knosys.2020.105992
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A large body of research has focused on predicting students' performance in order to support their development. Many institutions aim to improve student performance and education quality; this can be achieved by using data mining techniques to analyze and predict students' performance and to determine the factors that may affect their final marks. To address this issue, this work starts by thoroughly exploring and analyzing two different datasets at two separate stages of course delivery (20% and 50%, respectively) using multiple graphical, statistical, and quantitative techniques. The feature analysis provides insights into the nature of the different features considered and helps in the choice of the machine learning algorithms and their parameters. Furthermore, this work proposes a systematic approach based on the Gini index and p-value to select a suitable ensemble learner from a combination of six potential machine learning algorithms. Experimental results show that the proposed ensemble models achieve high accuracy and a low false positive rate at all stages for both datasets. (C) 2020 Elsevier B.V. All rights reserved.
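The abstract does not spell out how candidate ensembles are formed or scored, so the following is only a rough sketch under stated assumptions, not the authors' procedure. It enumerates every subset of two or more learners drawn from six placeholder names (`m1` to `m6` are hypothetical; the abstract does not list the algorithms), averages their predicted scores, and computes a Gini index using one common definition, Gini = 2 * AUC - 1. The p-value step mentioned in the abstract is omitted because the abstract gives no detail about the test used.

```python
from itertools import combinations

# Hypothetical placeholder names: the abstract does not name the six algorithms.
BASE_LEARNERS = ["m1", "m2", "m3", "m4", "m5", "m6"]


def candidate_ensembles(learners, min_size=2):
    """Return every subset of the learners with at least min_size members."""
    return [
        combo
        for size in range(min_size, len(learners) + 1)
        for combo in combinations(learners, size)
    ]


def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def gini_from_auc(labels, scores):
    """Gini index under the assumed definition Gini = 2 * AUC - 1."""
    return 2 * auc(labels, scores) - 1


def average_scores(score_lists):
    """Soft-voting style ensemble: mean of the members' predicted scores."""
    return [sum(col) / len(col) for col in zip(*score_lists)]


# Toy, made-up labels and scores used purely for illustration.
toy_labels = [0, 0, 1, 1]
toy_scores_a = [0.1, 0.4, 0.35, 0.8]
toy_scores_b = [0.2, 0.3, 0.6, 0.7]
toy_ensemble = average_scores([toy_scores_a, toy_scores_b])

print(len(candidate_ensembles(BASE_LEARNERS)))
print(gini_from_auc(toy_labels, toy_scores_a))
print(gini_from_auc(toy_labels, toy_ensemble))
```

With six base learners there are 57 subsets of size two or more, so an exhaustive comparison of candidate ensembles stays cheap. In this toy data, averaging the two score vectors separates the classes better than the first learner alone.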
Pages: 16
Related papers
50 in total
  • [31] A Semi-supervised Ensemble Approach for Mining Data Streams
    Liu, Jing
    Xu, Guo-Sheng
    Xiao, Da
    Gu, Li-Ze
    Niu, Xin-Xin
    JOURNAL OF COMPUTERS, 2013, 8 (11) : 2873 - 2879
  • [32] Educational Data Mining: A Mining Model for Developing Students' Programming Skills
    Pathan, Asraful Alam
    Hasan, Mehedi
    Ahmed, Md. Ferdous
    Farid, Dewan Md.
    8TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA 2014), 2014,
  • [33] A batch ensemble approach to active learning with model selection
    Sugiyama, Masashi
    Rubens, Neil
    NEURAL NETWORKS, 2008, 21 (09) : 1278 - 1286
  • [34] Independent Data Model Selection for Ensemble Dispersion Forecasting
    Ciaramella, Angelo
    Giunta, Giulio
    Riccio, Angelo
    Galmarini, Stefano
    APPLICATIONS OF SUPERVISED AND UNSUPERVISED ENSEMBLE METHODS, 2009, 245 : 213 - +
  • [35] A Systematic Mapping Study of Educational Technologies based on Educational Data Mining and Learning Analytics
    Doko, Edona
    Bexheti, Lejla Abazi
    2018 7TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2018, : 259 - 262
  • [36] Semi Supervised Prediction Model in Educational Data Mining
    Hmiedi, Ismail
    Najadat, Hassan
    Halloush, Zain
    Jalabneh, Ibtihal
    2019 INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2019, : 27 - 31
  • [37] A Process model for Educational Data Mining and Learning Analytics
    Murnion, Phelim
    Helfert, Markus
    BULLETIN OF THE TECHNICAL COMMITTEE ON LEARNING TECHNOLOGY, 2012, 14 (03) : 6 - 8
  • [38] Data mining model for a better higher educational system
    Department of Computer Science, Dr. Ambedkar Government Arts College, Chennai-600 039, India
    Unknown
    Inf. Technol. J., 2006, 3 (560-564):
  • [39] Hybrid model approach in data mining
    Bakirarar, Batuhan
    Cosgun, Erdal
    Elhan, Atilla Halil
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (10) : 4998 - 5007
  • [40] RETRACTED: Enhancement of Predicting Students Performance Model Using Ensemble Approaches and Educational Data Mining Techniques (Retracted Article)
    Ragab, Mahmoud
    Aal, Ahmed M. K. Abdel
    Jifri, Ali O.
    Omran, Nahla F.
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021