Systematic ensemble model selection approach for educational data mining

Cited by: 75
Authors
Injadat, MohammadNoor [1]
Moubayed, Abdallah [1]
Nassif, Ali Bou [1, 2]
Shami, Abdallah [1]
Affiliations
[1] Univ Western Ontario, Elect & Comp Engn Dept, London, ON, Canada
[2] Univ Sharjah, Comp Engn Dept, Sharjah, U Arab Emirates
Keywords
e-learning; Student performance prediction; Educational data mining; Ensemble learning model selection; Gini index; p-value; PREDICTING ACADEMIC-PERFORMANCE;
DOI
10.1016/j.knosys.2020.105992
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A plethora of research has focused on predicting students' performance in order to support their development. Many institutions aim to improve student performance and education quality, which can be achieved by using data mining techniques to analyze and predict students' performance and to determine the factors that may affect their final marks. To address this issue, this work starts by thoroughly exploring and analyzing two different datasets at two separate stages of course delivery (20% and 50%, respectively) using multiple graphical, statistical, and quantitative techniques. The feature analysis provides insights into the nature of the different features considered and helps in the choice of the machine learning algorithms and their parameters. Furthermore, this work proposes a systematic approach based on the Gini index and p-value to select a suitable ensemble learner from a combination of six potential machine learning algorithms. Experimental results show that the proposed ensemble models achieve high accuracy and a low false positive rate at all stages for both datasets. (C) 2020 Elsevier B.V. All rights reserved.
Pages: 16