Predictive Analysis of Students' Learning Performance Using Data Mining Techniques: A Comparative Study of Feature Selection Methods

被引:7
|
作者
Mustapha, S. M. F. D. Syed [1 ]
机构
[1] Zayed Univ, Coll Technol Innovat, POB 144534, Dubai, U Arab Emirates
关键词
data mining; feature selection methods; Boruta algorithm; lasso regression; recursive feature elimination (RFE); random forest importance (RFI);
D O I
10.3390/asi6050086
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The utilization of data mining techniques for the prompt prediction of academic success has gained significant importance in the current era. There is an increasing interest in utilizing these methodologies to forecast the academic performance of students, thereby facilitating educators to intervene and furnish suitable assistance when required. The purpose of this study was to determine the optimal methods for feature engineering and selection in the context of regression and classification tasks. This study compared the Boruta algorithm and Lasso regression for regression, and Recursive Feature Elimination (RFE) and Random Forest Importance (RFI) for classification. According to the findings, Gradient Boost for the regression part of this study had the least Mean Absolute Error (MAE) and Root-Mean-Square Error (RMSE) of 12.93 and 18.28, respectively, in the case of the Boruta selection method. In contrast, RFI was found to be the superior classification method, yielding an accuracy rate of 78% in the classification part. This research emphasized the significance of employing appropriate feature engineering and selection methodologies to enhance the efficacy of machine learning algorithms. Using a diverse set of machine learning techniques, this study analyzed the OULA dataset, focusing on both feature engineering and selection. Our approach was to systematically compare the performance of different models, leading to insights about the most effective strategies for predicting student success.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] A review on feature selection methods for improving the performance of classification in educational data mining
    Zaffar M.
    Hashmani M.A.
    Savita K.S.
    Khan S.A.
    International Journal of Information Technology and Management, 2021, 20 (1-2): : 110 - 131
  • [22] A Comparative Study of Improvements Filter Methods Bring on Feature Selection Using Microarray Data
    Wang, Yingying
    Fan, Xiaomao
    Cai, Yunpeng
    HEALTH INFORMATION SCIENCE, HIS 2014, 2014, 8423 : 55 - 62
  • [23] Data preprocessing and feature selection techniques in gait recognition: A comparative study of machine learning and deep learning approaches
    Parashar, Anubha
    Parashar, Apoorva
    Ding, Weiping
    Shabaz, Mohammad
    Rida, Imad
    PATTERN RECOGNITION LETTERS, 2023, 172 : 65 - 73
  • [24] Attack classification using feature selection techniques: a comparative study
    Ankit Thakkar
    Ritika Lohiya
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 1249 - 1266
  • [25] Attack classification using feature selection techniques: a comparative study
    Thakkar, Ankit
    Lohiya, Ritika
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (01) : 1249 - 1266
  • [26] Machine Learning and Ensemble Learning Techniques for Intrusion Detection Systems: A Performance Analysis Based on Feature Selection Methods
    Basarslan, Muhammet Sinan
    Turgut, Zeynep
    INTELLIGENT AND FUZZY SYSTEMS, VOL 3, INFUS 2024, 2024, 1090 : 117 - 124
  • [27] COMPARATIVE PERFORMANCE ANALYSIS OF CLUSTERING TECHNIQUES IN EDUCATIONAL DATA MINING
    DeFreitas, Kyle
    Bernard, Margaret
    IADIS-INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2015, 10 (02): : 65 - 78
  • [28] Detection of financial statement fraud and feature selection using data mining techniques
    Ravisankar, P.
    Ravi, V.
    Rao, G. Raghava
    Bose, I.
    DECISION SUPPORT SYSTEMS, 2011, 50 (02) : 491 - 500
  • [29] Improving Reinforcement Learning Algorithms by the Use of Data Mining Techniques for Feature and Action Selection
    Vieira, Davi C. de L.
    Adeodato, Paulo J. L.
    Goncalves, Paulo M., Jr.
    IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [30] A Literature Review of Feature Selection Techniques and Applications Review of feature selection in data mining
    Visalakshi, S.
    Radha, V.
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 966 - 971