Predictive Analysis of Students' Learning Performance Using Data Mining Techniques: A Comparative Study of Feature Selection Methods

被引:7
|
作者
Mustapha, S. M. F. D. Syed [1 ]
机构
[1] Zayed Univ, Coll Technol Innovat, POB 144534, Dubai, U Arab Emirates
关键词
data mining; feature selection methods; Boruta algorithm; lasso regression; recursive feature elimination (RFE); random forest importance (RFI);
D O I
10.3390/asi6050086
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The utilization of data mining techniques for the prompt prediction of academic success has gained significant importance in the current era. There is an increasing interest in utilizing these methodologies to forecast the academic performance of students, thereby facilitating educators to intervene and furnish suitable assistance when required. The purpose of this study was to determine the optimal methods for feature engineering and selection in the context of regression and classification tasks. This study compared the Boruta algorithm and Lasso regression for regression, and Recursive Feature Elimination (RFE) and Random Forest Importance (RFI) for classification. According to the findings, Gradient Boost for the regression part of this study had the least Mean Absolute Error (MAE) and Root-Mean-Square Error (RMSE) of 12.93 and 18.28, respectively, in the case of the Boruta selection method. In contrast, RFI was found to be the superior classification method, yielding an accuracy rate of 78% in the classification part. This research emphasized the significance of employing appropriate feature engineering and selection methodologies to enhance the efficacy of machine learning algorithms. Using a diverse set of machine learning techniques, this study analyzed the OULA dataset, focusing on both feature engineering and selection. Our approach was to systematically compare the performance of different models, leading to insights about the most effective strategies for predicting student success.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Mining educational data to predict students performance A comparative study of data mining techniques
    Nahar, Khaledun
    Shova, Boishakhe Islam
    Ria, Tahmina
    Rashid, Humayara Binte
    Islam, A. H. M. Saiful
    EDUCATION AND INFORMATION TECHNOLOGIES, 2021, 26 (05) : 6051 - 6067
  • [2] Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study
    Anurag Kumar Verma
    Saurabh Pal
    Surjeet Kumar
    Applied Biochemistry and Biotechnology, 2020, 190 : 341 - 359
  • [3] Analysis of Feature Selection and Data Mining Techniques to Predict Student Academic Performance
    Kumar, Mukesh
    Sharma, Chetan
    Sharma, Shamneesh
    Nidhi, Nidhi
    Islam, Nazrul
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 1013 - 1017
  • [4] Evaluating feature selection methods for learning in data mining applications
    Piramuthu, S
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2004, 156 (02) : 483 - 494
  • [5] Evaluating feature selection methods for learning in data mining applications
    Piramuthu, S
    PROCEEDINGS OF THE THIRTY-FIRST HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOL V: MODELING TECHNOLOGIES AND INTELLIGENT SYSTEMS TRACK, 1998, : 294 - 301
  • [6] Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method-a Comparative Study
    Verma, Anurag Kumar
    Pal, Saurabh
    Kumar, Surjeet
    APPLIED BIOCHEMISTRY AND BIOTECHNOLOGY, 2020, 190 (02) : 341 - 359
  • [7] ANALYSIS OF STUDENTS' STUDY ACTIVITIES IN VIRTUAL LEARNING ENVIRONMENTS USING DATA MINING METHODS
    Preidys, Saulius
    Sakalauskas, Leonidas
    TECHNOLOGICAL AND ECONOMIC DEVELOPMENT OF ECONOMY, 2010, 16 (01) : 94 - 108
  • [8] COMPARATIVE STUDY OF FEATURE SELECTION METHODS TO ANALYZE PERFORMANCE OF LUNG CANCER DATA
    Koc, Emel
    Ozer, A. Nevra
    PROCEEDINGS OF THE EUROPEAN CONFERENCE ON DATA MINING 2015 AND INTERNATIONAL CONFERENCES ON INTELLIGENT SYSTEMS AND AGENTS 2015 AND THEORY AND PRACTICE IN MODERN COMPUTING 2015, 2015, : 219 - 222
  • [9] Learning Performance of International Students and Students with Disabilities: Early Prediction and Feature Selection through Educational Data Mining
    Thao-Trang Huynh-Cam
    Chen, Long-Sheng
    Khai-Vinh Huynh
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (03)
  • [10] Comparative study of feature selection methods on microarray data
    Miyamoto, T
    Uchimura, S
    Hamamoto, Y
    Iizuka, N
    Oka, M
    Yamada-Okabe, H
    IEEE EMBS APBME 2003, 2003, : 82 - 83