Comparative Analysis of Machine Learning Models for Predicting Student Success in Online Programming Courses: A Study Based on LMS Data and External Factors

Cited by: 0
Authors
Arevalo-Cordovilla, Felipe Emiliano [1]
Pena, Marta [2]
Affiliations
[1] Univ Estatal Milagro, Fac Sci & Engn, Milagro 091706, Ecuador
[2] Univ Politecn Cataluna, Dept Math, BarcelonaTech EEBE, Barcelona 08019, Spain
Keywords
academic performance prediction; educational data mining; machine learning models; student retention
DOI
10.3390/math12203272
Chinese Library Classification (CLC) number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
Early prediction of student performance in online programming courses is essential for implementing timely interventions to enhance academic outcomes. This study aimed to predict academic success by comparing four machine learning models: Logistic Regression, Random Forest, Support Vector Machine (SVM), and Neural Network (Multilayer Perceptron, MLP). We analyzed data from the Moodle Learning Management System (LMS) and external factors of 591 students enrolled in online object-oriented programming courses at the Universidad Estatal de Milagro (UNEMI) between 2022 and 2023. The data were preprocessed to address class imbalance using the synthetic minority oversampling technique (SMOTE), and relevant features were selected based on Random Forest importance rankings. The models were trained and optimized using Grid Search with cross-validation. Logistic Regression achieved the highest Area Under the Receiver Operating Characteristic Curve (AUC-ROC) on the test set (0.9354), indicating strong generalization capability. SVM and Neural Network models performed adequately but were slightly outperformed by the simpler models. These findings suggest that integrating LMS data with external factors enhances early prediction of student success. Logistic Regression is a practical and interpretable tool that educational institutions can use to identify at-risk students and implement personalized interventions.
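Two techniques named in the abstract, SMOTE and AUC-ROC, can be illustrated with a minimal standard-library sketch. This is not the authors' implementation, which this record does not give. The function names `smote` and `auc_roc`, the parameters `n_new`, `k`, and `seed`, and the toy data are all assumptions made for illustration.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic minority points by interpolating between a
    minority point and one of its k nearest minority neighbours."""
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority points by Euclidean distance to the base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


def auc_roc(labels, scores):
    """AUC-ROC as the probability that a randomly chosen positive is scored
    above a randomly chosen negative; ties count as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical, not from the study).
minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
new_points = smote(minority, n_new=4, k=2)
auc = auc_roc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
```

In practice, oversampling of this kind is usually applied only to the training split, so that synthetic points do not leak into the data used to report AUC-ROC.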
Pages: 15
Related papers
50 items in total
  • [41] Comparative analysis of advanced deep learning models for predicting evapotranspiration based on meteorological data in Bangladesh
    Paul, Sourov
    Farzana, Syeda Zehan
    Das, Saikat
    Das, Pobithra
    Kashem, Abul
    Environmental Science and Pollution Research, 2024, 31 (50) : 60041 - 60064
  • [42] Comparative analysis of web-based machine learning models
    Stefan, Ana-Maria
    Ovreiu, Elena
    Ciuc, Mihai
    ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2024, 34 (02): 49 - 64
  • [43] Comparative Study of Machine Learning Models and Distributed Runoff Models for Predicting Flood Water Level
    Kubo T.
    Okazaki T.
    IEIE Transactions on Smart Processing and Computing, 2023, 12 (03): 215 - 222
  • [44] Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models
    Bazrafshan, Mohammad
    Sayehmiri, Kourosh
    BMC PSYCHIATRY, 2024, 24 (01)
  • [45] A Comparative Study of Machine Learning Classification Models on Customer Behavior Data
    Rusli, Nur Ida Aniza
    Zulkifle, Farizuwana Akma
    Ramli, Intan Syaherra
    SOFT COMPUTING IN DATA SCIENCE, SCDS 2023, 2023, 1771 : 222 - 231
  • [46] Predicting Student Performance in Online Learning: A Multidimensional Time-Series Data Analysis Approach
    Shou, Zhaoyu
    Xie, Mingquan
    Mo, Jianwen
    Zhang, Huibing
    APPLIED SCIENCES-BASEL, 2024, 14 (06)
  • [47] Predicting soot formation in fossil fuels: A comparative study of regression and machine learning models
    Lawal, Ridhwan
    Farooq, Wasif
    Abdulraheem, Abdulazeez
    Jameel, Abdul Gani Abdul
    DIGITAL CHEMICAL ENGINEERING, 2024, 12
  • [48] Predicting Kereh River's Water Quality: A comparative study of machine learning models
    Nasaruddin, Norashikin
    Ahmad, Afida
    Zakaria, Shahida Farhan
    Ul-Saufie, Ahmad Zia
    Osman, Mohamed Syazwan
    ENVIRONMENT-BEHAVIOUR PROCEEDINGS JOURNAL, 2023, 8 (26): 213 - 219
  • [49] Online Students' Learning Behaviors and Academic Success: An Analysis of LMS Log Data From Flipped Classrooms via Regularization
    Yoo, Jin Eun
    Rho, Minjeong
    Lee, Yekyung
    IEEE ACCESS, 2022, 10 : 10740 - 10753
  • [50] Predicting Kereh River's Water Quality: A comparative study of machine learning models
    Nasaruddin, Norashikin
    Ahmad, Afida
    Zakaria, Shahida Farhan
    Ul-Saufie, Ahmad Zia
    Osman, Mohamed Syazwan
    ENVIRONMENT-BEHAVIOUR PROCEEDINGS JOURNAL, 2023, 8 : 213 - 219