Balancing Fined-Tuned Machine Learning Models Between Continuous and Discrete Variables - A Comprehensive Analysis Using Educational Data

Cited by: 3
Authors
Drousiotis, Efthyvoulos [1]
Pentaliotis, Panagiotis [1]
Shi, Lei [2]
Cristea, Alexandra I. [2]
Affiliations
[1] Univ Liverpool, Dept Elect Engn & Elect, Liverpool, Merseyside, England
[2] Univ Durham, Dept Comp Sci, Durham, England
Keywords
Neural networks; Tree-based algorithms; Educational data mining; Feature engineering; MOOCs
DOI
10.1007/978-3-031-11644-5_21
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Along with the exponential increase in students enrolling in MOOCs [26] comes the problem of a high student dropout rate. Researchers worldwide are interested in predicting whether students will drop out of MOOCs, in order to prevent it. This study explores and improves ways of handling notoriously challenging continuous-variable datasets to predict dropout. Importantly, we propose a fair comparison methodology: unlike prior studies, and for the first time, when comparing various models we use each algorithm with the type of dataset it is intended for, thus comparing 'like for like'. We use a time-series dataset with algorithms suited to time series, and a discrete-variables dataset, converted through feature engineering, with algorithms known to handle discrete variables well. Moreover, in terms of predictive ability, we examine the importance of finding the optimal hyperparameters for our algorithms, in combination with the most effective pre-processing techniques for the data. We show that these much lighter discrete models outperform the time-series models, enabling faster training and testing. This result also holds under fine-tuning of pre-processing and hyperparameter optimisation.
Pages: 256-268
Page count: 13
Related papers
50 in total
  • [31] Analysis of machine learning models and data sources to forecast burst pressure of petroleum corroded pipelines: A comprehensive review
    Soomro, Afzal Ahmed
    Mokhtar, Ainul Akmar
    Hussin, Hilmi B.
    Lashari, Najeebullah
    Oladosu, Temidayo Lekan
    Jameel, Syed Muslim
    Inayat, Muddasser
    ENGINEERING FAILURE ANALYSIS, 2024, 155
  • [32] Brain Age Prediction: A Comparison between Machine Learning Models Using Brain Morphometric Data
    Han, Juhyuk
    Kim, Seo Yeong
    Lee, Junhyeok
    Lee, Won Hee
    SENSORS, 2022, 22 (20)
  • [33] Investigation of Relationships between Discrete and Dimensional Emotion Models in Affective Picture Databases Using Unsupervised Machine Learning
    Horvat, Marko
    Jovic, Alan
    Burnik, Kristijan
    APPLIED SCIENCES-BASEL, 2022, 12 (15)
  • [34] Comparison of predictive models for knee pain and analysis of individual and physical activity variables using interpretable machine learning
    Kim, Jun-hee
    KNEE, 2025, 54 : 146 - 153
  • [35] Goats on the Move: Evaluating Machine Learning Models for Goat Activity Analysis Using Accelerometer Data
    Hollevoet, Arthur
    De Waele, Timo
    Peralta, Daniel
    Tuyttens, Frank
    De Poorter, Eli
    Shahid, Adnan
    ANIMALS, 2024, 14 (13)
  • [36] Robustness Analysis of Machine Learning Models Using Domain-Specific Test Data Perturbation
    Lambert, Marian
    Schuster, Thomas
    Kessel, Marcus
    Atkinson, Colin
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT I, 2023, 14115 : 158 - 170
  • [37] Using machine learning techniques in the construction of models .2. Data analysis with rule induction
    Dzeroski, S
    Grbovic, J
    Walley, WJ
    Kompare, B
    ECOLOGICAL MODELLING, 1997, 95 (01) : 95 - 111
  • [38] Comparative Analysis of Intrusion Detection Models using Big Data Analytics and Machine Learning Techniques
    Alaketu, Muyideen Ayodeji
    Oguntimilehin, Abiodun
    Olatunji, Kehinde Adebola
    Abiola, Oluwatoyin Bunmi
    Badeji-Ajisafe, Bukola
    Akinduyite, Christiana Olanike
    Obamiyi, Stephen Eyitayo
    Babalola, Gbemisola Olutosin
    Okebule, Toyin
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2024, 21 (02) : 326 - 337
  • [39] A comprehensive analysis and prediction of earthquake magnitude based on position and depth parameters using machine and deep learning models
    Rachna Jain
    Anand Nayyar
    Simrann Arora
    Akash Gupta
    Multimedia Tools and Applications, 2021, 80 : 28419 - 28438
  • [40] A comprehensive analysis and prediction of earthquake magnitude based on position and depth parameters using machine and deep learning models
    Jain, Rachna
    Nayyar, Anand
    Arora, Simrann
    Gupta, Akash
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (18) : 28419 - 28438