Predicting Student Dropout Rates Using Supervised Machine Learning: Insights from the 2022 National Education Accessibility Survey in Somaliland

被引:3
|
作者
Hassan, Mukhtar Abdi [1 ]
Muse, Abdisalam Hassan [1 ]
Nadarajah, Saralees [2 ]
机构
[1] Amoud Univ, Fac Sci & Humanities, Sch Postgrad Studies & Res SPGSR, Borama 25263, Somalia
[2] Univ Manchester, Dept Math, Manchester M13 9PL, England
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 17期
关键词
student dropout; machine learning; Somaliland; national education accessibility survey;
D O I
10.3390/app14177593
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
High student dropout rates are a critical issue in Somaliland, significantly impeding educational progress and socioeconomic development. This study leveraged data from the 2022 National Education Accessibility Survey (NEAS) to predict student dropout rates using supervised machine learning techniques. Various algorithms, including logistic regression (LR), probit regression (PR), na & iuml;ve Bayes (NB), decision tree (DT), random forest (RF), support vector machine (SVM), and K-nearest neighbors (KNN), were employed to analyze the survey data. The analysis revealed school dropout rate of 12.67%. Key predictors of dropout included student's grade, age, school type, household income, and type of housing. Logistic regression and probit regression models highlighted age and student's grade as critical predictors, while na & iuml;ve Bayes and random forest models underscored the significance of household income and housing type. Among the models, random forest demonstrated the highest accuracy at 95.00%, indicating its effectiveness in predicting dropout rates. The findings from this study provide valuable insights for educational policymakers and stakeholders in Somaliland. By identifying and understanding the key factors influencing dropout rates, targeted interventions can be designed to enhance student retention and improve educational outcomes. The dominant role of demographic and educational factors, particularly age and student's grade, underscores the necessity for focused strategies to reduce dropout rates and promote inclusive education in Somaliland.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Predicting student satisfaction of emergency remote learning in higher education during COVID-19 using machine learning techniques
    Ho, Indy Man Kit
    Cheong, Kai Yuen
    Weldon, Anthony
    PLOS ONE, 2021, 16 (04):
  • [22] Predicting Growth Trajectory in Vestibular Schwannoma From Radiomic Data Using Supervised Machine Learning Techniques
    Grady, Conor
    Wang, Hesheng
    Schnurman, Zane
    Qu, Tanxia
    Kondziolka, Douglas
    NEUROSURGERY, 2019, 66 : 76 - 77
  • [23] Relationship between lifestyle factors and cardiovascular disease prevalence in Somaliland: A supervised machine learning approach using data from Hargeisa Group Hospital, 2024
    Muse, Yahye Hassan
    Hassan, Mukhtar Abdi
    Abdikarim, Hodo
    Botan, Nuh
    Hassan, Kaltun
    Dahir, Idiris
    Suleiman, Ayanle
    Muse, Abdisalam Hassan
    CURRENT PROBLEMS IN CARDIOLOGY, 2025, 50 (03)
  • [24] Predicting the Ecological Quality Status of Marine Environments from eDNA Metabarcoding Data Using Supervised Machine Learning
    Cordier, Tristan
    Esling, Philippe
    Lejzerowicz, Franck
    Visco, Joana
    Ouadahi, Amine
    Martins, Catarina
    Cedhagen, Tomas
    Pawlowski, Jan
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2017, 51 (16) : 9118 - 9126
  • [25] Predicting rock mass strength from drilling data using synergistic unsupervised and supervised machine learning approaches
    Komadja, Gbetoglo Charles
    Westman, Erik
    Rana, Aditya
    Vitalis, Anye
    EARTH SCIENCE INFORMATICS, 2025, 18 (03)
  • [26] How Valid Are Trust Survey Measures? New Insights From Open-Ended Probing Data and Supervised Machine Learning
    Landesvatter, Camille
    Bauer, Paul
    SOCIOLOGICAL METHODS & RESEARCH, 2024,
  • [27] Predicting Caregiver Burden in Multiple Myeloma: Insights from the Carmma Study Using Machine Learning Models
    Costa, Carlos
    Roque, Adriana Isabel
    Santinha, Joao
    Neves, Manuel
    Sarmento-Ribeiro, Ana Bela
    Gerivaz, Rita
    Tome, Ana Luisa
    Martins, Helena
    Vieira, Joana
    Afonso, Sofia
    Santos, Joana
    Afonso, Celina
    Jorge, Ana
    Freitas, Jose
    Ramos, Ines
    Sousa, Patricia
    Cesar, Paula
    Garrido, Teresa
    Rochate, Dina
    Silveira, Maria Pedro
    Miranda, Fernanda Trigo
    Bergantim, Rui
    Santos, Catarina Geraldes
    Joao, Cristina
    BLOOD, 2024, 144 : 4718 - 4719
  • [28] Predicting the Zinc Content in Rice from Farmland Using Machine Learning Models: Insights from Universal Geochemical Parameters
    Geng, Wenda
    Li, Tingting
    Zhu, Xin
    Dou, Lei
    Liu, Zijia
    Qian, Kun
    Ye, Guiqi
    Lin, Kun
    Li, Bo
    Ma, Xudong
    Hou, Qingye
    Yu, Tao
    Yang, Zhongfang
    APPLIED SCIENCES-BASEL, 2025, 15 (03):
  • [29] Predicting moderate drinking behaviors in National Health and Nutrition Examination Survey participants using biochemical and demographical factors with machine learning
    Leaks, Kalan
    Norden-Krichmar, Trina
    Brody, James P.
    ALCOHOL, 2023, 113 : 1 - 10
  • [30] Predicting multifaceted risks using machine learning in atrial fibrillation: insights from GLORIA-AF study
    Lu, Juan
    Bisson, Arnaud
    Bennamoun, Mohammed
    Zheng, Yalin
    Sanfilippo, Frank M.
    Hung, Joseph
    Briffa, Tom
    McQuillan, Brendan
    Stewart, Jonathon
    Figtree, Gemma
    Huisman, Menno V.
    Dwivedi, Girish
    Lip, Gregory Y. H.
    EUROPEAN HEART JOURNAL - DIGITAL HEALTH, 2024, 5 (03): : 235 - 246