Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014-15 dataset

被引:19
|
作者
Mfateneza, Emmanuel [1 ]
Rutayisire, Pierre Claver [2 ]
Biracyaza, Emmanuel [3 ]
Musafiri, Sanctus [4 ]
Mpabuka, Willy Gasafari [5 ]
机构
[1] Univ Rwanda, African Ctr Excellence Data Sci, Kigali, Rwanda
[2] Univ Rwanda, Appl Stat Dept, Kigali, Rwanda
[3] Prison Fellowship Rwanda, Kigali, Rwanda
[4] Univ Rwanda, Clin Dept Internal Med, Kigali, Rwanda
[5] Transparency Int Rwanda, Kigali, Rwanda
关键词
Infant mortality; Machine Learning; Logistic regression; Model accuracy; MODEL;
D O I
10.1186/s12884-022-04699-8
中图分类号
R71 [妇产科学];
学科分类号
100211 ;
摘要
Background Extensive research on infant mortality (IM) exists in developing countries; however, most of the methods applied thus far relied on conventional regression analyses with limited prediction capability. Advanced of Machine Learning (AML) methods provide accurate prediction of IM; however, there is no study conducted using ML methods in Rwanda. This study, therefore, applied Machine Learning Methods for predicting infant mortality in Rwanda. Methods A cross-sectional study design was conducted using the 2014-15 Rwanda Demographic and Health Survey. Python software version 3.8 was employed to test and apply ML methods through Random Forest (RF), Decision Tree, Support Vector Machine and Logistic regression. STATA version 13 was used for analysing conventional methods. Evaluation metrics methods specifically confusion matrix, accuracy, precision, recall, F1 score, and Area under the Receiver Operating Characteristics (AUROC) were used to evaluate the performance of predictive models. Results Ability of prediction was between 68.6% and 61.5% for AML. We preferred with the RF model (61.5%) presenting the best performance. The RF model was the best predictive model of IM with accuracy (84.3%), recall (91.3%), precision (80.3%), F1 score (85.5%), and AUROC (84.2%); followed by decision tree model with model accuracy (83%), recall (91%), precision (79%), F1 score (84.67%) and AUROC(82.9%), followed by support vector machine with model accuracy (68.6%), recall (74.9%), precision(67%), F1 score (70.73%) and AUROC (68.6%) and last was a logistic regression with the low accuracy of prediction (61.5%), recall (61.1%), precision (62.2%), F1 score (61.6%) and AUROC (61.5%) compared to other predictive models. Our predictive models showed that marital status, children ever born, birth order and wealth index are the 4 top predictors of IM. Conclusions In developing a predictive model, ML methods are used to classify certain hidden information that could not be detected by traditional statistical methods. Random Forest was classified as the best classifier to be used for the predictive models of IM.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014–15 dataset
    Emmanuel Mfateneza
    Pierre Claver Rutayisire
    Emmanuel Biracyaza
    Sanctus Musafiri
    Willy Gasafari Mpabuka
    BMC Pregnancy and Childbirth, 22
  • [2] Application of machine learning methods for predicting under-five mortality: analysis of Nigerian demographic health survey 2018 dataset
    Oduse Samuel
    Temesgen Zewotir
    Delia North
    BMC Medical Informatics and Decision Making, 24
  • [3] Application of machine learning methods for predicting under-five mortality: analysis of Nigerian demographic health survey 2018 dataset
    Samuel, Oduse
    Zewotir, Temesgen
    North, Delia
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [4] Application of machine learning methods for predicting childhood anaemia: Analysis of Ethiopian Demographic Health Survey of 2016
    Tesfaye, Solomon Hailemariam
    Seboka, Binyam Tariku
    Sisay, Daniel
    PLOS ONE, 2024, 19 (04):
  • [5] Factors associated with anaemia among pregnant women in Rwanda: an analysis of the Rwanda demographic and health survey of 2020
    Nuwabaine, Lilian
    Kawuki, Joseph
    Kamoga, Livingstone
    Sserwanja, Quraish
    Gatasi, Ghislaine
    Donkor, Elorm
    Mutisya, Linet M.
    Asiimwe, John Baptist
    BMC PREGNANCY AND CHILDBIRTH, 2024, 24 (01)
  • [6] Application of Machine Learning Algorithms in Predicting Extreme Rainfall Events in Rwanda
    Kagabo, James
    Kattel, Giri Raj
    Kazora, Jonah
    Shangwe, Charmant Nicolas
    Habiyakare, Fabien
    ATMOSPHERE, 2024, 15 (06)
  • [7] Assessing predictors of delayed antenatal care visits in Rwanda: a secondary analysis of Rwanda demographic and health survey 2010
    Anatole Manzi
    Fabien Munyaneza
    Francisca Mujawase
    Leonidas Banamwana
    Felix Sayinzoga
    Dana R Thomson
    Joseph Ntaganira
    Bethany L Hedt-Gauthier
    BMC Pregnancy and Childbirth, 14
  • [8] Assessing predictors of delayed antenatal care visits in Rwanda: a secondary analysis of Rwanda demographic and health survey 2010
    Manzi, Anatole
    Munyaneza, Fabien
    Mujawase, Francisca
    Banamwana, Leonidas
    Sayinzoga, Felix
    Thomson, Dana R.
    Ntaganira, Joseph
    Hedt-Gauthier, Bethany L.
    BMC PREGNANCY AND CHILDBIRTH, 2014, 14
  • [9] Prevalence and factors associated with caesarean section in Rwanda: a trend analysis of Rwanda demographic and health survey 2000 to 2019–20
    Peter M. Kibe
    Grace Wambura Mbuthia
    Duncan N. Shikuku
    Catherine Akoth
    James Odhiambo Oguta
    Loise Ng’ang’a
    Samwel Maina Gatimu
    BMC Pregnancy and Childbirth, 22
  • [10] Risk Factors Of Stunting Among Children Under 5 Years Of Age In The Eastern And Western Provinces Of Rwanda: Analysis Of Rwanda Demographic And Health Survey 2014/2015
    Habimana, Samuel
    Biracyaza, Emmanuel
    PEDIATRIC HEALTH MEDICINE AND THERAPEUTICS, 2019, 10 : 115 - 130