Identification of significant risks in pediatric acute lymphoblastic leukemia (ALL) through machine learning (ML) approach

被引:27
|
作者
Mahmood, Nasir [1 ,2 ]
Shahid, Saman [3 ]
Bakhshi, Taimur [3 ]
Riaz, Sehar [4 ,5 ]
Ghufran, Hafiz [4 ,5 ]
Yaqoob, Muhammad [5 ,6 ]
机构
[1] Univ Hlth Sci UHS, Dept Biochem Human Genet & Mol Biol, Lahore, Pakistan
[2] Univ Toronto, Dept Cell & Syst Biol, Toronto, ON, Canada
[3] Natl Univ Comp & Emerging Sci NUCES, Fdn Adv Sci & Technol FAST, Dept Sci & Humanities, Lahore, Pakistan
[4] Childrens Hosp, Sch Allied Hlth Sci, Lahore, Pakistan
[5] Inst Child Hlth, Lahore, Pakistan
[6] Childrens Hosp, Dept Med Genet, Lahore, Pakistan
关键词
Pediatric ALL; Machine learning (ML); Classification and regression trees (CART); Platelets; Hemoglobin; Environmental factors; CHILDHOOD LEUKEMIA; GENETIC POLYMORPHISMS; DRINKING-WATER; CHROMOSOMAL-ABNORMALITIES; THROMBOTIC COMPLICATIONS; SOCIOECONOMIC-STATUS; CHILDREN; SUSCEPTIBILITY; POPULATION; MUTATIONS;
D O I
10.1007/s11517-020-02245-2
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Pediatric acute lymphoblastic leukemia (ALL) through machine learning (ML) technique was analyzed to determine the significance of clinical and phenotypic variables as well as environmental conditions that can identify the underlying causes of child ALL. Fifty pediatric patients (n = 50) included who were diagnosed with acute lymphoblastic leukemia (ALL) according to the inclusion and exclusion criteria. Clinical variables comprised of the blood biochemistry (CBC, LFTs, RFTs) results, and distribution of type of ALL, i.e., T ALL or B ALL. Phenotypic data included the age, sex of the child, and consanguinity, while environmental factors included the habitat, socioeconomic status, and access to filtered drinking water. Fifteen different features/attributes were collected for each case individually. To retrieve most useful discriminating attributes, four different supervised ML algorithms were used including classification and regression trees (CART), random forest (RM), gradient boosted machine (GM), and C5.0 decision tree algorithm. To determine the accuracy of the derived CART algorithm on future data, a ten-fold cross validation was performed on the present data set. The ALL was common in children of age below 5 years in male patients whole belonged to middle class family of rural areas. (B-ALL) was most frequent as compared with T-ALL. The consanguinity was present in 54% of cases. Low levels of platelets and hemoglobin and high levels of white blood cells were reported in child ALL patients. CART provided the best and complete fit for the entire data set yielding a 99.83% model fit accuracy, and a misclassification of 0.17% on the entire sample space, while C5.0 reported 98.6%, random forest 94.44%, and gradient boosted machine resulted in 95.61% fitting. The variable importance of each primary discriminating attribute is platelet 43%, hemoglobin 24%, white blood cells 4%, and sex of the child 4%. An overall accuracy of 87.4% was recorded for the classifier. Platelet count abnormality can be considered as a major factor in predicting pediatric ALL. The machine learning algorithms can be applied efficiently to provide details for the prognosis for better treatment outcome. Graphical Identification of significant risks in pediatric acute lymphoblastic leukemia (ALL) through machine learning (ML) approach.
引用
收藏
页码:2631 / 2640
页数:10
相关论文
共 50 条
  • [31] IDENTIFICATION OF RISK FACTORS FOR HYPERSENSITIVITY REACTIONS TO PEG-ASPARAGINASE IN PEDIATRIC PATIENTS WITH ACUTE LYMPHOBLASTIC LEUKEMIA (ALL)
    Dominick, Karissa
    Zembillas, Anthony
    Ketz, Jeff
    Hanna, Rabi
    Flagg, Aron
    Dahl, Elizabeth
    PEDIATRIC BLOOD & CANCER, 2015, 62 : 95 - 95
  • [32] Identification of Acute Lymphoblastic Leukemia in Microscopic Blood Image Using Image Processing and Machine Learning Algorithms
    Rajpurohit, Subhash
    Patil, Sanket
    Choudhary, Nitu
    Gavasane, Shreya
    Kosamkar, Pranali
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 2359 - 2363
  • [33] Predictors of Being Overweight or Obese in Survivors of Pediatric Acute Lymphoblastic Leukemia (ALL)
    Zhang, Fang Fang
    Rodday, Angie Mae
    Kelly, Michael J.
    Must, Aviva
    MacPherson, Cathy
    Roberts, Susan B.
    Saltzman, Edward
    Parsons, Susan K.
    PEDIATRIC BLOOD & CANCER, 2014, 61 (07) : 1263 - 1269
  • [34] High throughput transcriptome sequencing or pediatric relapsed acute lymphoblastic leukemia (ALL)
    Hogan, L. E.
    Mason, C.
    Meyer, J.
    Wang, J.
    Tang, Z.
    Brown, S.
    Morrison, D. J.
    Hunger, S.
    Raetz, A.
    Carroll, W. L.
    JOURNAL OF CLINICAL ONCOLOGY, 2010, 28 (15)
  • [35] Lympocyte subpopulation disturbances during chemotherapy for pediatric acute lymphoblastic leukemia (ALL).
    Kostaridou, S
    Polychronopoulou-Androulakaki, S
    Panagiotou, J
    Psarra, K
    Kapsimali, V
    Tsagraki, K
    Katevas, P
    Papasteriadis, C
    Haidas, S
    BLOOD, 2000, 96 (11) : 321A - 321A
  • [36] Fever at diagnosis of pediatric acute lymphoblastic leukemia (ALL): Are antibiotics really necessary?
    Khurana, Monica
    Feusner, James Henry
    Lee, Brian
    JOURNAL OF CLINICAL ONCOLOGY, 2012, 30 (15)
  • [37] Bone Density in Pediatric Patients with Acute Lymphoblastic Leukemia (ALL): A Literature Review
    Ghassemi, Ali
    Ghaemi, Nosrat
    Yazdi, Monireh Saffar
    INTERNATIONAL JOURNAL OF PEDIATRICS-MASHHAD, 2015, 3 (01): : 475 - 480
  • [38] Altered expression and function of NK receptors in pediatric acute lymphoblastic leukemia (ALL)
    Mathew, S.
    Powers, S.
    Bowman, W. P.
    Aldy, K.
    Mathew, P.
    EUROPEAN JOURNAL OF IMMUNOLOGY, 2016, 46 : 233 - 233
  • [39] Variation in antibiotic use in pediatric acute lymphoblastic leukemia (ALL) by hospital pediatric volume.
    Wilkes, Jennifer J.
    Xiao, Rui
    Seif, Ailx Eden
    Huang, Yuan-Shung
    Vendetti, Neika D.
    Rheingold, Susan R.
    Aplenc, Richard
    Hennessy, Sean
    Fisher, Brian
    JOURNAL OF CLINICAL ONCOLOGY, 2014, 32 (15)
  • [40] Applying machine learning to identify pediatric patients with newly diagnosed acute lymphoblastic leukemia using administrative data
    Cao, Lusha
    Huang, Yuan-shung
    Getz, Kelly D.
    Seif, Alix E.
    Ruiz, Jenny
    Miller, Tamara P.
    Fisher, Brian T.
    Aplenc, Richard
    Li, Yimei
    PEDIATRIC BLOOD & CANCER, 2024, 71 (03)