Identification of significant risks in pediatric acute lymphoblastic leukemia (ALL) through machine learning (ML) approach

被引:27
|
作者
Mahmood, Nasir [1 ,2 ]
Shahid, Saman [3 ]
Bakhshi, Taimur [3 ]
Riaz, Sehar [4 ,5 ]
Ghufran, Hafiz [4 ,5 ]
Yaqoob, Muhammad [5 ,6 ]
机构
[1] Univ Hlth Sci UHS, Dept Biochem Human Genet & Mol Biol, Lahore, Pakistan
[2] Univ Toronto, Dept Cell & Syst Biol, Toronto, ON, Canada
[3] Natl Univ Comp & Emerging Sci NUCES, Fdn Adv Sci & Technol FAST, Dept Sci & Humanities, Lahore, Pakistan
[4] Childrens Hosp, Sch Allied Hlth Sci, Lahore, Pakistan
[5] Inst Child Hlth, Lahore, Pakistan
[6] Childrens Hosp, Dept Med Genet, Lahore, Pakistan
关键词
Pediatric ALL; Machine learning (ML); Classification and regression trees (CART); Platelets; Hemoglobin; Environmental factors; CHILDHOOD LEUKEMIA; GENETIC POLYMORPHISMS; DRINKING-WATER; CHROMOSOMAL-ABNORMALITIES; THROMBOTIC COMPLICATIONS; SOCIOECONOMIC-STATUS; CHILDREN; SUSCEPTIBILITY; POPULATION; MUTATIONS;
D O I
10.1007/s11517-020-02245-2
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Pediatric acute lymphoblastic leukemia (ALL) through machine learning (ML) technique was analyzed to determine the significance of clinical and phenotypic variables as well as environmental conditions that can identify the underlying causes of child ALL. Fifty pediatric patients (n = 50) included who were diagnosed with acute lymphoblastic leukemia (ALL) according to the inclusion and exclusion criteria. Clinical variables comprised of the blood biochemistry (CBC, LFTs, RFTs) results, and distribution of type of ALL, i.e., T ALL or B ALL. Phenotypic data included the age, sex of the child, and consanguinity, while environmental factors included the habitat, socioeconomic status, and access to filtered drinking water. Fifteen different features/attributes were collected for each case individually. To retrieve most useful discriminating attributes, four different supervised ML algorithms were used including classification and regression trees (CART), random forest (RM), gradient boosted machine (GM), and C5.0 decision tree algorithm. To determine the accuracy of the derived CART algorithm on future data, a ten-fold cross validation was performed on the present data set. The ALL was common in children of age below 5 years in male patients whole belonged to middle class family of rural areas. (B-ALL) was most frequent as compared with T-ALL. The consanguinity was present in 54% of cases. Low levels of platelets and hemoglobin and high levels of white blood cells were reported in child ALL patients. CART provided the best and complete fit for the entire data set yielding a 99.83% model fit accuracy, and a misclassification of 0.17% on the entire sample space, while C5.0 reported 98.6%, random forest 94.44%, and gradient boosted machine resulted in 95.61% fitting. The variable importance of each primary discriminating attribute is platelet 43%, hemoglobin 24%, white blood cells 4%, and sex of the child 4%. An overall accuracy of 87.4% was recorded for the classifier. Platelet count abnormality can be considered as a major factor in predicting pediatric ALL. The machine learning algorithms can be applied efficiently to provide details for the prognosis for better treatment outcome. Graphical Identification of significant risks in pediatric acute lymphoblastic leukemia (ALL) through machine learning (ML) approach.
引用
收藏
页码:2631 / 2640
页数:10
相关论文
共 50 条
  • [21] Ocular Lesions in Newly Diagnosed Pediatric Acute Lymphoblastic Leukemia (ALL)
    Kikuchi, Natsumi
    Azuma, Sakiko
    Noguchi, Mayuko
    Yamasaki, Kai
    Nitani, Chika
    Okada, Keiko
    Fujisaki, Hiroyuki
    Hara, Junichi
    PEDIATRIC BLOOD & CANCER, 2021, 68
  • [22] INCIDENCE OF CYTOGENETIC ABERRATIONS IN PEDIATRIC ACUTE LYMPHOBLASTIC LEUKEMIA (ALL) IN LATVIA
    Grivina, I.
    Kovalova, Z.
    Kursite, S.
    Nikulshin, S.
    HAEMATOLOGICA-THE HEMATOLOGY JOURNAL, 2010, 95 : 680 - 681
  • [23] Differential expression of immune receptors in pediatric Acute Lymphoblastic Leukemia (ALL)
    Mathew, Stephen O.
    Ahmed, Nourhan G.
    Powers, Sheila
    Jose, Roslin
    Mathew, Porunelloor A.
    JOURNAL OF IMMUNOLOGY, 2023, 210 (01):
  • [24] Mechanisms of defective erythropoiesis and anemia in pediatric acute lymphoblastic leukemia (ALL)
    MacGregor Steele
    Aru Narendran
    Annals of Hematology, 2012, 91 : 1513 - 1518
  • [25] HYPERLEUKOCYTOSIS IN PEDIATRIC ACUTE LYMPHOBLASTIC LEUKEMIA (ALL): DOES IT MAKE A DIFFERENCE?
    Al-Sweedan, Suleimman
    Jafri, Rafat
    Siddiqui, Khawar
    Alahmari, Ali
    Ghemlas, Ibrahim
    Alseraihy, Amal
    PEDIATRIC BLOOD & CANCER, 2016, 63 : S45 - S45
  • [26] KIRS GENE PROFILE IN PEDIATRIC B ACUTE LYMPHOBLASTIC LEUKEMIA (ALL)
    Pana, Zoe Dorothea
    Papi, Rigini
    Hatzipantelis, Emmanouil
    Tragiannidis, Athanassios
    Farmaki, Evagelia
    Kyriakidis, Dimitrios
    Papageorgiou, Theodotis
    Athanassiadou, Fani
    PEDIATRIC BLOOD & CANCER, 2012, 59 (06) : 1036 - 1037
  • [27] MANAGEMENT OF POST-IRRADIATION MENINGEAL LEUKEMIA (ML) IN CHILDHOOD ACUTE LYMPHOBLASTIC-LEUKEMIA (ALL)
    KUN, L
    KAPLAN, B
    LAUER, S
    MULHERN, R
    PROCEEDINGS OF THE AMERICAN ASSOCIATION FOR CANCER RESEARCH, 1981, 22 (MAR): : 485 - 485
  • [28] A new approach to high risk pediatric acute lymphoblastic leukemia?
    Hanna, Diane
    Anderson, Mary Ann
    TRANSLATIONAL CANCER RESEARCH, 2016, 5 : S1428 - S1432
  • [29] Towards a pediatric approach in adults with acute lymphoblastic leukemia (ALL): The GRAALL-2003 study.
    Huguet, Francoise
    Raffoux, Emmanuel
    Thomas, Xavier
    Leguay, Thibaut
    Chevallier, Patrice
    Escoffre, Martine
    Reman, Oumedaly
    Caillot, Denis
    Vey, Norbert
    Boissel, Nicolas
    Hunault, Mathilde
    Buzyn, Agnes
    Chalandon, Yves
    Delannoy, Andre
    Vernant, Jean-Paul
    Lheritier, Veronique
    Bene, Marie-Christine
    Macintyre, Elizabeth
    Ifrah, Norbert
    Dombret, Herve
    BLOOD, 2006, 108 (11) : 48A - 48A
  • [30] Identification of prognostic protein biomarkers in childhood acute lymphoblastic leukemia (ALL)
    Jiang, Nan
    Kham, Shirley Kow Yin
    Koh, Grace Shimin
    Lim, Joshua Yew Suang
    Ariffin, Hany
    Chew, Fook Tim
    Yeoh, Allen Eng Juh
    JOURNAL OF PROTEOMICS, 2011, 74 (06) : 843 - 857