Selection of key genes for dilated cardiomyopathy based on machine learning algorithms and assessment of diagnostic accuracy

被引:1
|
作者
Chen, Tingting [1 ]
Xuan, Xiulin [1 ]
Ni, Jiajia [1 ]
Jiang, Shuyin [2 ]
机构
[1] Zhejiang Univ, Hangzhou Peoples Hosp 1, Dept Cardiovasc Med, Sch Med, Hangzhou, Peoples R China
[2] Zhejiang Univ, Hangzhou Peoples Hosp 1, Dept Gastroenterol, Sch Med, 261 Huansha Rd, Hangzhou 310001, Peoples R China
关键词
Dilated cardiomyopathy; machine learning algorithms; immune microenvironment;
D O I
10.21037/jtd-23-1086
中图分类号
R56 [呼吸系及胸部疾病];
学科分类号
摘要
Background: The mechanisms of the occurrence and progression of dilated cardiomyopathy are still unclear and further exploration is needed. The upgrading of programming languages and the improvement of biological databases have created conditions for us to explore the structural and functional information of biological molecules at the nucleic acid and protein levels, screen key pathogenic genes, and elucidate pathogenic mechanisms. This study aimed to screen key pathogenic genes using machine learning algorithms and explore the correlation between key genes and immune microenvironment through transcriptome sequencing data sets of myocardial samples from patients with dilated cardiomyopathy, providing new ideas for elucidating the pathogenesis of the disease. Methods: The transcriptome sequencing data sets of heart tissue from patients with dilated cardiomyopathy were downloaded from the Gene Expression Omnibus ( GEO) database ( GSE29819 and GSE21610). Differentially expressed genes (DEGs) were screened between pathological and normal tissues. The key genes were screened using least absolute shrinkage and selection operator (LASSO) regression analysis and random forest tree algorithms. The diagnostic efficiency of the key genes for the disease was evaluated using the receiver operating characteristic (ROC) curve. Results: Compared with the normal heart tissue (control group) samples, there were 213 DEGs in the heart tissue samples of patients with dilated cardiomyopathy (treat group), including 101 upregulated and 102 downregulated genes. CCL5 and CTGF were highly expressed in the treat group compared to the control group. The ROC curve showed that the areas under the curve (AUCs) of CCL5 and CTGF were 0.821 and 0.902, respectively (P<0.05). In the treat group samples, CCL5 was positively correlated with the infiltration content of most immune cell subtypes. Conclusions: CCL5 and CTGF are key disease-causing genes in dilated cardiomyopathy and have good diagnostic efficiency for the disease. CCL5 and CTGF may be related to immune cell enrichment and myocardial fibrosis, respectively.
引用
收藏
页码:4445 / 4455
页数:11
相关论文
共 50 条
  • [41] Assessment of Ensemble-Based Machine Learning Algorithms for Exoplanet Identification
    Luz, Thiago S. F.
    Braga, Rodrigo A. S.
    Ribeiro, Enio R.
    ELECTRONICS, 2024, 13 (19)
  • [42] An assessment of machine learning algorithms for healthcare analysis based on improved MapReduce
    Sukanya, J.
    Gandhi, K. Rajiv
    Palanisamy, V.
    ADVANCES IN ENGINEERING SOFTWARE, 2022, 173
  • [43] Estimation of Agronomic Characters of Wheat Based on Variable Selection and Machine Learning Algorithms
    Wang, Dunliang
    Li, Rui
    Liu, Tao
    Sun, Chengming
    Guo, Wenshan
    AGRONOMY-BASEL, 2023, 13 (11):
  • [44] Machine learning for sports betting: Should model selection be based on accuracy or calibration?
    Walsh, Conor
    Joshi, Alok
    MACHINE LEARNING WITH APPLICATIONS, 2024, 16
  • [45] Feature Selection and Intrusion Detection in Cloud Environment based on Machine Learning Algorithms
    Javadpour, Amir
    Abharian, Sanaz Kazemi
    Wang, Guojun
    2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 1417 - 1421
  • [46] Congestive heart failure prediction based on feature selection and machine learning algorithms
    Morillo-Velepucha, Diego
    Reategui, Ruth
    Valdiviezo-Diaz, Priscila
    Barba-Guaman, Luis
    2022 17TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2022,
  • [47] A Machine Learning-Based Framework for Dynamic Selection of Congestion Control Algorithms
    Zhou, Jianer
    Qiu, Xinyi
    Li, Zhenyu
    Li, Qing
    Tyson, Gareth
    Duan, Jingpu
    Wang, Yi
    Pan, Heng
    Wu, Qinghua
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (04) : 1566 - 1581
  • [48] COMPREHENSIVE ASSESSMENT OF RARE GENETIC VARIATION IN DILATED CARDIOMYOPATHY GENES IN PATIENTS AND CONTROLS
    Tayal, Upasana
    Mazzarotto, Francesco
    Buchan, Rachel
    Walsh, Roddy
    Barton, Paul
    Ware, James
    Cook, Stuart
    HEART, 2015, 101 : A41 - A42
  • [49] Subtypes and Mechanisms of Hypertrophic Cardiomyopathy Proposed by Machine Learning Algorithms
    Glavaski, Mila
    Preveden, Andrej
    Jakovljevic, Dorde
    Filipovic, Nenad
    Velicki, Lazar
    LIFE-BASEL, 2022, 12 (10):
  • [50] Bearing fault diagnostic using machine learning algorithms
    Sawaqed, Laith S.
    Alrayes, Ayman M.
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2020, 9 (04) : 341 - 350