Predicting Heart Cell Types by Using Transcriptome Profiles and a Machine Learning Method

被引:44
|
作者
Ding, Shijian [1 ]
Wang, Deling [2 ]
Zhou, Xianchao [3 ]
Chen, Lei [4 ]
Feng, Kaiyan [5 ]
Xu, Xianling [6 ]
Huang, Tao [7 ,8 ]
Li, Zhandong [9 ]
Cai, Yudong [1 ]
机构
[1] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[2] Sun Yat Sen Univ, Collaborat Innovat Ctr Canc Med, Dept Med Imaging, State Key Lab Oncol South China,Canc Ctr, Guangzhou 510060, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Publ Hlth, Sch Med, Ctr Single Cell Omics, Shanghai 200025, Peoples R China
[4] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
[5] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou 510507, Peoples R China
[6] Guangdong AIB Polytech Coll, Guangzhou 510507, Peoples R China
[7] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[8] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai 200031, Peoples R China
[9] Jilin Engn Normal Univ, Coll Food Engn, Changchun 130052, Peoples R China
来源
LIFE-BASEL | 2022年 / 12卷 / 02期
基金
国家重点研发计划;
关键词
heart cell; single-cell profiles; machine learning method; biomarker; decision rule; NATRIURETIC PEPTIDES; MAMMALIAN HEART; ATRIAL; MUTATIONS; PROTEINS; GENE; TTN;
D O I
10.3390/life12020228
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The heart is an essential organ in the human body. It contains various types of cells, such as cardiomyocytes, mesothelial cells, endothelial cells, and fibroblasts. The interactions between these cells determine the vital functions of the heart. Therefore, identifying the different cell types and revealing the expression rules in these cell types are crucial. In this study, multiple machine learning methods were used to analyze the heart single-cell profiles with 11 different heart cell types. The single-cell profiles were first analyzed via light gradient boosting machine method to evaluate the importance of gene features on the profiling dataset, and a ranking feature list was produced. This feature list was then brought into the incremental feature selection method to identify the best features and build the optimal classifiers. The results suggested that the best decision tree (DT) and random forest classification models achieved the highest weighted F1 scores of 0.957 and 0.981, respectively. The selected features, such as NPPA, LAMA2, DLC1, and the classification rules extracted from the optimal DT classifier played a crucial role in cardiac structure and function in recent research and enrichment analysis. In particular, some lncRNAs (LINC02019, NEAT1) were found to be quite important for the recognition of different cardiac cell types. In summary, these findings provide a solid academic foundation for the development of molecular diagnostics and biomarker discovery for cardiac diseases.
引用
收藏
页数:17
相关论文
共 50 条
  • [11] Predicting congenital heart disease using machine learning techniques
    Rani, Samta
    Masood, Sarfaraz
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2020, 23 (01): : 293 - 303
  • [12] Prediction of Atmospheric Profiles With Machine Learning Using the Signature Method
    Fujita, M.
    Sugiura, N.
    Kouketsu, S.
    GEOPHYSICAL RESEARCH LETTERS, 2024, 51 (06)
  • [13] Predicting coronary heart disease in Chinese diabetics using machine learning
    Ma, Cai-Yi
    Luo, Ya-Mei
    Zhang, Tian-Yu
    Hao, Yu-Duo
    Xie, Xue-Qin
    Liu, Xiao-Wei
    Ren, Xiao-Lei
    He, Xiao-Lin
    Han, Yu-Mei
    Deng, Ke-Jun
    Yan, Dan
    Yang, Hui
    Tang, Hua
    Lin, Hao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
  • [14] Smart wearable model for predicting heart disease using machine learning
    Rani, S. V. Jansi
    Chandran, K. R. Sarath
    Ranganathan, Akshaya
    Chandrasekharan, M.
    Janani, B.
    Deepsheka, G.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (9) : 4321 - 4332
  • [15] A Machine Learning Model for Predicting Heart Disease using Ensemble Methods
    Samagh J.S.
    Singh D.
    International Journal of Advanced Computer Science and Applications, 2022, 13 (09) : 558 - 565
  • [16] Machine Learning Reveals Impacts of Smoking on Gene Profiles of Different Cell Types in Lung
    Ma, Qinglan
    Shen, Yulong
    Guo, Wei
    Feng, Kaiyan
    Huang, Tao
    Cai, Yudong
    LIFE-BASEL, 2024, 14 (04):
  • [17] Machine learning for predicting offshore vertical wind profiles
    Rouholahnejad, Farkhondeh
    Santos, Pedro
    Hung, Lin-Ya
    Gottschall, Julia
    EERA DEEPWIND CONFERENCE 2023, 2023, 2626
  • [18] PREDICT: A MACHINE LEARNING ALGORITHM FOR PREDICTING PHARMACOKINETIC PROFILES
    Xiao, J.
    Hayes, S.
    Hu, H.
    Yee, K.
    Patel, B.
    Johnson, M.
    CLINICAL PHARMACOLOGY & THERAPEUTICS, 2024, 115 : S98 - S98
  • [19] Machine Learning Techniques for Predicting Heart Diseases
    Taha, Mohammed A.
    Alsaidi, Saif Ali Abd Alradha
    Hussein, Reem Ali
    2022 INTERNATIONAL SYMPOSIUM ON INNOVATIVE INFORMATICS OF BISKRA, ISNIB, 2022, : 123 - 128
  • [20] Using Ensemble Machine Learning Methods for Predicting Risk of Readmission for Heart Failure
    Mahajan, Satish M.
    Ghani, Rayid
    MEDINFO 2019: HEALTH AND WELLBEING E-NETWORKS FOR ALL, 2019, 264 : 243 - 247