Predicting Heart Cell Types by Using Transcriptome Profiles and a Machine Learning Method

被引:44
|
作者
Ding, Shijian [1 ]
Wang, Deling [2 ]
Zhou, Xianchao [3 ]
Chen, Lei [4 ]
Feng, Kaiyan [5 ]
Xu, Xianling [6 ]
Huang, Tao [7 ,8 ]
Li, Zhandong [9 ]
Cai, Yudong [1 ]
机构
[1] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[2] Sun Yat Sen Univ, Collaborat Innovat Ctr Canc Med, Dept Med Imaging, State Key Lab Oncol South China,Canc Ctr, Guangzhou 510060, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Publ Hlth, Sch Med, Ctr Single Cell Omics, Shanghai 200025, Peoples R China
[4] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
[5] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou 510507, Peoples R China
[6] Guangdong AIB Polytech Coll, Guangzhou 510507, Peoples R China
[7] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[8] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai 200031, Peoples R China
[9] Jilin Engn Normal Univ, Coll Food Engn, Changchun 130052, Peoples R China
来源
LIFE-BASEL | 2022年 / 12卷 / 02期
基金
国家重点研发计划;
关键词
heart cell; single-cell profiles; machine learning method; biomarker; decision rule; NATRIURETIC PEPTIDES; MAMMALIAN HEART; ATRIAL; MUTATIONS; PROTEINS; GENE; TTN;
D O I
10.3390/life12020228
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The heart is an essential organ in the human body. It contains various types of cells, such as cardiomyocytes, mesothelial cells, endothelial cells, and fibroblasts. The interactions between these cells determine the vital functions of the heart. Therefore, identifying the different cell types and revealing the expression rules in these cell types are crucial. In this study, multiple machine learning methods were used to analyze the heart single-cell profiles with 11 different heart cell types. The single-cell profiles were first analyzed via light gradient boosting machine method to evaluate the importance of gene features on the profiling dataset, and a ranking feature list was produced. This feature list was then brought into the incremental feature selection method to identify the best features and build the optimal classifiers. The results suggested that the best decision tree (DT) and random forest classification models achieved the highest weighted F1 scores of 0.957 and 0.981, respectively. The selected features, such as NPPA, LAMA2, DLC1, and the classification rules extracted from the optimal DT classifier played a crucial role in cardiac structure and function in recent research and enrichment analysis. In particular, some lncRNAs (LINC02019, NEAT1) were found to be quite important for the recognition of different cardiac cell types. In summary, these findings provide a solid academic foundation for the development of molecular diagnostics and biomarker discovery for cardiac diseases.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Predicting Heart Diseases Using Machine Learning and Different Data Classification Techniques
    El-Sofany, Hosam F.
    IEEE ACCESS, 2024, 12 : 106146 - 106160
  • [22] Effectively Predicting the Presence of Coronary Heart Disease Using Machine Learning Classifiers
    ul Hassan, Ch Anwar
    Iqbal, Jawaid
    Irfan, Rizwana
    Hussain, Saddam
    Algarni, Abeer D.
    Bukhari, Syed Sabir Hussain
    Alturki, Nazik
    Ullah, Syed Sajid
    SENSORS, 2022, 22 (19)
  • [23] Predicting the Classification of Heart Failure Patients Using Optimized Machine Learning Algorithms
    Ahmed, Marzia
    Sulaiman, Mohd Herwan
    Hassan, Md Maruf
    Bhuiyan, Touhid
    IEEE ACCESS, 2025, 13 : 30555 - 30569
  • [24] Predicting gross domestic product using the ensemble machine learning method
    Adewale, M. D.
    Ebem, D. U.
    Awodele, O.
    Sambo-Magaji, A.
    Aggrey, E. M.
    Okechalu, E. A.
    Donatus, R. E.
    Olayanju, K. A.
    Owolabi, A. F.
    Oju, J. U.
    Ubadike, O. C.
    Otu, G. A.
    Muhammed, U. I.
    Danjuma, O. R.
    Oluyide, O. P.
    SYSTEMS AND SOFT COMPUTING, 2024, 6
  • [25] Predicting Supply Chain Upstreamness Using An Ensemble Machine Learning Method
    Zhao, Siying
    Ding, Fengshi
    Wang, Zi'ang
    Huang, Jin
    2024 12TH INTERNATIONAL CONFERENCE ON TRAFFIC AND LOGISTIC ENGINEERING, ICTLE 2024, 2024, : 28 - 32
  • [26] Predicting cell types with supervised contrastive learning on cells and their types
    Yusri Dwi Heryanto
    Yao-zhong Zhang
    Seiya Imoto
    Scientific Reports, 14
  • [27] Predicting cell types with supervised contrastive learning on cells and their types
    Heryanto, Yusri Dwi
    Zhang, Yao-zhong
    Imoto, Seiya
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [28] Machine learning methods for predicting major types of rheumatic heart diseases in children of Southern Punjab, Pakistan
    Shahid, Sana
    Khurram, Haris
    Billah, Baki
    Akbar, Atif
    Shehzad, Muhammad Ahmed
    Shabbir, Muhammad Farhan
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 9
  • [29] Predicting the Availability of Hematopoietic Stem Cell Donors Using Machine Learning
    Li, Ying
    Masiliune, Ausra
    Winstone, David
    Gasieniec, Leszek
    Wong, Prudence
    Lin, Hong
    Pawson, Rachel
    Parkes, Guy
    Hadley, Andrew
    BIOLOGY OF BLOOD AND MARROW TRANSPLANTATION, 2020, 26 (08) : 1406 - 1413
  • [30] Predicting Gene Mutations in Renal Cell Carcinoma Using Machine Learning
    Staub, D.
    Hannan, R.
    Thomas, K.
    Jiang, S.
    Pedrosa, I.
    Kapur, P.
    Brugarolas, J.
    Wang, J.
    MEDICAL PHYSICS, 2015, 42 (06) : 3586 - 3587