Predicting Heart Cell Types by Using Transcriptome Profiles and a Machine Learning Method

被引:44
|
作者
Ding, Shijian [1 ]
Wang, Deling [2 ]
Zhou, Xianchao [3 ]
Chen, Lei [4 ]
Feng, Kaiyan [5 ]
Xu, Xianling [6 ]
Huang, Tao [7 ,8 ]
Li, Zhandong [9 ]
Cai, Yudong [1 ]
机构
[1] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[2] Sun Yat Sen Univ, Collaborat Innovat Ctr Canc Med, Dept Med Imaging, State Key Lab Oncol South China,Canc Ctr, Guangzhou 510060, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Publ Hlth, Sch Med, Ctr Single Cell Omics, Shanghai 200025, Peoples R China
[4] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
[5] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou 510507, Peoples R China
[6] Guangdong AIB Polytech Coll, Guangzhou 510507, Peoples R China
[7] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[8] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai 200031, Peoples R China
[9] Jilin Engn Normal Univ, Coll Food Engn, Changchun 130052, Peoples R China
来源
LIFE-BASEL | 2022年 / 12卷 / 02期
基金
国家重点研发计划;
关键词
heart cell; single-cell profiles; machine learning method; biomarker; decision rule; NATRIURETIC PEPTIDES; MAMMALIAN HEART; ATRIAL; MUTATIONS; PROTEINS; GENE; TTN;
D O I
10.3390/life12020228
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The heart is an essential organ in the human body. It contains various types of cells, such as cardiomyocytes, mesothelial cells, endothelial cells, and fibroblasts. The interactions between these cells determine the vital functions of the heart. Therefore, identifying the different cell types and revealing the expression rules in these cell types are crucial. In this study, multiple machine learning methods were used to analyze the heart single-cell profiles with 11 different heart cell types. The single-cell profiles were first analyzed via light gradient boosting machine method to evaluate the importance of gene features on the profiling dataset, and a ranking feature list was produced. This feature list was then brought into the incremental feature selection method to identify the best features and build the optimal classifiers. The results suggested that the best decision tree (DT) and random forest classification models achieved the highest weighted F1 scores of 0.957 and 0.981, respectively. The selected features, such as NPPA, LAMA2, DLC1, and the classification rules extracted from the optimal DT classifier played a crucial role in cardiac structure and function in recent research and enrichment analysis. In particular, some lncRNAs (LINC02019, NEAT1) were found to be quite important for the recognition of different cardiac cell types. In summary, these findings provide a solid academic foundation for the development of molecular diagnostics and biomarker discovery for cardiac diseases.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Towards comparing and using Machine Learning techniques for detecting and predicting Heart Attack and Diseases
    Obasi, Thankgod
    Shafiq, M. Omair
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2393 - 2402
  • [42] A Stacking Method for Predicting Patient QA Passing Rates Using Machine Learning
    Lam, D.
    Dvergsten, T.
    Zhao, T.
    Yang, D.
    Mutic, S.
    Sun, B.
    MEDICAL PHYSICS, 2018, 45 (06) : E669 - E669
  • [43] Predicting electronic stopping powers using stacking ensemble machine learning method
    Akbari, Fatemeh
    Taghizadeh, Somayeh
    Shvydka, Diana
    Sperling, Nicholas Niven
    Parsai, E. Ishmael
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION B-BEAM INTERACTIONS WITH MATERIALS AND ATOMS, 2023, 538 : 8 - 16
  • [44] Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types
    Ma, Qinglan
    Zhang, Yu-Hang
    Guo, Wei
    Feng, Kaiyan
    Huang, Tao
    Cai, Yu-Dong
    LIFE-BASEL, 2024, 14 (08):
  • [45] Predicting single-cell gene expression profiles of imaging flow cytometry data with machine learning
    Chlis, Nikolaos-Kosmas
    Rausch, Lisa
    Brocker, Thomas
    Kranich, Jan
    Theis, Fabian J.
    NUCLEIC ACIDS RESEARCH, 2020, 48 (20) : 11335 - 11346
  • [46] Predicting the Evolution of Cancer Stem Cell Subtypes Using a Machine Learning Framework
    Nikolic, D.
    Ljujic, B.
    Hamzagic, A. Ramovic
    Jankovic, M. Gazdic
    Miric, A.
    Virijevic, K.
    Seklic, D.
    Jovanovic, M.
    Kastratovic, N.
    Petrovic, I.
    Jurisic, V.
    Milivojevic, N.
    Zivanovic, M.
    Filipovic, N.
    APPLIED ARTIFICIAL INTELLIGENCE 2: MEDICINE, BIOLOGY, CHEMISTRY, FINANCIAL, GAMES, ENGINEERING, SICAAI 2023, 2024, 999 : 185 - 189
  • [47] Predicting cell phone adoption metrics using machine learning and satellite imagery
    Oughton, Edward J.
    Mathur, Jatin
    TELEMATICS AND INFORMATICS, 2021, 62
  • [48] Classification of Cancerous Profiles using Machine Learning
    Sharma, Aman
    Rani, Rinkle
    2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA SCIENCE (MLDS 2017), 2017, : 31 - 36
  • [49] Classification of Diabetes Types using Machine Learning
    Adigun, Oyeranmi
    Oyeranm, Folasade
    Yekini, Nureni
    Babatunde, Ronke
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 152 - 161
  • [50] Predicting Employee Absence from Historical Absence Profiles with Machine Learning
    Zupancic, Peter
    Panov, Pance
    APPLIED SCIENCES-BASEL, 2024, 14 (16):