Identification of Colon Immune Cell Marker Genes Using Machine Learning Methods

被引:0
|
作者
Yang, Yong [1 ]
Zhang, Yuhang [2 ]
Ren, Jingxin [3 ]
Feng, Kaiyan [4 ]
Li, Zhandong [5 ]
Huang, Tao [6 ,7 ]
Cai, Yudong [3 ]
机构
[1] Qianwei Hosp Jilin Prov, Changchun 130012, Peoples R China
[2] Harvard Med Sch, Brigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
[3] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[4] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou 510507, Peoples R China
[5] Jilin Engn Normal Univ, Coll Biol & Food Engn, Changchun 130052, Peoples R China
[6] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[7] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai 200031, Peoples R China
来源
LIFE-BASEL | 2023年 / 13卷 / 09期
关键词
colon immune cell; marker gene; machine learning; feature selection; NF-KAPPA-B; INFLAMMATORY FACTOR-I; J-CHAIN; FEATURE-SELECTION; T-CELLS; CANCER; ACTIVATION; EXPRESSION; PROTEIN; DRUG;
D O I
10.3390/life13091876
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Immune cell infiltration that occurs at the site of colon tumors influences the course of cancer. Different immune cell compositions in the microenvironment lead to different immune responses and different therapeutic effects. This study analyzed single-cell RNA sequencing data in a normal colon with the aim of screening genetic markers of 25 candidate immune cell types and revealing quantitative differences between them. The dataset contains 25 classes of immune cells, 41,650 cells in total, and each cell is expressed by 22,164 genes at the expression level. They were fed into a machine learning-based stream. The five feature ranking algorithms (last absolute shrinkage and selection operator, light gradient boosting machine, Monte Carlo feature selection, minimum redundancy maximum relevance, and random forest) were first used to analyze the importance of gene features, yielding five feature lists. Then, incremental feature selection and two classification algorithms (decision tree and random forest) were combined to filter the most important genetic markers from each list. For different immune cell subtypes, their marker genes, such as KLRB1 in CD4 T cells, RPL30 in B cell IGA plasma cells, and JCHAIN in IgG producing B cells, were identified. They were confirmed to be differentially expressed in different immune cells and involved in immune processes. In addition, quantitative rules were summarized by using the decision tree algorithm to distinguish candidate immune cell types. These results provide a reference for exploring the cell composition of the colon cancer microenvironment and for clinical immunotherapy.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] A Brief Survey for MicroRNA Precursor Identification Using Machine Learning Methods
    Guan, Zheng-Xing
    Li, Shi-Hao
    Zhang, Zi-Mei
    Zhang, Dan
    Yang, Hui
    Ding, Hui
    CURRENT GENOMICS, 2020, 21 (01) : 11 - 25
  • [32] Identification and validation of immune and cuproptosis-related genes for diabetic nephropathy by WGCNA and machine learning
    Chen, Yubing
    Wu, Zhan
    NEPHROLOGY DIALYSIS TRANSPLANTATION, 2024, 39
  • [33] Identification of Immune-Related Risk Genes in Osteoarthritis Based on Bioinformatics Analysis and Machine Learning
    Xu, Jintao
    Chen, Kai
    Yu, Yaohui
    Wang, Yishu
    Zhu, Yi
    Zou, Xiangjie
    Jiang, Yiqiu
    JOURNAL OF PERSONALIZED MEDICINE, 2023, 13 (02):
  • [34] Identification and validation of immune and cuproptosis-related genes for diabetic nephropathy by WGCNA and machine learning
    Chen, Yubing
    Wu, Zhan
    NEPHROLOGY DIALYSIS TRANSPLANTATION, 2024, 39 : I2749 - I2749
  • [35] Identification of Immune-Related Biomarkers of Schizophrenia in the Central Nervous System Using Bioinformatic Methods and Machine Learning Algorithms
    Weng, Jianjun
    Zhu, Xiaoli
    Ouyang, Yu
    Liu, Yanqing
    Lu, Hongmei
    Yao, Jiakui
    Pan, Bo
    MOLECULAR NEUROBIOLOGY, 2025, 62 (03) : 3226 - 3243
  • [36] Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types
    Ma, Qinglan
    Zhang, Yu-Hang
    Guo, Wei
    Feng, Kaiyan
    Huang, Tao
    Cai, Yu-Dong
    LIFE-BASEL, 2024, 14 (08):
  • [37] Using machine learning algorithms to identify genes essential for cell survival
    Philips, Santosh
    Wu, Heng-Yi
    Li, Lang
    BMC BIOINFORMATICS, 2017, 18
  • [38] Machine learning methods revealed the roles of immune-metabolism related genes in immune infiltration, stemness, and prognosis of neuroblastoma
    Mu, Jianhua
    Gong, Jianan
    Lin, Peng
    Zhang, Mengzhen
    Wu, Kai
    CANCER BIOMARKERS, 2023, 38 (02) : 241 - 259
  • [39] Using machine learning algorithms to identify genes essential for cell survival
    Santosh Philips
    Heng-Yi Wu
    Lang Li
    BMC Bioinformatics, 18
  • [40] Robust prognostic signature in colon cancer by studying ectopically expressed genes using machine learning
    Valmary-Degano, S.
    Spinelli, A.
    Vitte, A.
    Chuffart, F.
    Laverriere, M.
    Abba, J.
    Rousseaux, S.
    Khochbin, S.
    Bourova-Flin, E.
    VIRCHOWS ARCHIV, 2024, 485 : S161 - S162