Identification of Colon Immune Cell Marker Genes Using Machine Learning Methods

被引:0
|
作者
Yang, Yong [1 ]
Zhang, Yuhang [2 ]
Ren, Jingxin [3 ]
Feng, Kaiyan [4 ]
Li, Zhandong [5 ]
Huang, Tao [6 ,7 ]
Cai, Yudong [3 ]
机构
[1] Qianwei Hosp Jilin Prov, Changchun 130012, Peoples R China
[2] Harvard Med Sch, Brigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
[3] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[4] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou 510507, Peoples R China
[5] Jilin Engn Normal Univ, Coll Biol & Food Engn, Changchun 130052, Peoples R China
[6] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[7] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai 200031, Peoples R China
来源
LIFE-BASEL | 2023年 / 13卷 / 09期
关键词
colon immune cell; marker gene; machine learning; feature selection; NF-KAPPA-B; INFLAMMATORY FACTOR-I; J-CHAIN; FEATURE-SELECTION; T-CELLS; CANCER; ACTIVATION; EXPRESSION; PROTEIN; DRUG;
D O I
10.3390/life13091876
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Immune cell infiltration that occurs at the site of colon tumors influences the course of cancer. Different immune cell compositions in the microenvironment lead to different immune responses and different therapeutic effects. This study analyzed single-cell RNA sequencing data in a normal colon with the aim of screening genetic markers of 25 candidate immune cell types and revealing quantitative differences between them. The dataset contains 25 classes of immune cells, 41,650 cells in total, and each cell is expressed by 22,164 genes at the expression level. They were fed into a machine learning-based stream. The five feature ranking algorithms (last absolute shrinkage and selection operator, light gradient boosting machine, Monte Carlo feature selection, minimum redundancy maximum relevance, and random forest) were first used to analyze the importance of gene features, yielding five feature lists. Then, incremental feature selection and two classification algorithms (decision tree and random forest) were combined to filter the most important genetic markers from each list. For different immune cell subtypes, their marker genes, such as KLRB1 in CD4 T cells, RPL30 in B cell IGA plasma cells, and JCHAIN in IgG producing B cells, were identified. They were confirmed to be differentially expressed in different immune cells and involved in immune processes. In addition, quantitative rules were summarized by using the decision tree algorithm to distinguish candidate immune cell types. These results provide a reference for exploring the cell composition of the colon cancer microenvironment and for clinical immunotherapy.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Identification of Immune-Related Genes in Patients with Acute Myocardial Infarction Using Machine Learning Methods
    Zhu, Xu
    Yin, Ting
    Zhang, Ting
    Zhu, Qingqing
    Lu, Xinyi
    Wang, Luyang
    Liao, Shengen
    Yao, Wenming
    Zhou, Yanli
    Zhang, Haifeng
    Li, Xinli
    JOURNAL OF INFLAMMATION RESEARCH, 2022, 15 : 3305 - 3321
  • [2] Identification of new marker genes from plant single-cell RNA-seq data using interpretable machine learning methods
    Yan, Haidong
    Lee, Jiyoung
    Song, Qi
    Li, Qi
    Schiefelbein, John
    Zhao, Bingyu
    Li, Song
    NEW PHYTOLOGIST, 2022, 234 (04) : 1507 - 1520
  • [3] Identification of marker genes in Alzheimer's disease using a machine-learning model
    Madar, Inamul Hasan
    Sultan, Ghazala
    Tayubi, Iftikhar Aslam
    Hasan, Atif Noorul
    Pahi, Bandana
    Rai, Anjali
    Sivanandan, Pravitha Kasu
    Loganathan, Tamizhini
    Begum, Mahamuda
    Rai, Sneha
    BIOINFORMATION, 2021, 17 (02) : 348 - 355
  • [4] Identification of the diagnostic genes and immune cell infiltration characteristics of gastric cancer using bioinformatics analysis and machine learning
    Xie, Rongjun
    Liu, Longfei
    Lu, Xianzhou
    He, Chengjian
    Li, Guoxin
    FRONTIERS IN GENETICS, 2023, 13
  • [5] AN AUTOMATED IMAGE ANALYSIS AND CELL IDENTIFICATION SYSTEM USING MACHINE LEARNING METHODS
    Itano, Keiko
    Ochiai, Koji
    Takahashi, Koichi
    Matsushima, Takahide
    Asahara, Hiroshi
    PROCEEDINGS OF THE 2020 INTERNATIONAL SYMPOSIUM ON FLEXIBLE AUTOMATION (ISFA2020), 2020,
  • [6] Machine Learning-Based Identification of Colon Cancer Candidate Diagnostics Genes
    Koppad, Saraswati
    Basava, Annappa
    Nash, Katrina
    Gkoutos, Georgios, V
    Acharjee, Animesh
    BIOLOGY-BASEL, 2022, 11 (03):
  • [7] Identification of key immune genes of osteoporosis based on bioinformatics and machine learning
    Hao, Song
    Mao, Xinqi
    Xu, Weicheng
    Yang, Shiwei
    Cao, Lumin
    Xiao, Wang
    Dong, Liu
    Jun, Hua
    FRONTIERS IN ENDOCRINOLOGY, 2023, 14
  • [8] Identification of hub genes and their correlation with immune infiltration in coronary artery disease through bioinformatics and machine learning methods
    Huang, Ke-Ke
    Zheng, Hui-Lei
    Li, Shuo
    Zeng, Zhi-Yu
    JOURNAL OF THORACIC DISEASE, 2022, 14 (07) : 2621 - 2634
  • [9] Identification the immune related marker genes and transcription-factor network in ruptured cerebral aneurysms using bioinformatics analysis and machine-learning strategies
    Zhao, Xiang
    Fu, Jinxing
    Lei, Chao
    Wang, Zhaochen
    Jing, Zhitao
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
  • [10] Identification of immune-related endoplasmic reticulum stress genes in sepsis using bioinformatics and machine learning
    Gong, Ting
    Liu, Yongbin
    Tian, Zhiyuan
    Zhang, Min
    Gao, Hejun
    Peng, Zhiyong
    Yin, Shuang
    Cheung, Chi Wai
    Liu, Youtan
    FRONTIERS IN IMMUNOLOGY, 2022, 13