Explore key genes of Crohn's disease based on glycerophospholipid metabolism: A comprehensive analysis Utilizing Mendelian Randomization, Multi-Omics integration, Machine Learning, and SHAP methodology

被引:1
|
作者
Chen, Changan [1 ]
Quan, Juanhua [1 ]
Chen, Xintian [1 ]
Yang, Tingmei [1 ]
Yu, Caiyuan [1 ]
Ye, Shicai [1 ]
Yang, Yuping [1 ]
Wu, Xiu [1 ]
Jiang, Danxian [2 ]
Weng, Yijie [1 ]
机构
[1] Guangdong Med Univ, Dept Gastroenterol, Affiliated Hosp, Zhanjiang, Guangdong, Peoples R China
[2] Guangdong Med Univ, Dept Med Oncol, Affiliated Hosp, Zhanjiang, Guangdong, Peoples R China
关键词
Multi-Omics; Mendelian Randomization; Machine Learning; SHAP; Crohn's disease; GENOME-WIDE ASSOCIATION; INDUCED COLITIS; PHOSPHATIDYLCHOLINE; PHOSPHOINOSITIDES; EPIDEMIOLOGY; INFLAMMATION; EXPRESSION; PROFILES;
D O I
10.1016/j.intimp.2024.112905
中图分类号
R392 [医学免疫学]; Q939.91 [免疫学];
学科分类号
100102 ;
摘要
Background and Aims: Crohn's disease (CD) is a chronic, complex inflammatory condition with increasing incidence and prevalence worldwide. However, the causes of CD remain incompletely understood. We identified CD- related metabolites, inflammatory factors, and key genes by Mendelian randomization (MR), multi-omics integration, machine learning (ML), and SHAP. Methods: We first performed a mediation MR analysis on 1400 serum metabolites, 91 inflammatory factors, and CD. We found that certain phospholipids are causally related to CD. In the scRNA-seq data, monocytes were categorized into high and low metabolism groups based on their glycerophospholipid metabolism scores. The differentially expressed genes of these two groups of cells were extracted, and transcription factor prediction, cell communication analysis, and GSEA analysis were performed. After further screening of differentially expressed genes (FDR<0.05, log2FC>1), least absolute shrinkage and selection operator (LASSO) regression was performed to obtain hub genes. Models for hub genes were built using the Catboost, XGboost, and NGboost methods. Further, we used the SHAP method to interpret the models and obtain the gene with the highest contribution to each model. Finally, qRT-PCR was used to verify the expression of these genes in the peripheral blood mononuclear cells (PBMC) of CD patients and healthy subjects. Result: MR results showed 1-palmitoyl-2-stearoyl-gpc (16:0/18:0) levels, 1-stearoyl-2-arachidonoyl-GPI (18:0/ 20:4) levels, 1-arachidonoyl-gpc (20:4n6) levels, 1-palmitoyl-2-arachidonoyl-gpc (16:0/20:4n6) levels, and 1arachidonoyl-GPE (20:4n6) levels were significantly associated with CD risk reduction (FDR<0.05), with CXCL9 acting as a mediation between these phospholipids and CD. The analysis identified 19 hub genes, with Catboost, XGboost, and NGboost achieving AUC of 0.91, 0.88, and 0.85, respectively. The SHAP methodology obtained the three genes with the highest model contribution: G0S2, S100A8, and PLAUR. The qRT-PCR results showed that the expression levels of S100A8 (p = 0.0003), G0S2 (p < 0.0001), and PLAUR (p = 0.0141) in the PBMC of CD patients were higher than healthy subjects. Conclusion: MR findings suggest that certain phospholipids may lower CD risk. G0S2, S100A8, and PLAUR may be potential pathogenic genes in CD. These phospholipids and genes could serve as novel diagnostic and therapeutic targets for CD.
引用
收藏
页数:13
相关论文
共 26 条
  • [21] Uncovering the epigenetic regulatory clues of PRRT1 in Alzheimer's disease: a strategy integrating multi-omics analysis with explainable machine learning
    Wang, Fang
    Liang, Ying
    Wang, Qin-Wen
    ALZHEIMERS RESEARCH & THERAPY, 2025, 17 (01)
  • [22] Multi-omics and single-cell analysis reveals machine learning-based pyrimidine metabolism-related signature in the prognosis of patients with lung adenocarcinoma
    Hu, Tong
    Shi, Run
    Xu, Yangyue
    Xu, Tingting
    Fang, Yuan
    Gu, Yunru
    Zhou, Zhaokai
    Shu, Yongqian
    INTERNATIONAL JOURNAL OF MEDICAL SCIENCES, 2025, 22 (06): : 1375 - 1392
  • [23] Integration of single-cell sequencing with machine learning and Mendelian randomization analysis identifies the NAP1L1 gene as a predictive biomarker for Alzheimer's disease
    Chen, Runming
    Xie, Yujun
    Chang, Ze
    Hu, Wenyue
    Han, Zhenyun
    FRONTIERS IN AGING NEUROSCIENCE, 2024, 16
  • [24] Prognostic value of anoikis-related genes revealed using multi-omics analysis and machine learning based on lower-grade glioma features and tumor immune microenvironment
    Linazi, Gu
    Maimaiti, Aierpati
    Abulaiti, Zulihuma
    Shi, Hui
    Zhou, Zexin
    Aisa, Mizhati Yimiti
    Kang, Yali
    Abulimiti, Ayguzaili
    Dilimulati, Xierzhati
    Zhang, Tiecheng
    Wusiman, Patiman
    Wang, Zengliang
    Abulaiti, Aimitaji
    HELIYON, 2024, 10 (17)
  • [25] Absence of coding somatic single nucleotide variants within well-known candidate genes in late-onset sporadic Alzheimer's Disease based on the analysis of multi-omics data
    Min, Shishi
    Li, Zongchang
    Shieh, Annie
    Giase, Gina
    Bao, Riyue
    Zhang, Chunling
    Kuney, Liz
    Kopp, Richard
    Asif, Huma
    Alliey-Rodriguez, Ney
    Qin, Lixia
    Craig, David Wesley
    Faulkner, Geoffrey J.
    Gershon, Elliot S.
    Tang, Beisha
    Chen, Chao
    Liu, Chunyu
    NEUROBIOLOGY OF AGING, 2021, 108 : 207 - 209
  • [26] Ustekinumab affects myofibroblast metabolism to alleviate intestinal fibrosis by targeting KDELC1 in Crohn's disease through multi-machine learning combined with single-cell sequencing analysis
    Ma, Su
    Kang, Yongming
    Yang, Zhonglin
    Ji, Xingyu
    Chen, Rui
    Sun, Xiaomei
    FRONTIERS IN MEDICINE, 2024, 11