WGCNA and Machine Learning-Based Integrative Bioinformatics Analysis for Identifying Key Genes of Colorectal Cancer

被引:1
|
作者
Al Mehedi Hasan, Md. [1 ]
Maniruzzaman, Md. [2 ,3 ]
Shin, Jungpil [3 ]
机构
[1] Rajshahi Univ Engn & Technol, Dept Comp Sci & Engn, Rajshahi 6204, Bangladesh
[2] Khulna Univ, Stat Discipline, Khulna 9208, Bangladesh
[3] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, 9658580, Japan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Training; Bioinformatics; Biomarkers; Proteins; Support vector machines; Object recognition; Network analyzers; Gene expression; Databases; Correlation; Colorectal cancer; Machine learning; WGCNA; machine learning-based models; differentially expressed discriminative genes; bioinformatics analysis; key genes; CARCINOMA; PROGNOSIS; ONTOLOGY;
D O I
10.1109/ACCESS.2024.3472688
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Colorectal cancer (CC) is a significant public health concern and make it necessary to identify reliable biomarkers and elucidate their molecular and biological mechanisms. This study proposed a system by integrating weighted gene co-expression network analysis (WGCNA) and machine learning-based integrative bioinformatics (ML-IB) analysis to identify key genes for CC. WGCNA was implemented to find a co-expression network of genes and identify important genes by intersecting gene sets obtained using module membership and gene significance criteria across datasets. WGCNA-based significant genes were determined by intersecting important genes between two datasets. ML-IB based approach primarily identified differentially expressed genes (DEGs), then employed support vector machine to determine differentially expressed discriminative genes (DEDGs) and took their common DEDGs across datasets. Protein-protein interaction networks were built and identified hub genes based on the degrees of connectivity and hub module genes using MCODE scores. The ML-IB based significant genes were determined by intersecting hub genes and hub module genes. Four common significant genes were found by intersecting significant genes derived from WGCNA and ML-IB based perspectives. Finally, two genes (AURKA and CCNA2) were determined as key genes for showing strong correlation with survival of CC patients and validated their discriminative capability on an independent test dataset using AUC analysis. The key genes of AURKA and CCNA2 may be used for the early detection of patients with CC. This study will helpful for physicians and doctors to determine and understand the associated the molecular mechanisms and pathway of patients with CC.
引用
收藏
页码:144350 / 144363
页数:14
相关论文
共 50 条
  • [31] Identification of key pathways and genes in mutant KRAS colorectal cancer by integrated bioinformatics analysis
    Zhang, Haiyan
    Zhang, Xiaoming
    Chen, Xiaodong
    Zhang, Wei
    Xian, Jiang
    Zhou, Xia
    Yang, Jun
    Wang, Jie
    ACTA BIOCHIMICA ET BIOPHYSICA SINICA, 2018, 50 (06) : 615 - 617
  • [32] Bioinformatics Analysis Identifying Key Biomarkers in Bladder Cancer
    Zhang, Chuan
    Berndt-Paetz, Mandy
    Neuhaus, Jochen
    DATA, 2020, 5 (02)
  • [33] Identification of key genes associated with cervical cancer based on bioinformatics analysis
    Yang, Xinmeng
    Zhou, Mengsi
    Luan, Yingying
    Li, Kanghua
    Wang, Yafen
    Yang, Xiaofeng
    BMC CANCER, 2024, 24 (01)
  • [34] Identifying feature genes of chickens with different feather pecking tendencies based on three machine learning algorithms and WGCNA
    Wen, Jiying
    Yang, Shenglin
    Zhu, Jinjin
    Liu, Ai
    Tan, Qisong
    Rao, Yifu
    FRONTIERS IN VETERINARY SCIENCE, 2024, 11
  • [35] Machine learning-based analysis of programmed cell death types and key genes in intervertebral disc degeneration
    Lv, Yigang
    Du, Jiawei
    Xiong, Haoning
    Feng, Lei
    Zhang, Di
    Zhou, Hengxing
    Feng, Shiqing
    APOPTOSIS, 2025, 30 (1-2) : 250 - 266
  • [36] Identification of Key Genes and Key Pathways in Breast Cancer Based on Machine Learning
    Bao, Shurui
    He, Guijin
    MEDICAL SCIENCE MONITOR, 2022, 28
  • [37] Identification of key genes and their correlation with immune infiltration in osteoarthritis using integrative bioinformatics approaches and machine-learning strategies
    Xia, Duo
    Wang, Jing
    Yang, Shu
    Jiang, Cancai
    Yao, Jun
    MEDICINE, 2023, 102 (46) : E35355
  • [38] Machine learning-based classifiers to predict metastasis in colorectal cancer patients
    Talebi, Raheleh
    Celis-Morales, Carlos A.
    Akbari, Abolfazl
    Talebi, Atefeh
    Borumandnia, Nasrin
    Pourhoseingholi, Mohamad Amin
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [39] Bioinformatics and machine learning driven key genes screening for hepatocellular carcinoma
    Shen, Ye
    Huang, Juanjie
    Jia, Lei
    Zhang, Chi
    Xu, Jianxing
    BIOCHEMISTRY AND BIOPHYSICS REPORTS, 2024, 37
  • [40] An Integrative Bioinformatics Analysis of Microarray Data for Identifying Differentially Expressed Genes in Preeclampsia
    Song, L. M.
    Long, M.
    Song, S. J.
    Wang, J. R.
    Zhao, G. W.
    Zhao, N.
    RUSSIAN JOURNAL OF GENETICS, 2022, 58 (07) : 866 - 875