WGCNA and Machine Learning-Based Integrative Bioinformatics Analysis for Identifying Key Genes of Colorectal Cancer

Cited by: 1
Authors
Al Mehedi Hasan, Md. [1]
Maniruzzaman, Md. [2, 3]
Shin, Jungpil [3]
Affiliations
[1] Rajshahi Univ Engn & Technol, Dept Comp Sci & Engn, Rajshahi 6204, Bangladesh
[2] Khulna Univ, Stat Discipline, Khulna 9208, Bangladesh
[3] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, 9658580, Japan
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Training; Bioinformatics; Biomarkers; Proteins; Support vector machines; Object recognition; Network analyzers; Gene expression; Databases; Correlation; Colorectal cancer; Machine learning; WGCNA; machine learning-based models; differentially expressed discriminative genes; bioinformatics analysis; key genes; CARCINOMA; PROGNOSIS; ONTOLOGY;
DOI
10.1109/ACCESS.2024.3472688
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Colorectal cancer (CC) is a significant public health concern, which makes it necessary to identify reliable biomarkers and elucidate their molecular and biological mechanisms. This study proposes a system that integrates weighted gene co-expression network analysis (WGCNA) and machine learning-based integrative bioinformatics (ML-IB) analysis to identify key genes for CC. WGCNA was used to build a gene co-expression network and to identify important genes by intersecting the gene sets obtained with the module membership and gene significance criteria across datasets. The WGCNA-based significant genes were then determined by intersecting the important genes of the two datasets. The ML-IB approach first identified differentially expressed genes (DEGs), then employed a support vector machine to determine differentially expressed discriminative genes (DEDGs), and took the DEDGs common across datasets. Protein-protein interaction networks were built, and hub genes were identified based on degree of connectivity and hub module genes based on MCODE scores. The ML-IB-based significant genes were determined by intersecting the hub genes and the hub module genes. Four common significant genes were found by intersecting the significant genes derived from the WGCNA and ML-IB perspectives. Finally, two genes (AURKA and CCNA2) were determined to be key genes because they showed a strong correlation with the survival of CC patients, and their discriminative capability was validated on an independent test dataset using AUC analysis. The key genes AURKA and CCNA2 may be used for the early detection of CC. This study will be helpful for physicians in determining and understanding the molecular mechanisms and pathways associated with CC.
Pages: 144350 - 144363
Page count: 14
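The abstract describes a chain of gene-set intersections plus a degree-based hub ranking. The following is a minimal Python sketch of that set logic only, not the authors' implementation. The function names, the `top_k` cutoff, and the placeholder gene identifiers (`GENE_A` and so on) are all assumptions made for illustration. The hub module genes are supplied by hand as a stand-in for MCODE output, and the DEG, SVM, survival, and AUC steps are not modeled.

```python
from collections import Counter


def intersect_all(gene_sets):
    """Return the genes that appear in every set (empty set if none are given)."""
    sets = [set(s) for s in gene_sets]
    return set.intersection(*sets) if sets else set()


def hub_genes_by_degree(edges, top_k):
    """Return the top_k genes by degree in an undirected edge list.

    Assumes each interaction is listed once; duplicates would be double counted.
    """
    degree = Counter()
    for a, b in edges:
        if a != b:  # ignore self-loops
            degree[a] += 1
            degree[b] += 1
    # Sort by descending degree, then by name so ties break deterministically.
    ranked = sorted(degree, key=lambda g: (-degree[g], g))
    return set(ranked[:top_k])


# Hypothetical toy inputs: placeholder identifiers, not results from the paper.
wgcna_important_by_dataset = [
    {"GENE_A", "GENE_B", "GENE_C", "GENE_D"},
    {"GENE_A", "GENE_B", "GENE_C", "GENE_E"},
]
dedgs_by_dataset = [
    {"GENE_A", "GENE_B", "GENE_C", "GENE_F", "GENE_G"},
    {"GENE_A", "GENE_B", "GENE_C", "GENE_F", "GENE_H"},
]
ppi_edges = [  # hypothetical interactions among the common DEDGs
    ("GENE_A", "GENE_B"),
    ("GENE_A", "GENE_C"),
    ("GENE_A", "GENE_F"),
    ("GENE_B", "GENE_C"),
    ("GENE_B", "GENE_F"),
]
hub_module_genes = {"GENE_A", "GENE_B", "GENE_F"}  # stand-in for MCODE output

# WGCNA branch: important genes shared by both datasets.
wgcna_significant = intersect_all(wgcna_important_by_dataset)

# ML-IB branch: common DEDGs, then hub genes intersected with hub module genes.
common_dedgs = intersect_all(dedgs_by_dataset)
hub_genes = hub_genes_by_degree(ppi_edges, top_k=3)
ml_ib_significant = hub_genes & hub_module_genes

# Final step: genes supported by both branches.
common_significant = wgcna_significant & ml_ib_significant
print(sorted(common_significant))
```

Because every stage is an intersection, a gene must pass every filter in both branches to reach the final list, which is why the candidate set shrinks at each stage.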