Towards future directions in data-integrative supervised prediction of human aging-related genes

被引:1
|
作者
Li, Qi [1 ,2 ]
Newaz, Khalique [1 ,2 ,3 ]
Milenkovic, Tijana [1 ,2 ]
机构
[1] Univ Notre Dame, Lucy Family Inst Data & Soc, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Univ Notre Dame, Eck Inst Global Hlth EIGH, Notre Dame, IN 46556 USA
[3] Univ Hamburg, Inst Computat Syst Biol, Ctr Data & Comp Nat Sci CDCS, D-20146 Hamburg, Germany
来源
BIOINFORMATICS ADVANCES | 2022年 / 2卷 / 01期
基金
美国国家科学基金会;
关键词
NETWORKS;
D O I
10.1093/bioadv/vbac081
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Motivation Identification of human genes involved in the aging process is critical due to the incidence of many diseases with age. A state-of-the-art approach for this purpose infers a weighted dynamic aging-specific subnetwork by mapping gene expression (GE) levels at different ages onto the protein-protein interaction network (PPIN). Then, it analyzes this subnetwork in a supervised manner by training a predictive model to learn how network topologies of known aging- versus non-aging-related genes change across ages. Finally, it uses the trained model to predict novel aging-related gene candidates. However, the best current subnetwork resulting from this approach still yields suboptimal prediction accuracy. This could be because it was inferred using outdated GE and PPIN data. Here, we evaluate whether analyzing a weighted dynamic aging-specific subnetwork inferred from newer GE and PPIN data improves prediction accuracy upon analyzing the best current subnetwork inferred from outdated data.Results Unexpectedly, we find that not to be the case. To understand this, we perform aging-related pathway and Gene Ontology term enrichment analyses. We find that the suboptimal prediction accuracy, regardless of which GE or PPIN data is used, may be caused by the current knowledge about which genes are aging-related being incomplete, or by the current methods for inferring or analyzing an aging-specific subnetwork being unable to capture all of the aging-related knowledge. These findings can potentially guide future directions towards improving supervised prediction of aging-related genes via -omics data integration.Availability and implementation All data and code are available at zenodo, DOI: 10.5281/zenodo.6995045.Supplementary information are available at Bioinformatics Advances online.
引用
收藏
页数:10
相关论文
共 25 条
  • [21] Genome-wide prediction and prioritization of human aging genes by data fusion: a machine learning approach
    Masoud Arabfard
    Mina Ohadi
    Vahid Rezaei Tabar
    Ahmad Delbari
    Kaveh Kavousi
    BMC Genomics, 20
  • [22] Genome-wide prediction and prioritization of human aging genes by data fusion: a machine learning approach
    Arabfard, Masoud
    Ohadi, Mina
    Tabar, Vahid Rezaei
    Delbari, Ahmad
    Kavousi, Kaveh
    BMC GENOMICS, 2019, 20 (01)
  • [23] DNA Macroarray Study of Skin Aging-related Genes Expression Modulation by Antioxidant Plant Extracts on A Replicative Senescence Model of Human Dermal Fibroblasts
    Dudonne, Stephanie
    Coutiere, Philippe
    Woillez, Marion
    Merillon, Jean-Michel
    Vitrac, Xavier
    PHYTOTHERAPY RESEARCH, 2011, 25 (05) : 686 - 693
  • [24] Silencing Bach1 alters aging-related changes in the expression of Nrf2-regulated genes in primary human bronchial epithelial cells
    Zhang, Hongqiao
    Zhou, Lulu
    Davies, Kelvin J. A.
    Forman, Henry Jay
    ARCHIVES OF BIOCHEMISTRY AND BIOPHYSICS, 2019, 672
  • [25] A repository of microbial marker genes related to human health and diseases for host phenotype prediction using microbiome data
    Han, Wontack
    Ye, Yuzhen
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2019, 2019, : 236 - 247