Improved scoring of functional groups from gene expression data by decorrelating GO graph structure

被引:1548
|
作者
Alexa, Adrian [1 ]
Rahnenfuehrer, Joerg [1 ]
Lengauer, Thomas [1 ]
机构
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
关键词
D O I
10.1093/bioinformatics/btl140
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The result of a typical microarray experiment is a long list of genes with corresponding expression measurements. This list is only the starting point for a meaningful biological interpretation. Modern methods identify relevant biological processes or functions from gene expression data by scoring the statistical significance of predefined functional gene groups, e.g. based on Gene Ontology (GO). We develop methods that increase the explanatory power of this approach by integrating knowledge about relationships between the GO terms into the calculation of the statistical significance. Results: We present two novel algorithms that improve GO group scoring using the underlying GO graph topology. The algorithms are evaluated on real and simulated gene expression data. We show that both methods eliminate local dependencies between GO terms and point to relevant areas in the GO graph that remain undetected with state-of-the-art algorithms for scoring functional terms. A simulation study demonstrates that the new methods exhibit a higher level of detecting relevant biological terms than competing methods.
引用
收藏
页码:1600 / 1607
页数:8
相关论文
共 50 条
  • [31] A Hierarchical Graph Convolution Network for Representation Learning of Gene Expression Data
    Tan, Kaiwen
    Huang, Weixian
    Liu, Xiaofeng
    Hu, Jinlong
    Dong, Shoubin
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (08) : 3219 - 3229
  • [32] Clustering gene expression data with a penalized graph-based metric
    Baya, Ariel E.
    Granitto, Pablo M.
    BMC BIOINFORMATICS, 2011, 12
  • [33] Clustering gene expression data with a penalized graph-based metric
    Ariel E Bayá
    Pablo M Granitto
    BMC Bioinformatics, 12
  • [34] Gene expression data clustering based on graph regularized subspace segmentation
    Chen, Xiaoyun
    Jian, Cairen
    NEUROCOMPUTING, 2014, 143 : 44 - 50
  • [35] A graph-theoretic classification of gene expression microarray data of cancer
    Kim, Saejoon
    PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007, : 179 - 182
  • [36] Gene regulatory network inference from gene expression data based on knowledge matrix and improved rotation forest
    Emadi, Marzieh
    Boroujeni, Farsad Zamani
    Pirgazi, Jamshid
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
  • [37] Inferring Unknown Biological Function by Integration of GO Annotations and Gene Expression Data
    Leale, Guillermo
    Emilio Baya, Ariel
    Milone, Diego H.
    Granitto, Pablo M.
    Stegmayer, Georgina
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (01) : 168 - 180
  • [38] Improved tree view for visualising microarray gene expression data
    Prasad T.V.
    Ahson S.I.
    International Journal of Information and Communication Technology, 2010, 2 (04) : 323 - 330
  • [39] Improved KNN Imputation for Missing Values in Gene Expression Data
    Keerin, Phimmarin
    Boongoen, Tossapon
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (02): : 4009 - 4025
  • [40] An improved FMM neural network for classification of gene expression data
    Juan, Liu
    Fei, Luo
    Yongqiong, Zhu
    FUZZY INFORMATION AND ENGINEERING, PROCEEDINGS, 2007, 40 : 65 - +