Improved scoring of functional groups from gene expression data by decorrelating GO graph structure

Cited by: 1548
Authors
Alexa, Adrian [1]
Rahnenfuehrer, Joerg [1]
Lengauer, Thomas [1]
Affiliations
[1] Max Planck Institute for Informatics, D-66123 Saarbrücken, Germany
DOI
10.1093/bioinformatics/btl140
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: The result of a typical microarray experiment is a long list of genes with corresponding expression measurements. This list is only the starting point for a meaningful biological interpretation. Modern methods identify relevant biological processes or functions from gene expression data by scoring the statistical significance of predefined functional gene groups, e.g. based on Gene Ontology (GO). We develop methods that increase the explanatory power of this approach by integrating knowledge about relationships between the GO terms into the calculation of the statistical significance.

Results: We present two novel algorithms that improve GO group scoring using the underlying GO graph topology. The algorithms are evaluated on real and simulated gene expression data. We show that both methods eliminate local dependencies between GO terms and point to relevant areas in the GO graph that remain undetected with state-of-the-art algorithms for scoring functional terms. A simulation study demonstrates that the new methods exhibit a higher level of detecting relevant biological terms than competing methods.
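
The abstract does not spell out either algorithm, so the following is only an illustrative sketch of how GO graph topology could be folded into group scoring. It assumes a bottom-up elimination scheme: each term is scored with a one-sided hypergeometric test, and genes belonging to a term already found significant are removed from all of its ancestors before those are scored. The function names, the `alpha` threshold, and the toy graph are hypothetical and are not taken from the paper.

```python
from math import comb


def hypergeom_tail(k, N, K, n):
    """P(X >= k) when n genes are drawn from N, of which K are annotated."""
    upper = min(K, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / comb(N, n)


def children_first(children):
    """Return all terms so that every term comes after all of its children."""
    order, seen = [], set()

    def visit(term):
        if term in seen:
            return
        seen.add(term)
        for child in children.get(term, []):
            visit(child)
        order.append(term)

    for term in children:
        visit(term)
    return order


def elim_style_scores(children, direct, universe, significant, alpha=0.01):
    """Score terms bottom-up, hiding genes of significant terms from ancestors.

    Assumption: this elimination rule is an illustration of decorrelating
    the graph structure, not a confirmed description of the paper's method.
    """
    order = children_first(children)

    # A term inherits the annotations of all of its descendants.
    genes = {}
    for term in order:
        annotated = set(direct.get(term, ()))
        for child in children.get(term, []):
            annotated |= genes[child]
        genes[term] = annotated

    removed = {term: set() for term in order}
    pvals = {}
    for term in order:
        # Collect genes eliminated anywhere below this term.
        for child in children.get(term, []):
            removed[term] |= removed[child]
        kept = genes[term] - removed[term]
        k = len(kept & significant)
        p = hypergeom_tail(k, len(universe), len(kept), len(significant))
        pvals[term] = p
        if p <= alpha:
            # Significant term: its genes no longer count for ancestors.
            removed[term] |= genes[term]
    return pvals


# Toy data (hypothetical): root -> A -> A1, root -> B.
universe = {f"g{i}" for i in range(20)}
children = {"root": ["A", "B"], "A": ["A1"], "A1": [], "B": []}
direct = {
    "A1": {"g0", "g1", "g2", "g3", "g4"},
    "A": {"g5"},
    "B": {"g10", "g11", "g12", "g13"},
}
significant = {"g0", "g1", "g2", "g3", "g4"}

pvals = elim_style_scores(children, direct, universe, significant)
classic_A = hypergeom_tail(5, 20, 6, 5)  # term A scored independently
```

In this toy setup the specific term `A1` carries all of the signal. Scored independently, its parent `A` also looks significant purely through inherited genes, whereas the elimination pass leaves `A` with no remaining evidence. This is the kind of local dependency between neighbouring GO terms that the abstract refers to.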
Pages: 1600-1607
Page count: 8