Improved scoring of functional groups from gene expression data by decorrelating GO graph structure

Cited by: 1548
Authors
Alexa, Adrian [1]
Rahnenfuehrer, Joerg [1]
Lengauer, Thomas [1]
Affiliations
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
DOI
10.1093/bioinformatics/btl140
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: The result of a typical microarray experiment is a long list of genes with corresponding expression measurements. This list is only the starting point for a meaningful biological interpretation. Modern methods identify relevant biological processes or functions from gene expression data by scoring the statistical significance of predefined functional gene groups, e.g. based on Gene Ontology (GO). We develop methods that increase the explanatory power of this approach by integrating knowledge about relationships between the GO terms into the calculation of the statistical significance.
Results: We present two novel algorithms that improve GO group scoring using the underlying GO graph topology. The algorithms are evaluated on real and simulated gene expression data. We show that both methods eliminate local dependencies between GO terms and point to relevant areas in the GO graph that remain undetected with state-of-the-art algorithms for scoring functional terms. A simulation study demonstrates that the new methods exhibit a higher level of detecting relevant biological terms than competing methods.
Pages: 1600 - 1607
Page count: 8
Related papers
50 in total
  • [1] An improved scoring scheme for predicting glycan structures from gene expression data
    Suga, Akitsugu
    Yamanishi, Yoshihiro
    Hashimoto, Kosuke
    Goto, Susumu
    Kanehisa, Minoru
    GENOME INFORMATICS 2007, VOL 18, 2007, 18 : 237 - 246
  • [2] Optimizing gene set annotations combining GO structure and gene expression data
    Wang, Dong
    Li, Jie
    Liu, Rui
    Wang, Yadong
    BMC SYSTEMS BIOLOGY, 2018, 12
  • [3] Comparisons of Graph-structure Clustering Methods for Gene Expression Data
    Fang, Zhuo
    Shanghai Center for Bioinformatics Technology; Department of EECS; W.M. Keck Center for Comparative and Functional Genomics
    Acta Biochimica et Biophysica Sinica, 2006, (06) : 379 - 384
  • [4] Comparisons of graph-structure clustering methods for gene expression data
    Fang, Zhuo
    Liu, Lei
    Yang, Jiong
    Luo, Qing-Ming
    Li, Yi-Xue
    ACTA BIOCHIMICA ET BIOPHYSICA SINICA, 2006, 38 (06) : 379 - 384
  • [5] Discriminating Graph Pattern Mining from Gene Expression Data
    Fassetti, Fabio
    Rombo, Simona E.
    Serrao, Cristina
    APPLIED COMPUTING REVIEW, 2016, 16 (03) : 26 - 36
  • [6] T-profiler: scoring the activity of predefined groups of genes using gene expression data
    Boorsma, A
    Foat, BC
    Vis, D
    Klis, F
    Bussemaker, HJ
    NUCLEIC ACIDS RESEARCH, 2005, 33 : W592 - W595
  • [7] GO-Mapper: functional analysis of gene expression data using the expression level as a score to evaluate Gene Ontology terms
    Smid, M
    Dorssers, LCJ
    BIOINFORMATICS, 2004, 20 (16) : 2618 - 2625
  • [8] Finding groups in gene expression data
    Hand, DJ
    Heard, NA
    JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2005, (02) : 215 - 225
  • [9] Mapping functional transcription factor networks from gene expression data
    Haynes, Brian C.
    Maier, Ezekiel J.
    Kramer, Michael H.
    Wang, Patricia I.
    Brown, Holly
    Brent, Michael R.
    GENOME RESEARCH, 2013, 23 (08) : 1319 - 1328
  • [10] An improved biclustering algorithm for gene expression data
    Jin, Sheng-Hua
    Hua, Li
    Open Cybernetics and Systemics Journal, 2014, 8 : 1141 - 1144