Novel implementation of conditional co-regulation by graph theory to derive co-expressed genes from microarray data

被引:15
|
作者
Rawat, Arun [1 ]
Deng, Youping [1 ]
机构
[1] Univ So Mississippi, Hattiesburg, MS 39406 USA
关键词
D O I
10.1186/1471-2105-9-S9-S7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Most existing transcriptional databases like Comprehensive Systems-Biology Database (CSB.DB) and Arabidopsis Microarray Database and Analysis Toolbox (GENEVESTIGATOR) help to seek a shared biological role ( similar pathways and biosynthetic cycles) based on correlation. These utilize conventional methods like Pearson correlation and Spearman rank correlation to calculate correlation among genes. However, not all are genes expressed in all the conditions and this leads to their exclusion in these transcriptional databases that consist of experiments performed in varied conditions. This leads to incomplete studies of co-regulation among groups of genes that might be linked to the same or related biosynthetic pathway. Results: We have implemented an alternate method based on graph theory that takes into consideration the biological assumption - conditional co-regulation is needed to mine a large transcriptional data bank and properties of microarray data. The algorithm calculates relationships among genes by converting discretized signals from the time series microarray data (AtGenExpress) to output strings. A 'score' is generated by using a similarity index against all the other genes by matching stored strings for any gene queried against our database. Taking carbohydrate metabolism as a test case, we observed that those genes known to be involved in similar functions and pathways generate a high 'score' with the queried gene. We were also able to recognize most of the randomly selected correlated pairs from Pearson correlation in CSB. DB and generate a higher number of relationships that might be biologically important. One advantage of our method over previously described approaches is that it includes all genes regardless of its expression values thereby highlighting important relationships absent in other contemporary databases. Conclusion: Based on promising results, we understand that incorporating conditional co-regulation to study large expression data helps us identify novel relationships among genes. The other advantage of our approach is that mining expression data from various experiments, the genes that do not express in all the conditions or have low expression values are not excluded, thereby giving a better overall picture. This results in addressing known limitations of clustering methods in which genes that are expressed in only a subset of conditions are omitted. Based on further scope to extract information, ASIDB implementing above described approach has been initiated as a model database. ASIDB is available at http://www.asidb.com.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Tight cluster analysis of microarray data reveals distinctive patterns of co-regulation in lacrimal gland gene expression
    Mathers, WD
    Choi, D
    Fang, Y
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2005, 46
  • [32] Genes Co-Expressed with ESR2 Influence Clinical Outcomes in Cancer Patients: TCGA Data Analysis
    Lipowicz, Julia Maria
    Malinska, Agnieszka
    Nowicki, Michal
    Rawluszko-Wieczorek, Agnieszka Anna
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (16)
  • [33] Clust: automatic extraction of optimal co-expressed gene clusters from gene expression data
    Basel Abu-Jamous
    Steven Kelly
    Genome Biology, 19
  • [34] Clust: automatic extraction of optimal co-expressed gene clusters from gene expression data
    Abu-Jamous, Basel
    Kelly, Steven
    GENOME BIOLOGY, 2018, 19
  • [35] Genome-Wide Tissue-Specific Gene Expression, Co-expression and Regulation of Co-expressed Genes in Adult Nematode Ascaris suum
    Rosa, Bruce A.
    Jasmer, Douglas P.
    Mitreva, Makedonka
    PLOS NEGLECTED TROPICAL DISEASES, 2014, 8 (02):
  • [36] CluGene: A Bioinformatics Framework for the Identification of Co-Localized, Co-Expressed and Co-Regulated Genes Aimed at the Investigation of Transcriptional Regulatory Networks from High-Throughput Expression Data
    Dottorini, Tania
    Palladino, Pietro
    Senin, Nicola
    Persampieri, Tania
    Spaccapelo, Roberta
    Crisanti, Andrea
    PLOS ONE, 2013, 8 (06):
  • [37] Mining time-shifting co-regulation patterns from gene expression data
    Yin, Ying
    Zhao, Yuhai
    Zhang, Bin
    Wang, Guoren
    ADVANCES IN DATA AND WEB MANAGEMENT, PROCEEDINGS, 2007, 4505 : 62 - +
  • [38] The Co-regulation Data Harvester: Automating gene annotation starting from a transcriptome database
    Tsypin, Lev M.
    Turkewitz, Aaron P.
    SOFTWAREX, 2017, 6 : 165 - 171
  • [39] Co-expressed immune and metabolic genes in visceral and subcutaneous adipose tissue from severely obese individuals are associated with plasma HDL and glucose levels: a microarray study
    Wolfs, Marcel G. M.
    Rensen, Sander S.
    Dijk, Elinda J. Bruin-Van
    Verdam, Froukje J.
    Greve, Jan-Willem
    Sanjabi, Bahram
    Bruinenberg, Marcel
    Wijmenga, Cisca
    van Haeften, Timon W.
    Buurman, Wim A.
    Franke, Lude
    Hofker, Marten H.
    BMC MEDICAL GENOMICS, 2010, 3
  • [40] Co-expressed immune and metabolic genes in visceral and subcutaneous adipose tissue from severely obese individuals are associated with plasma HDL and glucose levels: a microarray study
    Marcel GM Wolfs
    Sander S Rensen
    Elinda J Bruin-Van Dijk
    Froukje J Verdam
    Jan-Willem Greve
    Bahram Sanjabi
    Marcel Bruinenberg
    Cisca Wijmenga
    Timon W van Haeften
    Wim A Buurman
    Lude Franke
    Marten H Hofker
    BMC Medical Genomics, 3