Novel implementation of conditional co-regulation by graph theory to derive co-expressed genes from microarray data

被引:15
|
作者
Rawat, Arun [1 ]
Deng, Youping [1 ]
机构
[1] Univ So Mississippi, Hattiesburg, MS 39406 USA
关键词
D O I
10.1186/1471-2105-9-S9-S7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Most existing transcriptional databases like Comprehensive Systems-Biology Database (CSB.DB) and Arabidopsis Microarray Database and Analysis Toolbox (GENEVESTIGATOR) help to seek a shared biological role ( similar pathways and biosynthetic cycles) based on correlation. These utilize conventional methods like Pearson correlation and Spearman rank correlation to calculate correlation among genes. However, not all are genes expressed in all the conditions and this leads to their exclusion in these transcriptional databases that consist of experiments performed in varied conditions. This leads to incomplete studies of co-regulation among groups of genes that might be linked to the same or related biosynthetic pathway. Results: We have implemented an alternate method based on graph theory that takes into consideration the biological assumption - conditional co-regulation is needed to mine a large transcriptional data bank and properties of microarray data. The algorithm calculates relationships among genes by converting discretized signals from the time series microarray data (AtGenExpress) to output strings. A 'score' is generated by using a similarity index against all the other genes by matching stored strings for any gene queried against our database. Taking carbohydrate metabolism as a test case, we observed that those genes known to be involved in similar functions and pathways generate a high 'score' with the queried gene. We were also able to recognize most of the randomly selected correlated pairs from Pearson correlation in CSB. DB and generate a higher number of relationships that might be biologically important. One advantage of our method over previously described approaches is that it includes all genes regardless of its expression values thereby highlighting important relationships absent in other contemporary databases. Conclusion: Based on promising results, we understand that incorporating conditional co-regulation to study large expression data helps us identify novel relationships among genes. The other advantage of our approach is that mining expression data from various experiments, the genes that do not express in all the conditions or have low expression values are not excluded, thereby giving a better overall picture. This results in addressing known limitations of clustering methods in which genes that are expressed in only a subset of conditions are omitted. Based on further scope to extract information, ASIDB implementing above described approach has been initiated as a model database. ASIDB is available at http://www.asidb.com.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Pscan: finding over-represented transcription factor binding site motifs in sequences from co-regulated or co-expressed genes
    Zambelli, Federico
    Pesole, Graziano
    Pavesi, Giulio
    NUCLEIC ACIDS RESEARCH, 2009, 37 : W247 - W252
  • [42] A novel Mixture Model Method for identification of differentially expressed genes from DNA microarray data
    Najarian, K
    Zaheri, M
    Rad, AA
    Najarian, S
    Dargahi, J
    BMC BIOINFORMATICS, 2004, 5 (1)
  • [43] A novel Mixture Model Method for identification of differentially expressed genes from DNA microarray data
    Kayvan Najarian
    Maryam Zaheri
    Ali A Rad
    Siamak Najarian
    Javad Dargahi
    BMC Bioinformatics, 5
  • [44] Venn Mapping: clustering of heterologous microarray data based on the number of co-occurring differentially expressed genes
    Smid, M
    Dorssers, LCJ
    Jenster, G
    BIOINFORMATICS, 2003, 19 (16) : 2065 - 2071
  • [45] Microarray-based identification and analysis of co-expressed genes in azoxymethane-treated colonic mucosa and AOM-induced colonic carcinomas.
    Joseph, LJ
    Brasitus, TA
    Khare, S
    Wali, RR
    GASTROENTEROLOGY, 2000, 118 (04) : A276 - A277
  • [46] Integration of Known Transcription Factor Binding Site Information and Gene Expression Data to Advance from Co-Expression to Co-Regulation
    Maarten Clements
    Eugene P. van Someren
    Theo A. Knijnenburg
    Marcel J.T. Reinders
    Genomics Proteomics & Bioinformatics, 2007, (02) : 86 - 101
  • [47] Co-expressed functional module-related genes in ovarian cancer stem cells represent novel prognostic biomarkers in ovarian cancer
    Gov, Esra
    SYSTEMS BIOLOGY IN REPRODUCTIVE MEDICINE, 2020, 66 (04) : 255 - 266
  • [48] Meta-analysis of brain tumor microarray data using Oncomine identifies NRF1, Tfam and Myc co-expressed genes: its implications in the development of childhood brain tumors
    Kunkle, B.
    Felty, Q.
    Narasimhan, G.
    Trevino, F.
    Roy, D.
    18TH WORLD IMACS CONGRESS AND MODSIM09 INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: INTERFACING MODELLING AND SIMULATION WITH MATHEMATICAL AND COMPUTATIONAL SCIENCES, 2009, : 720 - 726
  • [49] Identifying the Salient Genes in Microarray Data: A Novel Game Theoretic Model for the Co-Expression Network
    Bora, Papori Neog
    Baruah, Vishwa Jyoti
    Borkotokey, Surajit
    Gogoi, Loyimee
    Mahanta, Priyakshi
    Sarmah, Ankumon
    Kumar, Rajnish
    Moretti, Stefano
    DIAGNOSTICS, 2020, 10 (08)
  • [50] Association Network Modeling from Microarray Data around fermentation stress response gene NSFI (YPL230W) using Significantly Co-expressed Gene Set
    Bessonov, Kyrylo
    Chiu, David K. Y.
    van der Merwe, George
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2010, : 35 - 40