Novel implementation of conditional co-regulation by graph theory to derive co-expressed genes from microarray data

被引:15
|
作者
Rawat, Arun [1 ]
Deng, Youping [1 ]
机构
[1] Univ So Mississippi, Hattiesburg, MS 39406 USA
关键词
D O I
10.1186/1471-2105-9-S9-S7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Most existing transcriptional databases like Comprehensive Systems-Biology Database (CSB.DB) and Arabidopsis Microarray Database and Analysis Toolbox (GENEVESTIGATOR) help to seek a shared biological role ( similar pathways and biosynthetic cycles) based on correlation. These utilize conventional methods like Pearson correlation and Spearman rank correlation to calculate correlation among genes. However, not all are genes expressed in all the conditions and this leads to their exclusion in these transcriptional databases that consist of experiments performed in varied conditions. This leads to incomplete studies of co-regulation among groups of genes that might be linked to the same or related biosynthetic pathway. Results: We have implemented an alternate method based on graph theory that takes into consideration the biological assumption - conditional co-regulation is needed to mine a large transcriptional data bank and properties of microarray data. The algorithm calculates relationships among genes by converting discretized signals from the time series microarray data (AtGenExpress) to output strings. A 'score' is generated by using a similarity index against all the other genes by matching stored strings for any gene queried against our database. Taking carbohydrate metabolism as a test case, we observed that those genes known to be involved in similar functions and pathways generate a high 'score' with the queried gene. We were also able to recognize most of the randomly selected correlated pairs from Pearson correlation in CSB. DB and generate a higher number of relationships that might be biologically important. One advantage of our method over previously described approaches is that it includes all genes regardless of its expression values thereby highlighting important relationships absent in other contemporary databases. Conclusion: Based on promising results, we understand that incorporating conditional co-regulation to study large expression data helps us identify novel relationships among genes. The other advantage of our approach is that mining expression data from various experiments, the genes that do not express in all the conditions or have low expression values are not excluded, thereby giving a better overall picture. This results in addressing known limitations of clustering methods in which genes that are expressed in only a subset of conditions are omitted. Based on further scope to extract information, ASIDB implementing above described approach has been initiated as a model database. ASIDB is available at http://www.asidb.com.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] EPIG-Seq: extracting patterns and identifying co-expressed genes from RNA-Seq data
    Jianying Li
    Pierre R. Bushel
    BMC Genomics, 17
  • [22] EPIG-Seq: extracting patterns and identifying co-expressed genes from RNA-Seq data
    Li, Jianying
    Bushel, Pierre R.
    BMC GENOMICS, 2016, 17
  • [23] Assessing co-regulation of directly linked genes in biological networks using microarray time series analysis
    Del Sorbo, Maria Rosaria
    Balzano, Walter
    Donato, Michele
    Draghici, Sorin
    BIOSYSTEMS, 2013, 114 (02) : 149 - 154
  • [24] Identification of novel motif patterns to decipher the promoter architecture of co-expressed genes in Arabidopsis thaliana
    Lopez, Yosvany
    Patil, Ashwini
    Nakai, Kenta
    BMC SYSTEMS BIOLOGY, 2013, 7 : S10
  • [25] Computational analysis of publicly available data identifies co-expressed genes in breast cancer cells
    Thompson, HGR
    Harris, JW
    Khosho, AR
    Martinez, JE
    Brody, JP
    BIOPHYSICAL JOURNAL, 2002, 82 (01) : 473A - 473A
  • [26] IDENTIFICATION OF GENES CONSISTENTLY CO-EXPRESSED IN MULTIPLE MICROARRAY DATASETS BY A GENOME-WIDE BI-COPAM APPROACH
    Abu-Jamous, Basel
    Fa, Rui
    Roberts, David J.
    Nandi, Asoke K.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 1172 - 1176
  • [27] Significance analysis and improved discovery of disease-specific Differentially Co-expressed Gene Sets in microarray data
    Li, Haixia
    Karuturi, R. Krishna Murthy
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (06) : 617 - 638
  • [28] The identification of co-expressed gene modules in Streptococcus pneumonia from colonization to infection to predict novel potential virulence genes
    Jamalkandi, Sadegh Azimzadeh
    Kouhsar, Morteza
    Salimian, Jafar
    Ahmadi, Ali
    BMC MICROBIOLOGY, 2020, 20 (01)
  • [29] The identification of co-expressed gene modules in Streptococcus pneumonia from colonization to infection to predict novel potential virulence genes
    Sadegh Azimzadeh Jamalkandi
    Morteza Kouhsar
    Jafar Salimian
    Ali Ahmadi
    BMC Microbiology, 20
  • [30] Large-scale mining co-expressed genes in Arabidopsis anther: From pair to group
    Jiao, Qing-Ju
    Huang, Yan
    Shen, Hong-Bin
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2011, 35 (02) : 62 - 68