In silico pattern-based analysis of the human cytomegalovirus genome

Cited by: 51
Authors
Rigoutsos, I
Novotny, J
Huynh, T
Chin-Bow, ST
Parida, L
Platt, D
Coleman, D
Shenk, T
Affiliations
[1] IBM Corp, Thomas J Watson Res Ctr, Bioinformat & Pattern Discovery Grp, Yorktown Hts, NY 10598 USA
[2] Victor Chang Cardiac Res Inst, Darlinghurst, NSW 2010, Australia
[3] Princeton Univ, Dept Mol Biol, Princeton, NJ 08544 USA
DOI
10.1128/JVI.77.7.4326-4344.2003
Chinese Library Classification (CLC) number
Q93 [Microbiology]
Discipline classification codes
071005; 100705
Abstract
More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/).
Pages: 4326-4344
Page count: 19
Related papers
50 in total
  • [31] Differential gene expression analysis and its pattern in the human genome: an in-silico study on ischemic stroke
    Singh, H. N.
    CEREBROVASCULAR DISEASES, 2016, 41 : 260 - 260
  • [32] The human cytomegalovirus genome revisited: comparison with the chimpanzee cytomegalovirus genome
    Davison, AJ
    Dolan, A
    Akter, P
    Addison, C
    Dargan, DJ
    Alcendor, DJ
    McGeoch, DJ
    Hayward, GS
    JOURNAL OF GENERAL VIROLOGY, 2003, 84 : 17 - 28
  • [33] Dimensional Reduction of Pattern-Based Simulation Using Wavelet Analysis
    Snehamoy Chatterjee
    Roussos Dimitrakopoulos
    Hussein Mustapha
    Mathematical Geosciences, 2012, 44 : 343 - 374
  • [34] Analysis of artificial neural networks for pattern-based adaptive control
    Sbarbaro, Daniel
    Johansen, Tor A.
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (05): : 1184 - 1193
  • [35] A pattern-based topic detection and analysis system on Chinese tweets
    Zhang, Lu
    Wu, Zhiang
    Bu, Zhan
    Jiang, Ye
    Cao, Jie
    JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 28 : 369 - 381
  • [36] OLAP Patterns: A pattern-based approach to multidimensional data analysis
    Kovacic, Ilko
    Schuetz, Christoph G.
    Neumayr, Bernd
    Schrefl, Michael
    DATA & KNOWLEDGE ENGINEERING, 2022, 138
  • [37] Pattern-Based Biclustering with Constraints for Gene Expression Data Analysis
    Henriques, Rui
    Madeira, Sara C.
    PROGRESS IN ARTIFICIAL INTELLIGENCE-BK, 2015, 9273 : 326 - 339
  • [38] A behavioral analysis and verification approach to pattern-based design composition
    Jing Dong
    Paulo S.C. Alencar
    Donald D. Cowan
    Software and Systems Modeling, 2004, 3 (4): : 262 - 272
  • [39] Dimensional Reduction of Pattern-Based Simulation Using Wavelet Analysis
    Chatterjee, Snehamoy
    Dimitrakopoulos, Roussos
    Mustapha, Hussein
    MATHEMATICAL GEOSCIENCES, 2012, 44 (03) : 343 - 374
  • [40] BicPAMS: software for biological data analysis with pattern-based biclustering
    Rui Henriques
    Francisco L. Ferreira
    Sara C. Madeira
    BMC Bioinformatics, 18