In silico pattern-based analysis of the human cytomegalovirus genome

被引:51
|
作者
Rigoutsos, I
Novotny, J
Huynh, T
Chin-Bow, ST
Parida, L
Platt, D
Coleman, D
Shenk, T
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Bioinformat & Pattern Discovery Grp, Yorktown Hts, NY 10598 USA
[2] Victor Chang Cardiac Res Inst, Darlinghurst, NSW 2010, Australia
[3] Princeton Univ, Dept Mol Biol, Princeton, NJ 08544 USA
关键词
D O I
10.1128/JVI.77.7.4326-4344.2003
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/).
引用
收藏
页码:4326 / 4344
页数:19
相关论文
共 50 条
  • [1] In silico structural and functional analysis of the human cytomegalovirus (HHV5) genome
    Novotny, J
    Rigoutsos, I
    Coleman, D
    Shenk, T
    JOURNAL OF MOLECULAR BIOLOGY, 2001, 310 (05) : 1151 - 1166
  • [2] Pattern-based clustering and attribute analysis
    Gabriela Alexe
    Sorin Alexe
    Peter L. Hammer
    Soft Computing, 2006, 10 : 442 - 452
  • [3] Pattern-based clustering and attribute analysis
    Alexe, G
    Alexe, S
    Hammer, PL
    SOFT COMPUTING, 2006, 10 (05) : 442 - 452
  • [4] Efficient analysis of pattern-based constraint specifications
    Wahler, Michael
    Basin, David
    Brucker, Achim D.
    Koehler, Jana
    SOFTWARE AND SYSTEMS MODELING, 2010, 9 (02): : 225 - 255
  • [5] Efficient analysis of pattern-based constraint specifications
    Michael Wahler
    David Basin
    Achim D. Brucker
    Jana Koehler
    Software & Systems Modeling, 2010, 9 : 225 - 255
  • [6] Pattern-Based Analysis of Time Series: Estimation
    Sabeti, Elyas
    Song, Peter X. K.
    Hero, Alfred O.
    2020 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2020, : 1236 - 1241
  • [7] A behavioral analysis approach to pattern-based composition
    Dong, J
    Alencar, PSC
    Cowan, DD
    OOIS 2001: 7TH INTERNATIONAL CONFERENCE ON OBJECT-ORIENTED INFORMATION SYSTEMS, PROCEEDINGS, 2001, : 540 - 549
  • [8] Pattern-Based and Visual Analytics for Visitor Analysis on Websites
    Cervantes, Barbara
    Gomez, Fernando
    Monroy, Raul
    Loyola-Gonzalez, Octavio
    Angel Medina-Perez, Miguel
    Ramirez-Marquez, Jose
    APPLIED SCIENCES-BASEL, 2019, 9 (18):
  • [9] PASER: A Pattern-Based Approach to Service Requirements Analysis
    Wang, Ye
    Wang, Ting
    Sun, Jie
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2019, 29 (04) : 547 - 576
  • [10] BicPAM: Pattern-based biclustering for biomedical data analysis
    Rui Henriques
    Sara C Madeira
    Algorithms for Molecular Biology, 9