Evaluating protein structures determined by structural genomics consortia

被引:578
作者
Bhattacharya, Aneerban
Tejero, Roberto
Montelione, Gaetano T.
机构
[1] Rutgers State Univ, Ctr Adv Biotechnol & Med, NE Struct Genomics Consortium, Piscataway, NJ 08854 USA
[2] Robert Wood Johnson Med Sch, Piscataway, NJ 08854 USA
[3] Univ Valencia, Dept Quim Fis, Valencia, Spain
关键词
protein structure quality; ProsaII; PRO-CHECK; MolProbity; Verify3D; protein NMR structures; X-ray crystal structures;
D O I
10.1002/prot.21165
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Structural genomics projects are providing large quantities of new 3D structural data for proteins. To monitor the quality of these data, we have developed the protein structure validation so ware suite (PSVS), for assessment of protein structures generated by NMR or X-ray crystallographic methods. PSVS is broadly applicable for structure quality assessment in structural biology projects. The software integrates under a single interface analyses from several widely-used structure quality evaluation tools, including PROCHECK (Laskowski et al., J AppI Crystallog 1993;26:283-291), MolProbity (Lovell et al., Proteins 2003;50:437-450), Verify3D (Luthy et al., Nature 1992;356:83-85), ProsaII (Sippl, Proteins 1993;17: 355-362), the PDB validation software, and various structure-validation tools developed in our own laboratory. PSVS provides standard constraint analyses, statistics on goodness-of-fit between structures and experimental data, and knowledge-based structure quality scores in standardized format suitable for database integration. The analysis provides both global and site-specific measures of protein structure quality. Global quality measures are reported as Z scores, based on calibration with a set of high-resolution X-ray crystal structures. PSVS is particularly useful in assessing protein structures determined by NMR methods, but is also valuable for assessing X-ray crystal structures or homology models. Using these tools, we assessed protein structures generated by the Northeast Structural Genomics Consortium and other international structural genomics projects, over a 5-year period. Protein structures produced from structural genomics projects exhibit quality score distributions similar to those of structures produced in traditional structural biology projects during the same time period. However, while some NMR structures have structure quality scores similar to those seen in higher-resolution X-ray crystal structures, the majority of NMR structures have lower scores. Potential reasons for this "structure quality score gap" between NMR and X-ray crystal structures are discussed.
引用
收藏
页码:778 / 795
页数:18
相关论文
共 55 条
[31]   NMR data collection and analysis protocol for high-throughput protein structure determination [J].
Liu, GH ;
Shen, Y ;
Atreya, HS ;
Parish, D ;
Shao, Y ;
Sukumaran, DK ;
Xiao, R ;
Yee, A ;
Lemak, A ;
Bhattacharya, A ;
Acton, TA ;
Arrowsmith, CH ;
Montelione, GT ;
Szyperski, T .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (30) :10487-10492
[32]   Structure validation by Cα geometry:: φ,ψ and Cβ deviation [J].
Lovell, SC ;
Davis, IW ;
Adrendall, WB ;
de Bakker, PIW ;
Word, JM ;
Prisant, MG ;
Richardson, JS ;
Richardson, DC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2003, 50 (03) :437-450
[33]   ASSESSMENT OF PROTEIN MODELS WITH 3-DIMENSIONAL PROFILES [J].
LUTHY, R ;
BOWIE, JU ;
EISENBERG, D .
NATURE, 1992, 356 (6364) :83-85
[34]   Structural genomics: keystone for a Human Proteome Project [J].
Montelione, GT ;
Anderson, S .
NATURE STRUCTURAL BIOLOGY, 1999, 6 (01) :11-12
[35]   STEREOCHEMICAL QUALITY OF PROTEIN-STRUCTURE COORDINATES [J].
MORRIS, AL ;
MACARTHUR, MW ;
HUTCHINSON, EG ;
THORNTON, JM .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1992, 12 (04) :345-364
[36]   Traditional biomolecular structure determination by NMR spectroscopy allows for major errors [J].
Nabuurs, Sander B. ;
Spronk, Chris A. E. M. ;
Vuister, Geerten W. ;
Vriend, Gert .
PLOS COMPUTATIONAL BIOLOGY, 2006, 2 (02) :71-79
[37]   RECOORD:: A recalculated coordinate database of 500+proteins from the PDB using restraints from the BioMagResBank [J].
Nederveen, AJ ;
Doreleijers, JF ;
Vranken, W ;
Miller, Z ;
Spronk, CAEM ;
Nabuurs, SB ;
Güntert, P ;
Livny, M ;
Markley, JL ;
Nilges, M ;
Ulrich, EL ;
Kaptein, R ;
Bonvin, AMJJ .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 59 (04) :662-672
[38]   CALCULATION OF PROTEIN STRUCTURES WITH AMBIGUOUS DISTANCE RESTRAINTS - AUTOMATED ASSIGNMENT OF AMBIGUOUS NOE CROSSPEAKS AND DISULFIDE CONNECTIVITIES [J].
NILGES, M .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 245 (05) :645-660
[39]   Automated NOESY interpretation with ambiguous distance restraints: The refined NMR solution structure of the pleckstrin homology domain from beta-spectrin [J].
Nilges, M ;
Macias, MJ ;
ODonoghue, SI ;
Oschkinat, H .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 269 (03) :408-422
[40]   Deviations from standard atomic volumes as a quality measure for protein crystal structures [J].
Pontius, J ;
Richelle, J ;
Wodak, SJ .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 264 (01) :121-136