De novo prediction of three-dimensional structures for major protein families

TitleDe novo prediction of three-dimensional structures for major protein families
Publication TypeJournal Article
Year of Publication2002
AuthorsBonneau, R., Strauss C. E. M., Rohl C. A., Chivian D., Bradley P., Malmström L., Robertson T., & Baker D.
JournalJournal of molecular biology
Volume322
Issue1
Pagination65-78
Date Published2002 Sep 6
ISSN0022-2836
KeywordsCalibration, Computational Biology, Computer Simulation, Databases, Protein, Models, Molecular, Primary Publication, Protein Folding, Protein Structure, Tertiary, Proteins, Sensitivity and Specificity, Sequence Alignment
Abstract

We use the Rosetta de novo structure prediction method to produce three-dimensional structure models for all Pfam-A sequence families with average length under 150 residues and no link to any protein of known structure. To estimate the reliability of the predictions, the method was calibrated on 131 proteins of known structure. For approximately 60% of the proteins one of the top five models was correctly predicted for 50 or more residues, and for approximately 35%, the correct SCOP superfamily was identified in a structure-based search of the Protein Data Bank using one of the models. This performance is consistent with results from the fourth critical assessment of structure prediction (CASP4). Correct and incorrect predictions could be partially distinguished using a confidence function based on a combination of simulation convergence, protein length and the similarity of a given structure prediction to known protein structures. While the limited accuracy and reliability of the method precludes definitive conclusions, the Pfam models provide the only tertiary structure information available for the 12% of publicly available sequences represented by these large protein families.

Alternate JournalJ. Mol. Biol.
AttachmentSize
bonneau02B.pdf855.42 KB