Research Article
Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks
Article first published online: 18 SEP 2000
DOI: 10.1002/1097-0134(20001115)41:3<271::AID-PROT10>3.0.CO;2-Z
Copyright © 2000 Wiley-Liss, Inc.
Issue
1097-0134/asset/cover.gif?v=1&s=d817e79b67ba6cacf8bdcce1a819c04de300a7e3)
Proteins: Structure, Function, and Bioinformatics
Volume 41, Issue 3, pages 271–287, 15 November 2000
Additional Information
How to Cite
de Brevern, A.G., Etchebest, C. and Hazout, S. (2000), Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks. Proteins, 41: 271–287. doi: 10.1002/1097-0134(20001115)41:3<271::AID-PROT10>3.0.CO;2-Z
Publication History
- Issue published online: 18 SEP 2000
- Article first published online: 18 SEP 2000
- Manuscript Accepted: 12 JUN 2000
- Manuscript Received: 6 JAN 2000
- Abstract
- Article
- References
- Cited By
Keywords:
- protein backbone structure;
- unsupervised classifier;
- structure-sequence relationships;
- structure prediction;
- protein block;
- Bayesian approach;
- prediction strategies
Abstract
By using an unsupervised cluster analyzer, we have identified a local structural alphabet composed of 16 folding patterns of five consecutive Cα (“protein blocks”). The dependence that exists between successive blocks is explicitly taken into account. A Bayesian approach based on the relation protein block-amino acid propensity is used for prediction and leads to a success rate close to 35%. Sharing sequence windows associated with certain blocks into “sequence families” improves the prediction accuracy by 6%. This prediction accuracy exceeds 75% when keeping the first four predicted protein blocks at each site of the protein. In addition, two different strategies are proposed: the first one defines the number of protein blocks in each site needed for respecting a user-fixed prediction accuracy, and alternatively, the second one defines the different protein sites to be predicted with a user-fixed number of blocks and a chosen accuracy. This last strategy applied to the ubiquitin conjugating enzyme (α/β protein) shows that 91% of the sites may be predicted with a prediction accuracy larger than 77% considering only three blocks per site. The prediction strategies proposed improve our knowledge about sequence-structure dependence and should be very useful in ab initio protein modelling. Proteins 2000;41:271–287. © 2000 Wiley-Liss, Inc.

1097-0134/asset/PROT_centre.gif?v=1&s=77b56b1f2cdaba74cb3bb149bd9b029cd8803cdb)