Research Article
Prediction of protein B-factor profiles
Article first published online: 11 JAN 2005
DOI: 10.1002/prot.20375
Copyright © 2005 Wiley-Liss, Inc.
Issue
1097-0134/asset/cover.gif?v=1&s=d817e79b67ba6cacf8bdcce1a819c04de300a7e3)
Proteins: Structure, Function, and Bioinformatics
Volume 58, Issue 4, pages 905–912, 1 March 2005
Additional Information
How to Cite
Yuan, Z., Bailey, T. L. and Teasdale, R. D. (2005), Prediction of protein B-factor profiles. Proteins, 58: 905–912. doi: 10.1002/prot.20375
Publication History
- Issue published online: 8 FEB 2005
- Article first published online: 11 JAN 2005
- Manuscript Accepted: 27 SEP 2004
- Manuscript Received: 21 JUN 2004
Funded by
- Australian Research Council (ARC)
- DETYA
- National Health and Medical Research Council of Australia R. Douglas Wright Career Development Award
Keywords:
- B-factor;
- flexibility;
- support vector regression;
- evolutionary information;
- ROC analysis;
- protein sequence
Abstract
The polypeptide backbones and side chains of proteins are constantly moving due to thermal motion and the kinetic energy of the atoms. The B-factors of protein crystal structures reflect the fluctuation of atoms about their average positions and provide important information about protein dynamics. Computational approaches to predict thermal motion are useful for analyzing the dynamic properties of proteins with unknown structures. In this article, we utilize a novel support vector regression (SVR) approach to predict the B-factor distribution (B-factor profile) of a protein from its sequence. We explore schemes for encoding sequences and various settings for the parameters used in SVR. Based on a large dataset of high-resolution proteins, our method predicts the B-factor distribution with a Pearson correlation coefficient (CC) of 0.53. In addition, our method predicts the B-factor profile with a CC of at least 0.56 for more than half of the proteins. Our method also performs well for classifying residues (rigid vs. flexible). For almost all predicted B-factor thresholds, prediction accuracies (percent of correctly predicted residues) are greater than 70%. These results exceed the best results of other sequence-based prediction methods. Proteins 2005. © 2005 Wiley-Liss, Inc.

1097-0134/asset/PROT_centre.gif?v=1&s=77b56b1f2cdaba74cb3bb149bd9b029cd8803cdb)