Get access

Computation of conformational entropy from protein sequences using the machine-learning method—Application to the study of the relationship between structural conservation and local structural stability

Authors

  • Shao-Wei Huang,

    1. Institute of Bioinformatics, National Chiao Tung University, Taiwan, Republic of China
    Search for more papers by this author
  • Jenn-Kang Hwang

    Corresponding author
    1. Institute of Bioinformatics, National Chiao Tung University, Taiwan, Republic of China
    2. Department of Biological Science & Technology, National Chiao Tung University, Taiwan, Republic of China
    • Department of Biological Science and Technology, National Chiao Tung University, HsinChu 30050, Taiwan, ROC
    Search for more papers by this author

Abstract

A complete protein sequence can usually determine a unique conformation; however, the situation is different for shorter subsequences—some of them are able to adopt unique conformations, independent of context; while others assume diverse conformations in different contexts. The conformations of subsequences are determined by the interplay between local and nonlocal interactions. A quantitative measure of such structural conservation or variability will be useful in the understanding of the sequence–structure relationship. In this report, we developed an approach using the support vector machine method to compute the conformational variability directly from sequences, which is referred to as the sequence structural entropy. As a practical application, we studied the relationship between sequence structural entropy and the hydrogen exchange for a set of well-studied proteins. We found that the slowest exchange cores usually comprise amino acids of the lowest sequence structural entropy. Our results indicate that structural conservation is closely related to the local structural stability. This relationship may have interesting implications in the protein folding processes, and may be useful in the study of the sequence–structure relationship. Proteins 2005. © 2005 Wiley-Liss, Inc.

Ancillary