Effective scoring function for protein sequence design

Authors

  • Shide Liang,

    1. Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas
    Search for more papers by this author
  • Nick V. Grishin

    Corresponding author
    1. Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas
    2. Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas
    • Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75390-9050
    Search for more papers by this author

Abstract

We have developed an effective scoring function for protein design. The atomic solvation parameters, together with the weights of energy terms, were optimized so that residues corresponding to the native sequence were predicted with low energy in the training set of 28 protein structures. The solvation energy of non-hydrogen-bonded hydrophilic atoms was considered separately and expressed in a nonlinear way. As a result, our scoring function predicted native residues as the most favorable in 59% of the total positions in 28 proteins. We then tested the scoring function by comparing the predicted stability changes for 103 T4 lysozyme mutants with the experimental values. The correlation coefficients were 0.77 for surface mutations and 0.71 for all mutations. Finally, the scoring function combined with Monte Carlo simulation was used to predict favorable sequences on a fixed backbone. The designed sequences were similar to the natural sequences of the family to which the template structure belonged. The profile of the designed sequences was helpful for identification of remote homologues of the native sequence. Proteins 2004;54:000–000. © 2003 Wiley-Liss, Inc.

Ancillary