Abstract
- Top of page
- Abstract
- INTRODUCTION
- MATERIALS AND METHODS
- RESULTS AND DISCUSSION
- CONCLUSION
- REFERENCES
The supersecondary structure of amyloids and prions, proteins of intense clinical and biological interest, are difficult to determine by standard experimental or computational means. In addition, significant conformational heterogeneity is known or suspected to exist in many amyloid fibrils. Previous work has demonstrated that probability-based prediction of discrete β-strand pairs can offer insight into these structures. Here, we devise a system of energetic rules that can be used to dynamically assemble these discrete β-strand pairs into complete amyloid β-structures. The STITCHER algorithm progressively ‘stitches’ strand-pairs into full β-sheets based on a novel free-energy model, incorporating experimentally observed amino-acid side-chain stacking contributions, entropic estimates, and steric restrictions for amyloidal parallel β-sheet construction. A dynamic program computes the top 50 structures and returns both the highest scoring structure and a consensus structure taken by polling this list for common discrete elements. Putative structural heterogeneity can be inferred from sequence regions that compose poorly. Predictions show agreement with experimental models of Alzheimer's amyloid beta peptide and the Podospora anserina Het-s prion. Predictions of the HET-s homolog HET-S also reflect experimental observations of poor amyloid formation. We put forward predicted structures for the yeast prion Sup35, suggesting N-terminal structural stability enabled by tyrosine ladders, and C-terminal heterogeneity. Predictions for the Rnq1 prion and alpha-synuclein are also given, identifying a similar mix of homogenous and heterogeneous secondary structure elements. STITCHER provides novel insight into the energetic basis of amyloid structure, provides accurate structure predictions, and can help guide future experimental studies. Proteins 2012. © 2011 Wiley Periodicals, Inc.
INTRODUCTION
- Top of page
- Abstract
- INTRODUCTION
- MATERIALS AND METHODS
- RESULTS AND DISCUSSION
- CONCLUSION
- REFERENCES
Amyloid is a highly-ordered cross β-protein aggregate that can be achieved by a very broad set of proteins with widely divergent and unrelated amino acid sequences.1, 2 Given the right conditions, a great many, perhaps most, proteins have the potential to form amyloids. The tendency towards amyloid appears to be because of intrinsic properties of the peptide backbone, a finding of great importance for understanding the evolution of protein folds. A much smaller fraction of proteins and protein fragments, assemble into amyloid under normal physiological conditions, and these are of great interest in diverse aspects of biology and medicine.3
Early amyloid research concentrated on amyloids associated with a wide variety of mammalian diseases, from systemic immunoglobulin amyloidoses to neurodegenerative diseases such as Alzheimer's.4 Initial assumptions that accumulated amyloid always caused cellular and tissue toxicity, as is still believed to take place in peripheral amyloidoses,5 proved to be unfounded upon the discovery of a wider variety of amyloids. Amyloids are now known to play roles in bacterial biofilms,6 the production of melanin,7 the storage of hormones in secretory granules,8 and neuronal learning and memory.9 A set of self-templating fungal amyloids additionally give rise to epigenetic heritable traits. These bistable “prion” proteins can persist as soluble or amyloid species with different functional activities. The self-templating property causes cell-wide persistence of one or the other stable state, a status passed from generation to generation via cytoplasmic transfer of amyloid templates from mother to daughter cells.10, 11 Increasingly, evidence suggests that the formation of amyloids may more commonly be a protective mechanism, which especially in the case of the neurodegenerative amyloidoses, acts as to sequester misfolded polypeptides that would otherwise dwell in more toxic, and more highly interactive, oligomeric species. Therefore, there is great interest in deciphering the structures that underlie amyloid states.
While the secondary structure of amyloid is known to be highly β-rich,12–16 experimental structural determination has proven highly difficult, with only extremely short segments crystallized17, 18 and a very few successful solid-state nuclear magnetic resonance (ssNMR) studies.19–23 Because of the scarcity of direct evidence, the nature of amyloid and prion supersecondary structures, and their relation to sequence have been highly contentious topics.17, 24, 25 The parallel β-helices form a fold widely cited as one-potential model for amyloid,26–28 while others favor a ‘superpleated β-sheet.’29–31 Complications include the morphological heterogeneity of amyloid structures suggested by EM imagery27, 32 and the demonstration of prion ‘strains’ or ‘variants’ with differing growth and stability phenotypes.33–35 In the case of the yeast prion protein Sup35, such variants have been demonstrated to maintain specificity through serial passage34 and have been correlated with differences in conformation.36
The bistable nature of amyloid prions, as well as the observation of heterogeneity and ‘strains’ in amyloid and prion folding, undermines the canonical viewpoint of ‘one-protein, one-fold’ long held by theorists of protein folding. Instead, a murky view arises of a set of stable valleys in a field of conformational configurations, within which variations are permitted around common or similar folding patterns. Enzymologists have long studied the variations in globular protein conformations caused by ligand binding, catalytic activity, presence of ions or cofactors. Amyloids embody a similar but larger set of variations.
Bryan et al.37 and others have proposed that β-strand pairs form the core energetic subunits that make up amyloid structure, and a β-strand predictor was designed around this named BETASCAN. BETASCAN calculates likelihood scores for potential β-strands and strand-pairs based on correlations observed in parallel β-sheets. A key and novel feature of BETASCAN was a maxima-finding algorithm that searched the strands and pairs with the greatest local likelihood for all of the sequence's potential β-structures. While sufficient to predict sequence regions with high potential for amyloidogenic β-structure, BETASCAN did not suggest the most likely overall β-sheet fold. For example, BETASCAN was unable to distinguish between the highly similar amyloidal HET-s allele and nonamyloidal HET-S allele in Podospora anserine.
The STITCHER method described in this article extends prediction of amyloid-like proteins by employing a combination of probabilistic prediction38 and free-energy39 methods for protein structural prediction. Since, few atomic-detail templates exist from known structures, the algorithm proceeds via a dynamic assembly of β-strands in agreement with the twists and turns necessary to form a β-helix or superpleated-sheet fold. This philosophy of establishing and then manipulating predefined structural components has been previously used successfully.40, 41 The score of each successive β-strand addition is determined through a combination of novel free-energy model and BETASCAN-derived probabilities. The free-energy methods account for the enthalpy of created hydrogen bonds and the entropy of linkers, while the probabilities describing the likelihood of β-sheet formation account for the specific side-chain–side-chain interactions that confer structural stability. Of particular importance to our energetic model is the detection of stacking ladders, formed by the side-chain–side-chain stacking and bonding of glutamine, asparagine, tyrosine, and phenylalanine residues.42–44 To capture the observed structural heterogeneity of amyloids (e.g., the “strain” phenomenon), STITCHER calculates a list of the top-scoring M = 50 structures instead of just a single optimal. From this set of high-scoring candidates, a consensus structure is derived to represent the commonalities in specific strand-pairs among these 50 structures. Specifically, portions of the structure are considered more likely to form if they are seen in 80% or more of the top structures.
In our results, we show that the STITCHER method can be used to accurately reconstruct structure, as is given by the example of the well-studied Alzheimer's amyloid beta-peptide and the Podospora anserina Het-s prion. STITCHER is also shown to be less prone to false positives than the prior BETASCAN program as it distinguishes the amyloid forming HET-s protein from its close, nonamyloidal homolog, HET-S. Novel structural predictions are then analyzed for the prion domain of the yeast protein Sup35 as well as the Rnq1 prion and alpha-synuclein. The STITCHER algorithm may be accessed at http://stitcher.csail.mit.edu.
CONCLUSION
- Top of page
- Abstract
- INTRODUCTION
- MATERIALS AND METHODS
- RESULTS AND DISCUSSION
- CONCLUSION
- REFERENCES
STITCHER introduces a novel energetic scoring model for amyloid fibrils and an efficient algorithm for dynamically assembling discrete β-strand pairs into complete amyloid structures. The system of physical constraints used to “stitch” β-strands into a complete structure offers an accurate generalization of successful template-based methods such as BETAWRAP and its successors,44, 51, 52 which only conformed to rigid templates. In addition, STITCHER takes into account the unavoidable uncertainty in free-energy parameters and the potential heterogeneity of amyloid folds by computing multiple solutions instead a single optimal. Although, the highest-scoring fold frequently offers a good solution, the best results are achieved by assembling the top solution's most frequently occurring strand-pairs into consensus structures. It should be noted that the particular fold taken by an amyloid is sensitive to environmental conditions, including pH, concentration of protein, and presence of chaperone proteins. Thus, the structure with the highest STITCHER score may not be that taken by the protein under experimental conditions.
The results for alpha-synuclein, Rnq1p, and to some extent the various Sup35p proteins highlight the role of single-residue repeats, motif repeats, and sidechain stacking ladders in amyloid structure. Single-residue repeats may contribute to structural stability through stacking ladders, especially in the cases of polyglutamine and polyasparagine. However, this stability increase comes at the expense of β-strand pairing specificity, as the importance of aligning any particular pair of residues is reduced. A similar but diminished effect is seen in repeats composed of multiple-residue motifs, such as in S. cerevisiae Sup35p. This suggests that the β-strand pairing specificity for repeat-heavy amyloidogenic proteins may be conveyed through two other features of a structure: short intervening linker loops, and the formation of strand-pairs with more rare stacking ladders such as histidine and phenylalanine.
In the cases of known amyloid structures amyloid beta and HET-s, STITCHER is able to predict the core regions of β-structure observed experimentally. Conversely, the predicted results for Het-S agree with the protein's observed nonamyloidogenic nature, despite a high β-propensity sequence that is nearly the same as the amyloidogenic HET-s prion. While a more robust analysis and verification would require additional amyloid structure determination, the STITCHER methodology appears to be a valuable addition to the growing number of amyloid detection algorithms. Further, as new experimental data provides better insight into the nature of β-strand energetics, and new amyloid structures arise, the STITCHER algorithm could be readily extended. While some interpretation and experimental verification is necessary for a complete understanding of amyloid folding, the identification of the range of most likely folds should greatly enhance the further computational and experimental investigation of amyloid and prion proteins.