Abstract
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
The organization and assembly of the cellulosome, an extracellular multienzyme complex produced by anaerobic bacteria, is mediated by the high-affinity interaction of cohesin domains from scaffolding proteins with dockerins of cellulosomal enzymes. We have performed molecular dynamics simulations and free energy calculations on both the wild type (WT) and D39N mutant of the C. thermocellum Type I cohesin-dockerin complex in aqueous solution. The D39N mutation has been experimentally demonstrated to disrupt cohesin-dockerin binding. The present MD simulations indicate that the substitution triggers significant protein flexibility and causes a major change of the hydrogen-bonding network in the recognition strips—the conserved loop regions previously proposed to be involved in binding—through electrostatic and salt-bridge interactions between β-strands 3 and 5 of the cohesin and α-helix 3 of the dockerin. The mutation-induced subtle disturbance in the local hydrogen-bond network is accompanied by conformational rearrangements of the protein side chains and bound water molecules. Additional free energy perturbation calculations of the D39N mutation provide differences in the cohesin-dockerin binding energy, thus offering a direct, quantitative comparison with experiments. The underlying molecular mechanism of cohesin-dockerin complexation is further investigated through the free energy profile, that is, potential of mean force (PMF) calculations of WT cohesin-dockerin complex. The PMF shows a high-free energy barrier against the dissociation and reveals a stepwise pattern involving both the central β-sheet interface and its adjacent solvent-exposed loop/turn regions clustered at both ends of the β-barrel structure.
Introduction
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
The recalcitrance of lignocellulosic biomass to enzymatic hydrolysis is a bottleneck in cellulosic ethanol production.1 One promising avenue for overcoming biomass recalcitrance is to understand and modify the properties of bacterial cellulosomes.2 Cellulosomes are large extracellular enzyme complexes that are produced by anaerobic bacteria and can efficiently break down plant cell wall polysaccharides, such as cellulose, hemicellulose, and pectin into sugars.3–5 The cellulosome complex consists of various kinds of enzymes arranged around a scaffolding protein that does not exhibit catalytic activity but enables the complex to adhere to cellulose.
The organization of the cellulosome is mediated by high-affinity protein–protein interactions between Type I cohesin domains within the scaffolding proteins and complementary Type I dockerin domains carried by cellulosomal enzymes. Early studies on cellulosomes demonstrated that, although cohesin and dockerin exhibit relatively high-sequence homology, the interaction between cohesins and dockerins is generally species specific, that is, cohesins from one species do not recognize and interact with dockerins present in other species.6
A crystal structure for the Type I cohesin-dockerin complex from C. thermocellum has been determined7 (Fig. 1), providing insight into the structure and mechanism of cohesin-dockerin assembly. The cohesin domain forms a nine-stranded β-barrel with an overall jelly-roll topology. The two sheets of the β-barrel are composed of strands 5, 6, 3, and 8 on the contact face with the dockerin, and strands 4, 7, 2, 1, and 9 on the opposite face. The entire structure is stabilized by a tightly packed aromatic/hydrophobic core. The compact nature of the cohesin structure, together with the fact that the contact surface features no obvious binding pocket or cleft, suggests that binding between cohesins and dockerins occurs through the exposed surface residues. The dockerin partner of the cohesin-dockerin complex forms three α-helices, in a conformation defined by two-calcium-binding loop-helix motifs. Indeed, it has been found that Ca2+ plays a key role in maintaining the structural integrity of the cohesin-dockerin complex.8, 9
Inspection of available crystal structures7, 10–12 and site-directed mutagenesis experiments13, 14 suggest that the cohesin-dockerin association is maintained by hydrophobic interactions promoted through an extensive hydrogen-bonding network between one face of the cohesin and the dockerin. A number of hydrophilic residues play an essential role in the recognition and formation of the complex as follows: Arg77, Tyr74, Asp39, Glu86, and Ser88 of the cohesin domain, and Leu22, Arg23, Ser45, Thr46, and Arg53 from α-helices 1 and 3 of the dockerin domain (Fig. 1). In addition to structural information, the effect of single mutations of key residues on the affinity of the interaction has been probed,13, 15 and the binding affinity of WT cohesin-dockerin complex has also been measured.12
Although the crystallographic structure and experimental measurements have provided essential information about the association of cohesins and dockerins, the underlying microscopic dynamic and energetic processes are not directly accessible to experiments. Consequently, aspects of the mechanism governing the assembly of cohesins and dockerins remain uncertain. To deepen our understanding and gain further insight into the assembly mechanism, we have performed molecular dynamics (MD) simulations of the cohesin-dockerin complex. Simulations were also performed of the D39N mutant that has been demonstrated experimentally to dramatically reduce the binding affinity.15 Global dynamics of the complexes and local hydrogen bonding networks formed by several highly conserved residues are examined; the change in free energy of binding accompanying the mutation of Asp39 are also calculated. Furthermore, the free energy landscape for the dissociation of the cohesin and dockerin domains in aqueous solution is explored. The specific domains and amino acid residues that may be involved in this interaction are discussed. The results reveal a detailed view on how the two domains interact.
Discussion
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
Recognition of Type I cohesins by dockerins is the determining event in assembly of individual enzymatic subunits into the cellulosome complex. To our knowledge, however, protein–protein interactions between cohesin and dockerins have not been examined using computational tools. It is therefore particularly informative to elucidate the detailed molecular principles upon which the cohesin-dockerin interaction is based at the atomistic level. The present MD simulations on the Type I cohesin-dockerin complexes in aqueous solution provide information on static stability of the model structures and dynamic details of the cohesin-dockerin interaction, such as degrees of fluctuations and local conformational changes induced by mutations, thus complementing experimental studies.
Biochemical mutagenesis studies have provided critical clues as to the mode of cohesin-dockerin interaction. One of the striking mutations, known to cause recognition failure, is D39N. Asp39 of the cohesin, one of the most conserved residues, is located at the protein–protein interface of the complex. This residue forms direct hydrogen bonds with Ser45 of the dockerin, the most critical residue for domain recognition,12, 14, 21 and water-mediated hydrogen bonds with Val21 and Ile43. It has been demonstrated by mutagenesis that the single substitution of Asp39 by a neutrally charged Asn reduces the affinity of the interaction by more than three orders of magnitude and disrupts the normal recognition of the dockerin.15 Thus, this residue is a hot-spot for the cohesin-dockerin interaction.
Although experimental studies have revealed the importance of Asp39, in the absence of high-resolution structures, the detailed mechanism of how the mutation causes the malfunction remains largely unclear. Extensive MD studies of both the WT cohesin-dockerin complex and D39N mutant were carried out in this study to examine the contribution of this residue to the protein–protein binding. The results indicate that the substitution of the key residue Asp39 by Asn leads to conformational rearrangements of the local structure, such as the geometric reorientation of Asn39 side chain and the loss of hydrogen bonds with Ser45 and Ile43, which may have an impact on the destabilization of the complex and the experimentally observed decrease in affinity. The mutation does not result in significant modification of the dynamic properties of the principal β-sheet contact surface or the α-helices 1 and 3, but does dramatically increase the degree of mobility in the recognition strips, the conserved loops connecting β-strands 4 and 5, 6, and 7 of the cohesin domain.
It is evident that the site of the mutation is spatially far from the recognition strip regions that show increased structural instabilities. Therefore, direct contacts may not be primarily responsible for these fluctuations; instead, conformational fluctuations might be due to alterations in the global modulating forces. It is well-known that long-range electrostatic interaction plays a crucial role in protein stability and protein–protein binding. One of the important electrostatic elements could be the dipole originating from the aligned peptide units of an α-helix, which can contribute to the stabilization of protein structure by interacting with charged side chains.25, 26 In the cohesin-dockerin complex, Ser45 and Arg53 are two key dockerin residues in close contact with Asp39 and Glu86 of the cohesin domain. Ser45 at the N-terminal positive end of α-helix 3 points toward Asp39, whereas Arg53 at the C-terminus forms salt bridges with Glu86 (Fig. S5 in the Supporting Information). This suggests that electrostatic interactions, possibly involving the helix 3 dipole,27 may play a role in the behavior of the recognition strips. Furthermore, calculation of the electrostatic interaction energy between the peptide atoms in the α-helix 3 and the peptide atoms in the two β-strands-3, 5 indicates that there is indeed a favorable interaction between the α-helix and the β-strands that stabilizes the native structure by 9 kcal/mol over D39N. Thus, neutralization of the key residue Asp39 may promote a global conformational effect by altering the electrostatic interaction in the helix-sheet packing, and this interaction may play an essential role in the formation of cohesin-dockerin complexes. On the other hand, the rupture of the hydrogen bond between Asp39 and Ser45 by the D39N mutation may loosen the cohesin-dockerin structure and cause helix 3 to move more freely, thus further promoting structural fluctuations in the recognition strips by weakening the hydrogen bonds and salt bridges between key residues, such as Arg53 and Glu86. Overall, the molecular basis underlying the cause of the observed large protein flexibility in the recognition strips may arise from the decrease in the stability of the interaction of the secondary structure elements, presumably through a less-favorable electrostatic interaction, and from the consequent disruption of the hydrogen-bond network.
The understanding of the underlying molecular association/dissociation mechanism in terms of structure and dynamical events is further facilitated by the knowledge of the free energy profile for the WT cohesin-dockerin dissociation. Specific protein–protein interactions in the cohesin-dockerin complex give rise to the features in the free energy landscape of dissociation. The sequential events of interdomain hydrogen-bond rupture and the step-by-step pattern of the cohesin-dockerin dissociation revealed by the present free energy calculations identify the dominant protein–protein interactions contributing to the overall binding free energy, and indicate that a set of residues lying on the flattened β-sheet surface and in the peripheral loop regions is the main obstacle to dockerin unbinding. Although examination of the crystal structure alone suggests that the formation of the cohesin-dockerin complex involves relatively large surface areas on both partners, the present results show that specific surface regions play more critical roles than others in forming and maintaining the integrity of the cellulosome complex. The results also provide evidence for a mode of binding involving both α-helices in the dockerin and the corresponding surface region from its cohesin partner: the C-terminal end of dockerin helix 1 interacting with the β-strand 5/6 loop, whereas the N-terminus diverted away from the cohesin surface; the N-terminus of helix 2 covering the core fragment of the β-sheet interface and the C-terminus interacting with the recognition strips. The cohesin-dockerin binding may therefore take place in a cooperative manner.
The experimental estimate of the overall equilibrium binding constant for the present cohesin/dockerin complex is 8 × 107M−1,12 corresponding to a free energy change of about 12 kcal/mol (ΔG = −RTlnKa, where R is the gas constant and T = 65°C). In the simulations, the overall difference in the calculated free energy between the minimum of the bound state and the barrier is ∼17 kcal/mol. This agreement is reasonable, given that the direct comparison of the dissociation free energy with the experimentally determined absolute binding energy would require a knowledge of the contribution to the free energy difference of the change in the free energy associated in the translational and rotational degrees of freedom on complexation,28–30 and given the inherent errors stemming from the implementation of free energy algorithms and the sampling errors that may arise from the conformational flexibility of the unbound dockerin domain in solution. Furthermore, this study is focused on a detailed view of the underlying mechanism of association and interaction in the cohesin-dockerin complex rather than calculating the absolute binding free energy.
The present simulation results provide insight into the molecular principles that govern cohesin-dockerin domain recognition. These principles could in turn be used to guide protein engineering modifications so as to alter cohesin-dockerin binding. Efforts are underway to design engineered cellulosomal modules, which can conduct more efficient biomass degradation than the corresponding wild-type protein complexes. Both atomic-detail and coarse-grained computer simulations are expected, in conjunction with appropriate biochemical and biophysical experiments,31 to provide a foundation for understanding the principles of domain synergy and cellulosomal activity, thus allowing the rational, structure-based design of improved cellulosomal assemblies for cellulosic ethanol production.