Istanbul Technical University, Faculty of Science and Letters, Department of Molecular Biology and Genetics, Istanbul, Turkey
Istanbul Technical University, Molecular Biology-Biotechnology and Genetics Research Centre, 34469 Istanbul, Turkey
Corresponding author. Address: Istanbul Technical University, Faculty of Science and Letters, Department of Molecular Biology and Genetics, 34469 Istanbul, Turkey. Fax: +90 212 2856386.
The folding mechanism and stability of dimeric formate dehydrogenase from Candida methylica was analysed by exposure to denaturing agents and to heat. Equilibrium denaturation data yielded a dissociation constant of about 10−13 M for assembly of the protein from unfolded chains and the kinetics of refolding and unfolding revealed that the overall process comprises two steps. In the first step a marginally stable folded monomeric state is formed at a rate (k1) of about 2 × 10−3 s−1 (by deduction k−1 is about10−4 s−1) and assembles into the active dimeric state with a bimolecular rate constant (k2) of about 2 × 104 M−1 s−1. The rate of dissociation of the dimeric state in physiological conditions is extremely slow (k−2 ∼ 3 × 10−7 s−1).
It is arguable that efforts to control the stability of proteins by genetic modification of sequence could be laid on a firmer foundation such that we would understand better the full mechanism of folding and unfolding of the active system. This, ideally, requires elucidation of the kinetic and equilibrium properties of each step and, in addition, it would be illuminating to know the temperature-dependence of the system so that its classical thermodynamic properties can be derived. With such a foundation, it might be possible to target certain critical steps in the process, so that sequence engineering would be more rationally directed.
While there is a wealth of data on the kinetics and thermodynamics of folding in single-chain proteins [1], our understanding of folding and assembly processes in multi-chain proteins is less comprehensive. Clearly, it is more demanding to study such systems, owing to their greater complexity and the frequently encountered inefficiency of self-assembly of oligomeric structures. However, many of the proteins of interest in biotechnology are oligomeric, as are many of the structures in biological systems where we want to understand the dynamics of assembly and disassembly and for these reasons it is useful to examine such mechanisms with a view to providing a framework for their analysis.
In this paper we use Candida methylica formate dehydrogenase (cmFDH) cloned and overproduced by Allen and Holbrook [2] as a study object owing to its practical potential in synthetic chemistry and the ease of measuring its regain of activity. The native form of cmFDH is a dimer and each subunit has 364 residues folded into two distinct but structurally homologous domains, each comprising a parallel β-sheet core surrounded by α-helices arranged in a Rossmann-type fold. One domain is functionally defined by co-enzyme binding, the other by possessing the residues necessary for catalysis; coenzyme and substrate are encapsulated in the inter-domain cleft during the catalytic reaction. The molecule dimerizes by twofold symmetrical interactions between the co-enzyme-binding domains while the catalytic domains are distal to the dimer interface [3].
FDH catalyses the conversion of formate and NAD+ to carbon dioxide and NADH and thus has considerable potential in enzymatic synthesis. Its usefulness lies in the fact that it is capable of regenerating NADH in enzymatic reductions of aldehydes and ketones to form chirally pure alcohols that are valuable as building blocks in the synthesis of more elaborate structures [4]. While there has been much empirical work on stabilizing FDHs against elevated temperatures and other environmental factors such as oxidation [5], the thermodynamic and kinetic properties of its folding and unfolding pathway have not been dissected in detail. In this paper, we define the rates of steps in the minimal model of folding and assembly reaction and deduce the equilibrium properties of the system with respect to its thermal and denaturant sensitivities. These results should act as a basis for understanding the effects of site-specific engineering or forced evolution on the stability of this molecule.
2 Materials and methods
2.1 Protein purification
Wild-type cmFDH was prepared and purified as described previously [6]. SDS–PAGE and Coomassie Brilliant Blue staining showed the protein to be >95% pure after purification. Protein concentration was estimated at 280 nm from its specific extinction coefficient, 49 170 cm−1 M−1, and expressed in terms of the monomeric protein concentration using monomer mass of 42 000 Da.
2.2 Equilibrium unfolding assay of cmFDH
Unfolding experiments were performed by using 0.1 and 1 μM protein. Protein samples were incubated in 20 mM Tris, pH 8 at a series of GdnHCl concentrations at 25 °C in previous experiments. According to results of preliminary experiments unfolding studies were carried on with 0.1 μM protein, samples were incubated at different temperatures (25, 30, 35, 40, 45 °C) for 4 h. The protein fluorescence intensity of each sample was measured between 300 and 400 nm using an excitation wavelength of 285 nm in a Spex Fluoromax spectrofluorometer. Maximum emission was obtained at 350 nm. For each sample three scans were averaged for final spectrum. Denaturation curves were analysed by assuming a two-state mechanism. Data were fitted Eq. (1) to calculate Kw by fixing the m at the mean value for all transitions with Grafit 5.
K=Kw·exp(m·D)((1))
where Kw is the D/M in water, sigD is the signal for folded state, sigM is the signal for unfolded state and Mo is the monomer concentration as a constant. For the analysis of equilibrium data, guanidinium hydrochloride concentration ([GdnHCl]) is converted to molar denaturant activity (D), according to the following relationship:
D = ([GdnHCl] · C0.5)/([GdnHCl] + C0.5), where C0.5 is a denaturation constant with a value of 7.5 M [7].
The change in free energy between the folded and unfolded states in the absence of denaturant, ΔGw, was calculated for each temperature (25, 30, 35, 40, 45 °C), using the ΔG = −RT ln Kw relationship. For the free-energy change associated with a particular transition with temperature, data were fitted to the following equation:
ΔG=ΔH+ΔCp·(T‐To)‐T·ΔS‐ΔCp·T·ln(T/To),((2))
where ΔH(To), ΔS(To) and ΔCp are the enthalpy, entropy and heat capacity changes, respectively, at a defined reference temperature (To). All data were fitted using the Grafit 5 analysis software (Erithracus Software, UK).
2.3 Stopped flow unfolding experiments of cmFDH
The rate constant for unfolding at a particular concentration of GdnHCl was measured using stopped flow fluorometer (Applied Photophysics (05-109) pbp Spectra Kinetic Monochromator). One micromolar protein is jumped into 4, 4.5, 5, 5.5 and 6 M GdnHCl concentrations and fluorescence decay was measured against time. Protein was excited by monochromatic light at 285 nm and the resulting fluorescence was selected with a WG320 cut-off filter. Each recorded transient was an average of at least three individual reactions. Reactions were recorded at 25 °C. Data were fitted to single exponential decay pattern. The rate constants were plotted as a function of denaturant activity to extract the rate constant for unfolding in the absence of denaturant (ku(w)) according to the following equation:
Ku=ku(w)·exp(‐m·D)((3))
2.4 Refolding activity assay
cmFDH enzyme was unfolded in the unfolding buffer containing 4 M GdnHCl and 20 mM DTT in 20 mM Tris pH 8, at room temperature for 2 h. Refolding assay was initiated by the dilution of 16.6 μM unfolded FDH to 0.1, 0.2, 0.3, 0.5 and 1 μM concentration in the refolding assay buffer containing 20 mM Tris pH 8, 5 mM DTT, 20 mM formate, 1 mM nicotinamide adenine dinucleotide (NAD). The production of NADH was monitored by following the increase in absorption at 340 nm spectrophotometrically to assess the refolding yield of different concentrations of FDH. Measurements were performed by using UV/VIS Spectrometer Lambda 2 for 60 min in 1 ml cuvette. For numerical analysis of the refolding curves data were globally fitted using a numerical method [8] to the system 2U → 2M → D.
2.5 Thermal denaturation
The proteins (wild-type and mutant FDHs) were aliquot and incubated at different temperatures with 2 °C intervals for 20 min as triplicate. Remaining activity measurements were performed at 25 °C in a reaction mixture containing 20 mM Tris Buffer at pH 8, 1 mM NAD+, 2 mM formate and 0.4 μM enzyme. Rate constants (k) were derived directly from the residual activity (A) by the following relationship:
A=Ao·exp(‐k·tinc)((4))
where tinc is the period of incubation (in our case 1200s) and Ao is the initial activity. The free-energy barrier for unfolding as a function of temperature is described by the relationship:
k=ko·exp(‐ΔG/(R·T))((5))
where ΔG = ΔH + ΔCp · (T − To) − T · ΔS − ΔCp · T · ln(T/To) and ko is the rate of an unfolding reaction in the absence of an activation barrier (a value of 107 s−1 was taken for our analysis, see Section 3).
3 Results and discussion
3.1 Equilibrium unfolding
To measure the free energy of folding of this dimeric FDH, we analysed the guanidinium-induced denaturation of the enzyme at equilibrium. Upon unfolding there is a large (∼70%) quench is the native tryptophan fluorescence and the transition to the unfolded state occurs in a single step, without intermediates, as shown in Fig. 1 . As denaturation of the dimer requires dissociation of the component subunits, this transition must be dependent upon protein concentration. To confirm this we collected equilibrium unfolding curves at two protein concentrations an order of magnitude apart and found that at the higher concentration the midpoint of the curve was shifted to higher concentrations of denaturant (Fig. 1). The data have been fitted to Eq. (1) for dimer dissociation. Owing to the equation accounting for the effect of protein concentration on the denaturation transition, both of the fit-lines produce the same value for the apparent association constant (Ka), i.e. about 1013 M−1. Note that this represents the equilibrium: 1 dimer = 2 unfolded monomers, so includes both folding and dissociation processes.
Equilibrium unfolding curves. Normalized equilibrium unfolding curves of 1 μM (open circles) and 0.1 μM (closed circles) of cmFDH. Fluorescence intensity of five tryptophan residue of cmFDH was measured at increasing concentrations of the guanidinium hydrochloride ([GdnHCl] at 25 °C. Data were calculated according to equation 1 as a function of GdnHCl denaturant activity. For the 1 μM, Kw = 1.4 (±0.6) × 1013 M−1, m = −17.7 ± 1.0 and for the 0.1 μM, Kw = 1.5 (±0.8) × 1013 M−1, m = −17.2 ± 1.2. i.e. irrespective of concentration the curves fit with the same parameters.
3.2 Thermodynamics of the folding–unfolding transition
The above experiment was performed at a series of temperatures in order to extract the thermal dependence of equilibrium unfolding. The plot in Fig. 2 shows the free-energy change (i.e. −RT · ln Ka) for this transition as a function of temperature fitted to the Gibbs–Helmholtz equation (Eq. (2)). The data shows that, at the reference temperature of 25 °C the enthalpy change on folding (ΔH) is unfavourable (approximately +27 ± 18 kcal/mol), while the entropy change is favourable (−TΔS = −46 ± 18 kcal/mol). The heat capacity change (ΔCp) is −10.5 ± 1.8 kcal/mol/K; a value that is dominated by the degree of desolvation of non-polar surface during the folding process. Hence, ΔCp is dependent on the chain-length of the protein and can be empirically estimated by the formula [9]:
ΔCp=N×‐0.015+0.13kcal/mol/K
The enzyme unfolds in a single transition involving two subunits of 364 residues hence the expected value is −10.8 kcal/mol/K, extremely close to the value of −10.5 derived from the data shown in Fig. 2.
Thermodynamics of unfolding of cmFDH. The free-energy changes of unfolding in water (ΔGw) over a range of temperatures (T) were determined from equilibrium denaturation data and fitted to Eq. (2), at a reference temperature of To = 298 K. In this experiments 0.1 μM cmFDH protein was used. ΔH, ΔCp and ΔS values of cmFDH were calculated as +27 ± 18 kcal/mol, −10.5 ± 1.8 kcal/mol/K and −46 ± 18 kcal/mol), respectively.
3.3 Kinetics of folding and unfolding
The rate constant for unfolding in the absence of denaturant can be estimated from the data shown in Fig. 3 . Extrapolation to zero denaturant yield a value for the rate constant of 3.4 (±1.5) × 10−7 s−1 and the Δm-value for the linear free-energy relationship, which relates to the change in solvent exposure between the folded dimeric ground state and the transition state for unfolding, is 4.6 (±) 0.1.
Unfolding kinetics of cmFDH. FDH was unfolded in a stopped flow fluorimeter and the rate constant recorded by analysis of the decrease in fluorescence. The plot shows the variation with denaturant activity fitted to the equation described in Section 2.
To analyse the folding kinetics, we measured the rate of regain of enzymatic activity from the fully unfolded state as a function of time (Fig. 4 ). The reactivation curves have a distinct lag phase followed by the acquisition of catalytic activity, showing that refolding process is rate-limited by at least two steps occurring on the same time scale. Further, one of these steps must be multi-molecular since the half-time of refolding is clearly sensitive to the protein concentration. At zero time the tangents to the refolding curves are zero, showing that the intermediate state (or states) lying between the two rate-limiting steps is devoid of activity. In any dimeric system, at least two processes must occur before the individual unfolded chains can form the active state, i.e. folding and association [10]. The simplest model that accounts for the data is an essentially irreversible uni-bi reaction: 2U → 2M → D, where U is the unfolded chain, M the folded monomer, and D is the active dimer.
Refolding of cmFDH followed by regain of activity. Activity was measured as a function of time for refolding reactions measured at a series of protein concentrations. The data were globally fitted using a numerical method to the system 2U → 2 M → D Fitted rate constants were 1.6 × 10−3 s−1 and 1.9 × 104 M−1 s−1.
Using numerical methods we globally, i.e. simultaneously, fitted the progress curves to the above kinetic model which yielded values of 1.9 (±0.4) × 10−3 s−1 for the unimolecular folding step and 1.6 (±0.5) × 104 M−1 s−1 for the bimolecular association of subunits. Results show a close agreement between model and data over a wide concentration range.
The monomeric intermediate of cmFDH forms active dimers at a rate which is slower than expected for a process limited by subunit diffusion in solution. Considerations based on orientational constraints and Brownian diffusion for protein associations have suggested that the basal rate should lie in the range of 105–106 M−1 s−1[11–13]. Electrostatic attractions can enhance this rate significantly, as shown experimentally in the formation of the barnase:barstar complex [14] and the colicin:Im9 interaction [15]. In addition such charge effects have been predicted with reasonable accuracy by models developed by Alsallaq and Zhou [16]. In these studies association rates greater than 109 M−1 s−1 were observed, which accords with the value for dimerization of the apo form of superoxide dismutase (see Table 1 ) where the second-order rate constant is 2 × 109 M−1 s−1. Comparisons with other large proteins that fold through a unimolecular/bimolecular mechanism and for which all steps have been defined are shown in Table 1[17–20].
Table Table 1. Comparison equilibrium and kinetic parameters of some large proteins that fold through a unimolecular/bimolecular mechanism
At the level of simple reaction kinetics, there are at least three explanations that can account for the apparently slow association rates of the formate dehydrogenase monomers. Firstly, and most straightforwardly, the bimolecular rate might be genuinely 104 M−1 s−1 owing to some unusually stringent orientational requirement for securing a productive interaction or owing to charge repulsion between monomers. Secondly, the monomer may be an ensemble of conformations in rapid equilibrium with each other. If only a minority population can dimerize then the apparent bimolecular rate will appear to be slower than the real value. For instance if only 1/10th of the population were competent at any one time, then the apparent second-order rate constant would be 100-fold slower than the real bimolecular constant, i.e. a real rate constant of 106 M−1 s−1 would appear to be only 104 M−1 s−1. Thirdly, and least intuitively, an unstable dimeric state might be formed with a rapid bimolecular constant, but be unstable at the experimentally accessible concentration of subunits. If there is a rate-limiting unimolecular isomerization of this encounter complex to form the active dimer, then the progress curve will appear second-order because of the depletion of monomers as the reaction progresses. For instance, if the bimolecular rate constant were 106 M−1 s−1 and the dissociation rate 101 s−1, the Kd for this rapid equilibrium would be 10−5 M. With a unimolecular rate constant for isomerization of 10−1 s−1 and a concentration of 10−6 M monomer (i.e. the proportion of dimer in the rapid equilibrium is 1/10th), then the renaturation reaction would be closely approximate to a second-order process with an apparent bimolecular rate constant of 104 M−1 s−1.
The relatively slow bimolecular rate constant is combined with a slow rate of dissociation to yield a high stability for the native dimer (ΔGo = −14.6 kcal/mol), in keeping with other dimeric proteins of high molecular weight. This level of stability (Kd ∼ 10−13 M) means that at a micromolar concentration, less than one millionth of the enzyme is in the inactive and unstable monomeric state. It is interesting to note that the rate constant for FDH dissociation (Fig. 3) is so slow that, once formed, the dimer has a dissociative half-life of one and a half years.
3.4 Construction of a quantitative mechanism
For the simplest model there are two steps; a unimolecular folding process defined by k1 and k−1 in the forward (U → M) and reverse direction respectively and a bimolecular process defined by k2 and k−2. The overall equilibrium constant Ka defines the distribution D/U2, hence is the product of the two steps. However, since it requires two unfolded chains to form the dimer then:
Ka=(k1/k‐1)2·k2/k‐2((6))
Making the assumption that the rate-limiting step in denaturation is dissociation (k−2) then the only value in this system that is not experimentally defined is k−1. This value can be deduced as follows:
k‐1=((k12·k2)/(Ka·k‐2))0.5((7))
Given values of 1.5 × 1013 M−1 for Ka, 1.9 × 10−3 s−1 for k1, 1.6 × 104 M−1 s−1 for k2 and 3.4 × 10−7 s−1 for k−2, this relationship provides a value for the M → U step of about 1 × 10−4 s−1. In this system M has no enzymatic activity and its stability is marginal, the free-energy change for this step being only 1.7 kcal/mol.
3.5 A simplified analytical method for approximating folding and assembly rates
The formal analytical equation for fitting a uni–bi kinetic systems is extremely unwieldy, therefore to extract the values for rate constants in the renaturation reaction we have used numerical fitting procedures. The numerical fitting technique employed to extract the forward rate constants in the renaturation process required good time-resolution in the data and a simultaneous analysis of the progress curves. However, in the analysis of such data, it is interesting to note that summing the individual half-lives for a bi-molecular step (t1/2 = 1/(kbi × Mo) and a unimolecular step (t1/2 = ln 2 × kuni) provides a tolerably close approximation to the overall half-time of refolding. If we use the approximation that the measured t1/2 = 1/(kbi × Mo) + ln 2 × kuni then the data sets shown in Fig. 4 can be re-analysed in terms of t1/2 versus total monomer concentration. Such a plot is shown in Fig. 5 and the fitting procedure yields a value for kbi of 2.4 × 104 M−1 s−1 and a value for kuni of 1.6 × 10−3 s−1; remarkably close to the values arrived at from global numerical fitting, i.e. 1.6 × 104 M−1 s−1 and 1.9 × 10−3 s−1, respectively. The kuni measured in such reactions might be a compound of two or more unimolecular steps in a more complex mechanism; in this case 1/kuni = τuni = τ1 + τ2 +⋯τn.
The relationship between the half-time of refolding and reactant concentration. The refolding half-times were fitted to the equation; t1/2 = 1/(kbi × Mo) + ln 2 × kuni. The shown fit yields values of 1.6 × 10−3 s−1 and 2.4 × 104 M−1 s−1, respectively, for the unimolecular folding and bimolecular association rate constants.
3.6 Thermal denaturation
Shown in Fig. 6 A are denaturation data derived from an experiment in which the enzyme was incubated at a series of temperatures for a period of 20 min and the residual activity recorded. The heat inactivation of cmFDH, as for most proteins, is not reversible and follows a first-order decay [21–23]. As a result of this, denaturation cannot be formally treated as an equilibrium system to which the orthodox analysis can be applied, rather it should be thought of as being defined by a temperature-sensitive rate constant for the irreversible step. These data can be treated as representing rate constants of denaturation as a function of temperature, i.e. the rate constant (k) can be derived directly from the residual activity (A) by the following relationship:
A=Ao·exp(‐k·tinc)((8))
where tinc is the period of incubation (in our case 1200s) and Ao is the initial activity. The variation of rate constant with temperature can be described by the Arrhenius relationship: k = ko · exp(Ea/RT) where Ea is the activation enthalpy. When the data in Fig. 6A are fitted to this relationship we obtain a value for ko of 2 × 1057 s−1 and for Ea a value of 91 kcal/mol.
Heat inactivation of cmFDH. (A) The thermal denaturation of cmFDH was measured by incubating the enzyme at varying temperatures for 20 min. After this period the residual activity was recorded. The method for fitting the residual activity data is described in Section 2. The plot shown in (B) shows the deduced ΔG for activation against the temperature.
However, in the knowledge that protein folding involves large changes in heat capacity, it is more realistic to fit the data to the integrated form of the van't Hoff relationship where the free-energy barrier for unfolding is related to temperature by the equation:
k=ko·exp(‐ΔG/(R·T))((9))
where ΔG is the free energy of activation and ko is the rate of an unfolding reaction in the absence of an activation barrier. For the purposes of this analysis, we take a value of 107 s−1 as the upper rate of rearrangements involved in protein folding or unfolding [24].
In Fig. 6B the deduced ΔG for activation is plotted against the temperature. The relationship between the height of the free-energy barrier and the temperature defines the changes in heat capacity, enthalpy and entropy between the folded state and the transition state according to Eq. (2). Combining the above equations allows a direct fit of residual activity against ΔH, ΔCp and ΔS as shown by the fit line in Fig. 6A which is superimposed on the original data. This analysis gives an enthalpic barrier of 25 kcal/mol, a favourable entropic term (−5.5 kcal/mol at 298 K) and a heat capacity change of 2.0 kcal/mol/K. [Measurements of unfolding rate constants as a function of temperature yielded similar values for ΔH, ΔCp and ΔS i.e. 31 kcal/mol, 1.6 kcal/mol/K. and −11 kcal/mol at 298 K (see Supplementary Fig. S1)].
Interestingly, these values are comparable to those for the barrier to the unfolding of phosphoglycerate kinase (N-domain) (N-PGK) which shows a favourable entropy term (−6.3 kcal/mol −298 K) and an unfavourable enthalpy (+18.4 kcal/mol) [24]. A further similarity can be seen in the heat capacity change in attaining the transition state in unfolding–folding which in the case of FDH and N-PGK is about 20% of the ΔCp for complete unfolding, i.e. the transition state is about 80% folded as measured by solvent exposure. It is also intriguing to note that for chemical denaturation of FDH, according to the Δm-value for the transition state (∼4.5, Fig. 3) and the overall Δm value for unfolding (∼17.5, Fig. 1), the transition state is about 75% folded. Indeed, using the thermodynamic parameters derived from the rates of thermal denaturation, we would predict the rate of unfolding at 25 °C to be 6 × 10−7 s−1, a value remarkably close to 3 × 10−7 s−1 derived from orthodox denaturant analysis.
Acknowledgements
This work was partially supported by the Turkish State Planning Organization's Advanced Technologies Program and Turkish State Planning Organization (Project No. 90188).