S. pneumoniae phosphomevalonate kinase (PMK, PMVK, MvaK2; EC 220.127.116.11; 335 residues; predicted molecular mass = 37026.5 Da) is a cytoplasmic enzyme in the mevalonate-dependent isoprenoid biosynthesis pathway.1, 2 The enzyme catalyzes transfer of the γ-phosphoryl group of ATP to the phosphate oxygen of (R)-5-phosphomevalonate to form (R)-5-pyrophosphomevalonate.3 PMK is a member of the GHMP kinase superfamily [named for galactokinase (GK), homoserine kinase (HSK), mevalonate kinase (MK), and PMK], which includes at least 19 30%-identical clusters of proteins with various enzymatic activities. Previously determined structures of two GHMP kinases, Methanococcus jannaschii HSK (PDB ID 1FWK) and Saccharomyces cerevisiae mevalonate-5-diphosphate decarboxylase (MDD; PDB ID 1FI4), produced low-accuracy homology models for a number of PMKs.4 Phylogenetic analyses of nearly 200 superfamily members suggested that an experimental structure of at least one PMK would be required to produce more accurate models for members of this GHMP kinase subfamily. We cloned and expressed two PMKs: one from Staphylococcus aureus and one from S. pneumoniae. S. aureus PMK proved to be largely insoluble under standard purification conditions, whereas the S. pneumoniae homolog emerged as the better structure determination target.
PMK, like HSK and MDD, is an α/β protein (dimensions 59 Å × 53 Å × 48 Å) composed of two domains. The linear arrangement of the secondary structural elements is β1, 310, β2, β3, αA, β4, αB, αC, αD, αE, β5, αF, αG, β6, β7, αH, αI, αJ, αK, β8, β9, αL, and β10 (Fig. 1). The N-terminal domain contains a six-stranded β-sheet of mixed polarity (antiparallel β6-β5-β2-β1-β4 followed by β3 parallel to β4) arching over an α-helical region (αC, αE, αA, αD, αG, and αF) (Fig. 2). β1 (11 residues) and β2 (17 residues containing a β-corner) further subdivide the six-stranded β-sheet into two four-stranded β-sheets: one antiparallel (β6-β5→N-terminal portion of β2→C-terminal portion of β1) and the other of mixed polarity (antiparallel N-terminal portion of β1→C-terminal portion of β2→β4 followed by β3 parallel to β4). The C-terminal domain is composed of an antiparallel four-stranded β-sheet (β8-β9-β7-β10) flanked by αJ, αK, and αL. Visual inspection of the surface of PMK revealed a deep cleft between two β-sheets (β6-β5-β2-β1 and β8-β9-β7-β10). This feature almost certainly represents the enzyme active site because it contains three conserved GHMP kinase motifs5 (I, II, and III).
Previously, the New York Structural Genomics Research Consortium (NYSGRC) reported homology modeling of various PMKs.4 With the X-ray structure of S. pneumoniae PMK, we can evaluate the accuracy of these modeling attempts (PMK versus an HSK-derived model gave a root-mean-square deviation (RMSD) = 2.8 Å; PMK versus an MDD-derived model gave an RMSD = 3.6 Å). It is not surprising that the PMK models proved to be of relatively low accuracy because they were both calculated with distantly related templates (sequence identity between PMK and HSK is 16%, between PMK and MDD 8%). In both cases, modeling was compromised by errors in the alignment of the modeled sequence with the template and by the presence of insertions and deletions.
Automated homology modeling with MODPIPE6 using PMK as a template yielded 250 protein models (model score > 0.7, model length > 170 residues) reflecting conservation of the GHMP kinase fold. High- or medium-accuracy models in the 50–100% and 30–50% sequence identity ranges were obtained for prokaryotic PMKs. Lower-accuracy models in the 20–30% sequence identity range represent other PMKs and some proteins annotated as MKs. Models with < 20% sequence identity include fungal PMKs, various MKs, HSKs, GKs, and MDDs, plus proteins of unknown function.
The X-ray structure of PMK provided a useful test of the GHMP kinase homology modeling. In addition, modeling with the PMK template vindicated earlier predictions concerning target selection for structural genomics. Further studies of PMK will be required to better understand the enzyme's mechanism of action, which should be facilitated by the X-ray structure.
The full-length gene encoding PMK was amplified by polymerase-chain reaction using the forward primer containing an NdeI restriction site (GTCGTCATATGATGATTGCTGTTAAAACTTGCGGAAAAC), a reverse primer containing a BamHI restriction site (GTCGTGGATCCTCACGATTTGTCGTCATGTCCTATC), and S. pneumoniae strain R6 DNA as template using standard protocols.7 The amplified insert was cloned into the corresponding sites of a modified pGEX-6P-1 plasmid. The resulting Se-Met-labeled fusion protein was expressed in E. coli BL21 and purified on glutathione and Sepharose Q resins following established procedures.8 Protein for crystallization (24 mg/mL) was dialyzed extensively against 20 mM HEPES pH 7.0, 100 mM KCl, and 3 mM DTT. Gel filtration indicated that PMK is monomeric in solution (data not shown). MALDI-MS confirmed the identity of the purified recombinant protein (measured mass = 37747.3 ± 30 Da, predicted mass = 37703.3 Da).
Diffraction-quality Se-Met PMK crystals (hexagonal rods, Table I) were obtained by hanging-drop vapor diffusion at 20°C against a reservoir containing 0.1 M HEPES pH 7.5, 0.8 M NaH2PO4, 0.8 M KH2PO4, and 30 mM unbuffered ATP. Crystals were cryoprotected by soaking in the mother liquor supplemented with 25% (v/v) glycerol for 2–3 min and immersion in liquid propane (see Table I for space group and unit cell dimensions).
Table I. Crystallographic Data and Refinement Statistics
PDB ID 1K47
Crystal characteristics and data collection statistics
Diffraction data were collected under standard cryogenic conditions on Beamline X25 (National Synchrotron Light Source, Brookhaven National Laboratory), processed and scaled by using Denzo/Scalepack.9. The structure of the Se-Met protein was determined with data from a single-wavelength anomalous diffraction experiment.10 Selenium positions (41/42) (7/monomer) were located with SnB11 and anomalous difference Fourier syntheses and refined by using MLPHARE12 (figure of merit of 0.22 at 2.4 Å resolution). Density modification, combined with sixfold noncrystallographic averaging, yielded an experimental electron density map suitable for model building with O.13 Residues 51–59, 330–335, and three residues from an N-terminal cloning artifact for each monomer were not visible in the electron density map and were omitted from refinement. The final model, consisting of 1932 of 2010 residues and 779 water molecules, was refined at 2.42 Å resolution to an R factor of 21.5% with an Rfree value of 25.7% (Table I).
We thank Dr. K. Rajashankar for help with data collection. This work was supported by NIGMS grant P50-GM62529 (S.K.B.) and NIH grant GM20276 (M.J.R.). S.K.B. is an Investigator in the Howard Hughes Medical Institute. S. pneumoniae PMK represents NYSGRC target T27.