We have determined the crystal structure of MTH938 (Fig. 1), a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mthe) genome (DNA bases 843,263–862,747),1 at 2.2 Å resolution by Se-Met multiwavelength anomalous diffraction (MAD) techniques. Se-Met labeled MTH938 crystallized with the symmetry of space group P41212 with one dimer per asymmetric unit. The dimensions of each monomer of 111 amino acid residues are about 26 × 30 × 32 Å3. A Dali search2 with this MTH938 structure found no significant structural similarity (highest Z-score of 2.7) with any existing protein. The crystal structure of MTH938 reveals a new tertiary fold consisting of three β-sheets and three α-helices (Fig. 1). There is a disulfide bond between residues Cys 5 and Cys 87 in each monomer. As Mthe is an anaerobic archaea and the cystine pair is not conserved in the amino acid sequence alignment (Fig. 1), the potential structural and functional significance of the disulfide bond is uncertain. It is interesting that the only eukaryotic homolog in this sequence cluster is an unnamed human protein, suggesting possible lateral gene transfer into the human genome.
Two larger sheets, one from each monomer, associate as a ten-strand mixed β-sheet [Fig. 2(a)] that forms the base of a cleft [Fig. 2(b)]. Molecular modeling and electrostatic potential calculations3 suggest that this cleft could potentially bind double-stranded nucleic acid with interacting elements from αA and the tip of β5 of either subunit of the MTH938 dimer. The dimer interface surface area of 262 Å2, however, corresponds to only about 5.5% of the surface area of a monomer. Dynamic light scattering and gel filtration chromatography also indicate that MTH938 is monomeric in solution. Further biochemical and structural investigations on this protein are in progress.
DNA from M. thermoautotrophicum, bases 843,263 to 862,747, section 74 of 148, was cloned into expression vector pET15b and transformed into Escherichia coli BL21-DE3 cells. The selenomethionine derivative of MTH938 was prepared following a published protocol.4 Purified Se-Met labeled MTH938 containing a 10 amino acid N-terminal linker with a hexa-His tag was concentrated to about 10 mg/ml in 20 mM Tris-HCl, pH 8.0, with 100 mM NaCl and 5 mM β-mercaptoethanol.
Crystals grown in hanging drops containing 20% PEG 3350, 0.2 M ammonium chloride, and 0.1 M sodium cacodylate at pH 6.2 were used for X-ray diffraction data collection. Diffraction intensity data (Table I) were collected using the Advanced Photon Source (APS) Beamline 14BM-D, Argonne National Laboratory, from a single frozen crystal (100 K) at three wavelengths. The wavelengths selected were the peak (λ1) and inflection (λ2) of the Se K-edge, and at a higher energy remote wavelength (λ3). The data were processed and scaled to 2.2 Å resolution using Denzo and Scalepack,5 respectively. The summary of X-ray data statistics is listed in Table I. Four Se sites, corresponding to two molecules per asymmetric unit, were located using direct methods as implemented in SnB 2.16 and MAD phases were calculated to 2.7 Å resolution based on the anomalous signal from the Se sites using SOLVE version 1.187 with a figure of merit (FOM) of 0.69. The MAD phases were further improved and extended to 2.2 Å resolution using RESOLVE version 1.047 and ARP V5.1.8 The model was built manually into electron density maps calculated using phases obtained from these procedures. Cycles of model building, using O version 6.19 followed by least squares refinement using CNS10 with bulk solvent correction, yielded the final structure that includes all 111 amino acids of MTH938, the last two amino acids of the N-terminual His-tag for both the molecules in the asymmetric unit, and 85 solvent water molecules. Amino acid residues 74, 78, and 80 in molecule A and 74 and 78 in molecule B were refined as alanines because of poor side-chain density. The final crystallographic R-factor and free R-factor (Table I) were 0.228 and 0.266, respectively, for 12,671 reflections (99.3%) between 20–2.2 Å resolution and |F| > 0.0. The refined atomic coordinates and both the unmerged and merged X-ray diffraction data have been deposited in the Protein Data Bank (PDB ID 1IHN).
Table I. Crystallographic Data and Refinement Statistics
PDB ID: 1IHN
Crystal characteristics and data collection statistics
Cell constants a = b = 63.63 Å, c = 116.80 Å; Space group P 41212
Contents of asymmetric unit: 2 MTH938 molecules; Z = 16 molecules/unit cell
Rmeas = ∑h [m/(m − 1)]1/2 ∑i |Ih,i − 〈Ih〉|/ ∑h ∑iIh,i, m is the multiplicity of each reflection, ∑h is taken over all unique reflections, and ∑i is taken over the set of independent observations of each unique reflection.12
Rcryst = ∑hkl |Fo(hkl) − Fc(hkl)|/ ∑hkl |Fo(hkl)|, where Fo and Fc are observed and calculated structure factors, respectively.
We thank G. Kornhaber and D. Zheng for helpful discussions, and APS BioCARS staff members for their support in data collection. MTH938 represents structure #8 from the Northeast Structural Genomics Consortium.