Kelly A. Mercier and John R. Cort contributed equally to this work.
Article
Structure and function of Pseudomonas aeruginosa protein PA1324 (21–170)
Article first published online: 11 JAN 2009
DOI: 10.1002/pro.62
Copyright © 2009 The Protein Society
Additional Information
How to Cite
Mercier, K. A., Cort, J. R., Kennedy, M. A., Lockert, E. E., Ni, S., Shortridge, M. D. and Powers, R. (2009), Structure and function of Pseudomonas aeruginosa protein PA1324 (21–170). Protein Science, 18: 606–618. doi: 10.1002/pro.62
Publication History
- Issue published online: 23 FEB 2009
- Article first published online: 11 JAN 2009
- Accepted manuscript online: 11 JAN 2009 12:00AM EST
- Manuscript Accepted: 8 DEC 2008
- Manuscript Revised: 3 DEC 2008
- Manuscript Received: 12 JUN 2008
Funded by
- Protein Structure Initiative of the National Institutes of Health. Grant Number: U54 GM074958
- Nebraska Tobacco Settlement Biomedical Research Development Funds, Nebraska EPSCoR, and an Eloise Gerry Fellowship from Sigma Delta Epsilon
- NIH. Grant Number: RR015468-01
Keywords:
- Pseudomonas aeruginosa PA1324;
- NMR;
- functional genomics;
- NMR high-throughput screens;
- protein-ligand binding;
- protein-ligand co-structures;
- structural biology;
- structural genomics;
- chemical proteomics;
- protein structure initiative;
- hypothetical proteins;
- FAST-NMR;
- Northeast Structural Genomics Consortium
Abstract
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
Pseudomonas aeruginosa is the prototypical biofilm-forming gram-negative opportunistic human pathogen. P. aeruginosa is causatively associated with nosocomial infections and with cystic fibrosis. Antibiotic resistance in some strains adds to the inherent difficulties that result from biofilm formation when treating P. aeruginosa infections. Transcriptional profiling studies suggest widespread changes in the proteome during quorum sensing and biofilm development. Many of the proteins found to be upregulated during these processes are poorly characterized from a functional standpoint. Here, we report the solution NMR structure of PA1324, a protein of unknown function identified in these studies, and provide a putative biological functional assignment based on the observed prealbumin-like fold and FAST-NMR ligand screening studies. PA1324 is postulated to be involved in the binding and transport of sugars or polysaccharides associated with the peptidoglycan matrix during biofilm formation.
Introduction
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
Genome sequencing projects conducted in the past decade have enabled many novel areas of research in biology, chemistry, and drug discovery. With more than 686 completed genomes and 1270 on-going projects, ∼6.5 million protein sequences have been identified, but ∼30–50% of these proteins are not functionally annotated.1 The protein structure initiative (PSI) was initiated to determine a representative structure for each protein in the proteome with an expectation that structural information will assist in solving the functional annotation problem.2 PSI protein selection is based on the goal of maximizing the coverage of structural space by choosing representative targets from protein families that lack a known structure.3 A single structure may serve as a model for the other member in the protein family.4
Since 2004, PSI has been shown to contribute 50% of the novel structures deposited in the protein database (PDB).5 This coincides with an increase in the number of proteins classified as unknown function, where there are currently 2755 such proteins in the PDB.6 A number of computational methods have been developed to aid in the annotation of proteins of unknown function.7 These methods use protein structures and a range of techniques to compare global folds, local structural features, or matches to 3D templates to assign putative biological functions.8 Although these computational approaches are extremely efficient in analyzing large data sets, the results for a specific protein may be ambiguous or absent.9, 10
Functional Annotation Screening Technology by NMR (FAST-NMR) provides experimental data for the functional annotation of novel proteins by combining ligand affinity screens and structural biology with bioinformatics.7, 11 Functional ligands that bind to the protein are identified using a tiered NMR ligand screen.12 The protein's functional epitope is determined by mapping the ligand induced chemical shift perturbations (CSPs) onto the surface of the protein. A co-structure is rapidly determined using this CSPs data,13 which is then used to drive the bioinformatics analysis. The Comparison of Protein Active Site Structures (CPASS) program and database compares the experimental ligand-defined active site determined from the FAST-NMR screen to all the ligand-defined active sites available in the PDB.7, 11, 14 A general biological function is assigned through the comparative similarity of the experimentally determined ligand binding site with a protein of known function.
The sequence of the Pseudomonas aeruginosa PA01 genome was completed in 2000, but as of January 2008 > 43% of the proteome lacks a functional annotation.15P. aeruginosa is one of the primary bacteria that forms biofilms16, 17 and causes serious problems with individuals with chronic lung problems, such as cystic fibrosis.17, 18 In the past decade, the prevalence of nosocomial infections has risen significantly, particularly in developing countries.19, 20 Biofilms are often the cause of these infections, and their formation is poorly understood.21 This situation is further exacerbated by the increasing rate of antibiotic resistance for P. aeruginosa, which has reached upwards of 40% for some strains. Resistance to imipenem, ciprofloxacin, and ceftazidine have all been observed in the past year.22 Therefore, developing new antibiotics against P. aeruginosa biofilms is of great importance, requiring the identification of new drug targets.23, 24
P. aeruginosa genes that are upregulated in biofilms may represent potential therapeutic targets. PA1324 is an example of such a gene that was identified by microarray analysis of gene expression associated with biofilms and quorum sensing in P. aeruginosa. PA1324 and PA1323 were identified in two expression profiling studies in association with quorum sensing: transcription of both genes appears to increase similarly in quorum sensing conditions.25, 26 One of these studies indicated that PA1324 and PA1323 expression was not repressed upon treatment of cells with furanone, a repressor of quorum sensing.25 A study of biofilm formatation found PA1324 to be significantly upregulated with expression levels 10–30 times higher versus planktonic cultures.27 PA1323 was upregulated as well. Another study of biofilm formation in P. aeruginosa included microarray data indicating that PA1324 is not regulated by the SadARS three-component regulatory system required for biofilm maturation.28 To better understand the role of PA1324 in biofilms, the structure and putative biological function of the protein was determined using NMR spectroscopy and FAST-NMR.
Results
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
PA1324 sequence analysis
PA1324 (21–170) is target PaP1 of the Northeast Structural Genomics Consortium (NESG; www.nesg.org)29, 30 and is currently listed as a hypothetical protein in KEGG31 and TIGR.32 PA1324 occurs in at least five strains of P. aeruginosa and has several homologues from a diverse assortment of proteobacteria, all with 26–28% sequence identity (Fig. 1). The proteins identified in the BLASTp sequence analysis as being most similar to PA1324 are hypothetical proteins and putative lipoproteins. Residues 1–18 in PA1324 are predicted to be a hydrophobic signal peptide by SOSUI.33 Thus, PA1324 was expressed with a 20 residue His-tag (GSSHHHHHHSSGLVPRGSHM) in place of the signal peptide plus two residues at the N-terminus, such that the numbering of residues 21–170 corresponds to the natural PA1324 sequence.
Figure 1. Multiple sequence alignment of PA1324 and six homologues. Gene names and NCBI RefSeq numbers are indicated by three letter abbreviations as follows: Pae, Pseudomonas aeruginosa PA1324, NP_250015; Csa, Chromohalobacter salexigens Csal_0658, YP_572718; Sde, Saccharophagus degradans Sde_0431, YP_525907; Dvu, Desulfovibrio vulgaris Dvul_1060, YP_966507; She, Shewanella sp. W3-18-1 Sputw3181_2461, YP_963839; Sul, Sulfurovum sp. NBC37-1 SUN_1567, YP_001358874.

The gene for PA1323 is in the same reading frame as PA1324 and terminates just 60 base pairs upstream from the start of PA1324, suggesting the two genes may form an operon. PA1323 is a conserved hypothetical protein (COG4575, Pfam05957, DUF883). On the same strand and upstream from PA1323 is the gene for PA1322, which is a putative TonB-dependent receptor-like transporter for catechols/ferric iron uptake (FIU). Interestingly, Psi-BLAST queries with the PA1324 sequence report low similarity hits with E scores of >0.2 to the N-terminal region of numerous TonB-dependent receptor-like proteins (though not any of the FIU class). The N-terminal 150–200 amino acids of many TonB receptors form a plug on the periplasmic side of the membrane that helps control transport through the barrel-like receptor-transporter. This may suggest a role for PA1324 and PA1323 in mediating the activity of PA1322; however, in the absence of structural information, the sequence similarity may be too low to be reliable.
PA1324 structure description and analysis
The solution structure of the globular portion (residues 21–170) of protein PA1324 from P. aeruginosa was determined by NMR. Table I summarizes key details of the structural ensemble. PA1324 forms an elongated barrel-like sandwich of two β-sheets, open on one side of the sandwich, and partially closed on the other, but not to the extent that the edge strands form hydrogen bonds to each other (Fig. 2). Around the barrel the strands occur in the order 7-1-4-3-2-5-6. A prominent loop of 37 residues between strands 2 and 3 (from residue 71 to 107) folds back upon the outside of the barrel and appears to form a pocket. Several polar residues occupy this pocket: E134, R91, G84, T153, Y79, S132, E70, R104. Near the N-terminal purification tag (residues 1–20), residues 27 to 41 form a loose region that contacts (NOEs observed) the prominent loop, but does not form a regular structure that is converged in the ensemble. The fold of the barrel-like sandwich is clearly that of the prealbumin-like fold (SCOP), which contains several superfamilies, including that of the thyroxine transport protein transthyretin (pre-albumin) and of a conserved accessory domain of membrane-type carboxypeptidases. One subfamily of the transthyretin superfamily is a starch-binding domain, and recent evidence from the Gene Ontology database suggests that PA1324 has a carbohydrate-binding domain.38
Figure 2. Structure of PA1324 21-170. Left: ribbon cartoon structure with rainbow coloring. The N- (blue) and C- (red) termini, the prominent loop (residues 71–107), and β-strands of the barrel (numbered 1–7) are indicated. Right: stereo view of backbone atoms for all 20 superimposed structures comprising the ensemble.

| Distance restraints | |||
|---|---|---|---|
| Total | 1494 | ||
| Intraresidue | 231 | ||
| Sequential | 457 | ||
| Medium range (1 < |i−j| < 5) | 189 | ||
| Long range | 617 | ||
| Hydrogen bonds (2 per H-bond) | 70 | ||
| Dihedral angle restraints | |||
| Total | 132 | ||
| Phi | 65 | ||
| Psi | 67 | ||
| Total restraints | 1696 | ||
| Restricting restraints per restrained residue (144 residues) | 11.8 | ||
| Restricting restraints, long range, per restrained residue (144 residues) | 4.7 | ||
| Average restraint violations per structure | |||
| Distance restraints (all > 0.0 Å) | 60.1 ± 5.1 | ||
| Maximum violation (Å) | 0.05 | ||
| Dihedral restraints (all > 0.00) | 6.3 ± 3.8 | ||
| Maximum violation (°) | 1.02 | ||
| RMSD to average structures (Å): | |||
| Residues 27–170 (144 residues) | |||
| Backbone atoms (N,C,C′) | 0.92 | ||
| All heavy atoms | 1.42 | ||
| Residues 42–70, 108–170 (92 residues)a | |||
| Backbone atoms (N,C,C′) | 0.66 | ||
| All heavy atoms | 1.08 | ||
| Residues 33–37, 43–57, 60–82, 86–113, 116–140, 144–159, 161–168b | |||
| Backbone atoms | 0.80 | ||
| All heavy atoms | 1.20 | ||
| Ramachandran plotc: | |||
| Protein residues | 27–170 inclusive (144) | Barrel only (92) | Ordered (120) |
| Most favored region (%) | 84.0 | 86.6 | 89.6 |
| Additional allowed region (%) | 14.3 | 12.6 | 10.3 |
| Generously allowed region (%) | 1.4 | 0.2 | 0.1 |
| Unallowed region (%) | 0.3 | 0.4 | 0.0 |
| Global quality scoresd: | Raw | Z-Score | |
| PROCHECK(all) | −0.46 | −2.72 | |
| PROCHECK(phi-psi) | −0.60 | −2.05 | |
| MolProbitye clash score | 18.41 | −1.63 | |
| RPF Scoresf | |||
| Recall | 0.84 | ||
| Precision | 0.77 | ||
| F-measure | 0.81 | ||
| DP-score | 0.62 | ||
Dali analysis of PA1324 revealed several proteins of modest structural similarity, with Z-scores of between 2 and 5.39 Despite sequence similarities less than 20%, these proteins clearly adopt the same fold as PA1324. The top protein identified was transthyretin. Other proteins identified have functions corresponding to sugar and polysaccharide-binding proteins, extracellular matrix and cell adhesion proteins and structural proteins. Because the Dali and primary sequence analysis were unsuccessful for definitive functional annotation, further investigation was needed to uncover a putative biological function of PA1324.
FAST-NMR results for PA1324
FAST-NMR was used to screen ligands for possible binding by PA1324 to leverage a functional hypothesis.11 FAST-NMR provides functional information for hypothetical proteins by experimentally identifying ligands and ligand binding sites. After ligands are identified and the binding site is mapped by CSPs, a co-structure is obtained in ∼30–45 min by using the CSPs to direct an AutoDock simulation where our ADF program selects the best conformer based on consistency with the magnitude of chemical shift changes.13 A functional hypothesis can be developed by comparing the ligand-defined active-site from FAST-NMR to a database of protein-ligand binding sites for proteins of known function using CPASS (Comparison of Protein Active-Site Structures).7, 11, 14 This information provides a starting point for further functional investigations, additional screening of related ligands, ligand modification or optimization, or even identification of drug leads.
FAST-NMR consists of a two-stage screening methodology. The primary 1D 1H line-broadening screen yielded 20 hits from the functional compound library (5.46% hit rate) that bind PA1324.12 Figure 3 shows a representative example of the typical results obtained from these line broadening experiments, where the increase of line width (decrease in peak intensity) indicates a binding event. Seven of these 20 compounds contain aryl-sulfonate, sulfonyl fluoride, or sulfone functional groups. This suggests potential chemical characteristics that may be present in the native ligand. Some of the compounds found to bind PA1324 in the 1D 1H line-broadening screen are shown in Figure 4.
Figure 3. A 1D 1H NMR line-broadening experiment showing an expanded reference mixture spectrum (A) and the mixture with PA1324 (B). The highlighted region indicates the observed line broadening and binding of suramin to PA1324.

Figure 4. A subset of the 20 compounds found to bind to PA1324 by 1D 1H NMR line-broadening experiments was Aryl or benzyl sulfonyl, and sulfonate moieties are present in 7 of the 20 compounds.

The 20 compounds were further evaluated for PA1324 binding with a secondary screen using the 2D 1H-15N HSQC experiment. CSPs were identified through the comparison of a reference 2D 1H-15N HSQC spectrum for free PA1324 to the spectrum in the presence of the compound. Suramin was identified as the primary lead compound based on the larger quantity and greater magnitude of CSPs observed relative to the other compounds [Fig. 5(A)]. Suramin is a known analogue of heparin, a highly negatively charged sulfated biopolymer from the glycosaminoglycan family of carbohydrates.41 Titration of suramin into a sample of PA1324 was conducted to determine the suramin binding site. 2D 1H-15N HSQC spectrum of the PA1324-surmain complex showed the backbone amide chemical shifts of residues N23, T38, L52, M76, K77, G84, L86, D101, Y130, T153, K154, and S159 were perturbed upon the addition of suramin. Interestingly, this putative suramin binding site resembles a heparin binding site because it contains regions of positively charged lysine and arginine residues.42, 43 Figure 5(B) shows an electrostatic potential surface of the binding site of suramin. A titration with a heterogeneous sample of heparin (Sigma Aldrich, St. Louis, MO) was also completed [Fig. 1(S)]. Heparin appears to bind predominately in the same cleft occupied by suramin.
Figure 5. Suramin binding to PA1324. (A) An expanded region of an overlay of the 2D 1H-15N HSQC spectrum of free PA1324 (black) and PA1324 in the presence of suramin (red). (B) A GRASP40 electrostatic surface of PA1324 demonstrating the positive electrostatic potential of the suramin binding site. Blue and red indicate positively charged and negatively charged surfaces, respectively. (C) A docked model of suramin in the PA1324 binding site identified by FAST-NMR. Residues that incurred a CSP upon binding suramin are colored blue. The distal CSPs could be long-range effects of the binding interaction or potential suramin conformational exchange.

Possible structures of PA1324 bound to suramin were modeled using the NMR-defined binding site to guide an AutoDock simulation.13 Figure 5(C) shows a co-structure of suramin in the FAST-NMR defined PA1324 binding site, which was compared with all other ligand-defined binding sites in the PDB using CPASS. The highest CPASS scoring comparisons (∼40–45% similarity) were from proteins involved in polysaccharide biosynthesis, the cellulosome (scavenger of plant cell-wall polysaccharides), transport, and DNA binding proteins (segregation proteins). Many of these proteins bind sugars and polysaccharides. A comparison of the FAST-NMR defined PA1324 binding site to the protein with the highest CPASS score, the spore coat polysaccharide biosynthesis protein SpsA from Bacillus subtilis (PDB ID 1H7L) bound to thymidine-5′-phosphate (TMP),44 shows many similar features and interactions (Fig. 6). SpsA is a glycosyltransferase, but although the PA1324 has a similar binding site to SpsA, the functional analysis by FAST-NMR does not indicate any enzymatic activity for PA1324. The 1D 1H NMR spectrum of suramin in the presence of PA1324 does not undergo any changes with time. The CPASS results do, however, support the DALI inference relating the putative biological function of PA1324 with general sugar binding proteins and transport.
Discussion
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
Functional analysis of PA1324
Chronic pulmonary infection with P. aeruginosa is the major cause of mortality among cystic fibrosis patients.45P. aeruginosa virulence results from its antibiotic resistance and the ability of P. aeruginosa to form biofilms. PA1324 was identified as being significantly upregulated in P. aeruginosa biofilms along with PA1323.46P. aeruginosa biofilms are controlled by known quorum-sensing systems47–49 that coordinate the regulation of gene transcription through the secretion of signaling molecules or autoinducers.50 The transcription of genes for both PA1323 and PA1324 have been demonstrated to increase similarly under quorum sensing conditions,25, 51 but are not affected by furanone, a repressor of quorum sensing.25 A number of environmental factors have also been shown to induce biofilm formation,52, 53 which includes anaerobic conditions,54 Fe limitation,55–57 high osmolarity,58 and high temperature.58 Interestingly, sub-inhibitory concentrations of some antibiotics were found to enhance biofilm formations.59 The addition of ethanol,60 oleic acid,61 glucose,62 and UDP-N-acetylglucosamine63 also stimulate biofilms.
An important component of biofilm formation is the production of an extracellular polysaccharide called polysaccharide intercellular adhesin (PIA) that mediates cell–cell adhesion.64 Bacteria within a biofilm are embedded in this hydrated extracellular matrix composed of β (1-6)-linked N-acetylglucosamine polymer.65, 66 Glucose is required for PIA production,67 where UDP-N-acetylglucosamine is the precursor of the biofilm's polysaccharide matrix.63 The addition of both glucose and UDP-N-acetylglucosamine stimulate PIA production and biofilm formation.62, 63 Interestingly, N-acetylglucosamine is also a repeating unit within heparin, which also has been shown to stimulate biofilm formation.68 Furthermore, heparin plays an important role in P. aeruginosa adherence to epithelial respiratory cells through an interaction with P. aeruginosa outer-membrane proteins.69
The tertiary structure of PA1324 determined by NMR indicates a barrel-like sandwich topology corresponding to a prealbumin fold. This fold contains a carbohydrate-binding superfamily, where the highest similarity from a Dali analysis identified a protein involved in transport. Other low-similarity proteins from the Dali analysis include sugar or heparin binding proteins and proteins involved in the extracellular matrix or cell adhesion. Recent evidence from the Gene Ontology database also suggests PA1324 is a carbohydrate-binding domain.38 The FAST-NMR screen found only one significant compound that bound to PA1324—suramin, a heparin analog. The suramin PA1324 binding site was compared with all ligand binding sites in the PDB using CPASS, where the highest CPASS scores were to proteins that interact with carbohydrates or carbohydrate-like molecules. Site-directed mutational studies for a number of carbohydrate binding domains (CBD) indicate some common features for carbohydrate affinity. Specifically, carbohydrate binding involves a combination of hydrogen-bond and aromatic interactions, where replacing conserved hydrogen bonding (D, N, E, Q, K, R) or aromatic residues (W, Y) significantly diminished or eliminated carbohydrate binding.70–72 These specific amino acid interactions have been corroborated by NMR and x-ray structures of CBD bound to carbohydrates.73–75 Importantly, the experimentally identified PA1324 suramin binding site contains these typical carbohydrate binding residues (D, N, K, Y). The ability of PA1324 to bind carbohydrates was further confirmed by the observation that PA1324 binds heparin in a manner similar to suramin.
The binding affinity of carbohydrates to CDBs tend to range from low micromolar to 100s of micromolar, where the magnitude of the dissociation constant (KD) is strongly dependent on the carbohydrate.71–78 For example, the affinity of cyanovirin-N to Man-2 and Man-9 were determined to be 757 ± 80 μM and 3.4 ± 0.05 μM, respectively.78 Similarly, the human and mouse galectin-9 N-terminal carbohydrate recognition domains were shown to bind 42 different carbohydrates with KDs ranging from 0.3 μM to 109 μM.75 The PA1324 NMR chemical shift titration studies provides a KD estimate of 152 ± 18 μM and 51 ± 14 μM for suramin and heparin, respectively. This is consistent with other CBDs, especially considering that suramin is an unoptimized structural homolog of the natural ligand and heparin is a heterogeneous mixture.
The gene for PA1322, a putative TonB-dependent receptor-like transporter,79 is on the same strand and upstream from PA1323 and PA1324. P. aeruginosa TonB1 was shown to be essential in both biofilm formation and quorum sensing.80 This activity is separate from its known role in iron uptake. Furthermore, a TonB-dependent receptor was identified that transports sucrose in the phytopathogenic bacterium Xanthomonas campestris pv. campestris and is conserved in other bacteria.81 Similarly, a TonB-dependent receptor that transport maltodextrins across the outer membrane were identified in Caulobacter crescentus.82 These observations suggest that TonB-dependent receptors transport other nutrients besides iron and vitamin B12 and may play a role in PIA production.83 This is especially relevant because heparin and UDP-N-acetylglucosamine stimulate PIA production and biofilm formation, where UDP-N-acetylglucosamine is also a PIA precursor.
The genetic organization of TonB-dependent receptors implies that functionally related proteins (σ and anti-σ) are neighboring genes,79 which infers an association between PA1322, PA1323, and PA1324. PA1323 is predicted by Phobius84 to contain at least one transmembrane helix and exhibits sequence similarity to other putative membrane proteins. The TonB dependent regulatory systems consist of a number of membrane bound proteins (anti-σ, TonB, ExbB, ExbD) suggesting a potential biological role for PA1323. PA1324 has a predicted signal peptide suggesting transport to the membrane. Also, PA1324 has shown low-sequence similarity (E scores of >0.2) to the N-terminal region of TonB-dependent receptor-like proteins, which forms a plug that controls substrate transport through the barrel-like receptor-transporter. Deleting the plug domain inactivates substrate transport where activity is regenerated by cosynthesis of the separate plug and β-barrel domains.85 The plug structure consists of a twisted β-sheet with loops and helices connecting the strands, where the substrate binds the outer surface of the plug.86 Interestingly, an isolated plug domain does not adopt a folded structure.87 Although there appears to be some gross similarities between the PA1324 NMR structure and the general description of a plug domain, Dali did not predict any similarity between the two structures. Also, a sequence alignment of PA1322 with other known TonB-dependent receptor proteins implies an intact N-terminus in PA1322. Thus, does PA1324 represent an alternative mechanism of regulating PA1322, the putative TonB-dependent receptor-like transporter by mimicking a folded plug domain complexed to a substrate?
Combining the sequence and structural analysis with the functional screening, we suggest a role for PA1324 in carbohydrate binding or transport together with PA1323, the protein predicted to co-express with PA1324. PA1324 and PA1323 might be involved in the polysaccharide secretion or scavenging process required for PIA formation in P. aeruginosa biofilms. This biological function may also involve the regulation of PA1322, a putative TonB-dependent receptor-like protein that may transport carbohydrates across the P. aeruginosa membrane.
Materials and Methods
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
Sample preparation
Chemicals were purchased from Sigma (St. Louis, MO). Stable isotope-labeled compounds were purchased from Cambridge Isotope Laboratories (Andover, MA). Genomic DNA from P. aeruginosa strain PA01 was obtained from the American Type Culture Collection (Manassas, VA). Primers were designed for PCR amplification of the PA1324 21-170 gene fragment, so that residues 1–20 (MLSIKKNLGLLAMTAALAAC) would be absent in the expressed protein. Residues 1–18 are predicted to be a hydrophobic signal peptide by SOSUI.33 Thus, the 170 residue expressed protein is PA1324 21–170 with a 20 residue His-tag (GSSHHHHHHSSGLVPRGSHM) in place of the signal peptide sequence at the N-terminus. A M21A mutation occurs as a cloning artifact at the insertion point in the expression vector, pET28b. Escherichia coli [Rosetta BL21(DE3)] cells were transformed by heat shock with this vector and plated on LB agar plates containing 30 μg/mL kanamycin. Single colonies from the plate were used to inoculate 4 mL overnight seed cultures in M9 minimal medium made with U-15N NH4Cl (1 g/L) and U-13C glucose (2 g/L) and supplemented with 7.5 μM FeCl3, 10 μM ZnSO4, and 10 μM MnSO4; seed cultures were shaken overnight at 37°C, then used to inoculate 50 mL starter cultures; these were grown to OD600 0.7 and diluted into 0.5 L media in 2L glass flasks. When the cultures reached OD600 0.6–0.8, expression of PA1324 was induced with 1.1 mM IPTG, the flasks were moved to a 25°C shaking incubator for overnight growth. After 15 h, cells were pelleted with gentle centrifugation and resuspended in 25 mL lysis buffer (50 mM sodium phosphate, 500 mM NaCl, 10 mM imidazole, pH 8.0). Cells were lysed by passage four times through a French Press. After insoluble material was removed by centrifugation, protamine sulfate (12.5 mg) was added to precipitate nucleic acids, and the supernatant was ultracentrifuged for 1 h. The His-tagged protein was loaded onto a gravity column containing 10 mL Ni-NTA affinity resin (Qiagen, Valencia, CA) equilibrated with lysis buffer, washed with 50 mL lysis buffer, and removed with elution buffer (50 mM phosphate, 500 mM NaCl, 400 mM imidazole). Protein in elution buffer was exchanged into NMR buffer (20 mM BisTris Propane, 300 mM NaCl pH 6.8) on a PD10 column. The yield of protein was >100 mg/L of culture. For NMR samples, 7% (v/v) D2O was added, and 250 μL of sample was put into a Shigemi NMR tube. In addition, a U-15N and 5% biosynthetically directed fractionally 13C-labeled sample was generated for stereospecific assignment of isopropyl methyl groups in leucine and valine residues by growing cells as described above but with 5% of the carbon as U-13C-glucose and 95% as unenriched glucose.88
NMR structure determination
NMR experiments were conducted at 298 K using standard triple-resonance experiments.89, 90 NMR experiments were conducted at the Environmental Molecular Sciences Laboratory of Pacific Northwest National Laboratory, Richland, WA. Experiments were collected on Varian Inova 600, 750, and 800 instruments. The following experiments were recorded: 1H-15N-HSQC, 1H-13C-HSQC, HNCO, HNCA, HNCOCA, HNCACB, CBCACONNH, CBCACOCAHA, CCC-TOCSY-NNH, HCC-TOCSY-NNH, HCCH-TOCSY, HCCH-COSY, 3D-15N-NOESY (100 ms mixing time), 4D-13C-13C-HMQC-NOESY-HMQC (80 ms mixing time), two 3D-13C-NOESYs (80 ms mixing time) optimized for either aliphatic or aromatic carbons, HNHA, and aromatic ring side chain correlation experiments (HBCBCG-CDHD/-CEHE). All pulse sequences were from the BioPack library (Varian) except 4D-13C-13C-HMQC-NOESY-HMQC, and HBCBCG-CDHD/-CEHE-aro, which were from Lewis Kay (University of Toronto). Amide proton exchange was monitored by acquiring 1H-15N-HSQC spectra following dissolution of lyophilized protein in D2O. Stereospecific Leu and Val side chain assignments were obtained from a 1H-13C-HSQC experiment recorded on the 5% 13C-sample.88 Chemical shift assignments were deposited to BioMagResBank (bmrb-id 6343).
NOESY peak assignments and preliminary structure ensembles were determined with AutoStructure using peak-picked NOESY data and chemical shift assignments as input.37 Manual inspection of the data clearly indicated the correct fold was calculated, and manual refinement of the preliminary ensemble and error-checking of the restraint lists was conducted using XPLOR-NIH, using standard sa.inp and dgsa.inp protocols modified to use 20,000 high temperature steps at 2000 K, followed by 200,000 cooling steps (3 fs steps). Sum averaging of NOE restraints was used. Final refinement in explicit water solvent models with Lennard-Jones and electrostatic potentials included was accomplished with CNS using a modification of the procedure and forcefield of Nilges.91 NOE distance restraints had uniform lower bounds of 1.8 Å and upper bounds of either 2.8, 3.0, 3.5, 4.0, 4.5, or 5.0 Å. Hydrogen bond restraints were derived from amide proton D2O exchange data. Amide 1H-15N-HSQC cross peaks still present 30 min. after dissolution of a lyophilized sample in D2O were given bounds of 1.8–2.5 Å for the HN-O distance and 2.8–3.5 Å for the N-O distance, provided preliminary structural ensembles clearly indicated the probable acceptor atom. Dihedral restraints were obtained from TALOS,92 only those dihedrals with a score of 10 were used. TALOS reported restraints were given bounds of ± 30° for phi and ± 40° for psi, and only dihedral angles in regions of regular secondary structure except in regions where the angle reported by TALOS was consistent with the HNHA experiment for phi angles or with the ratio of sequential interresidue and intraresidue amide NH to CαH cross peak intensities in the 15N NOESY data.93 The entire ensemble of 20 structures calculated, together with the distance and dihedral restraints, was deposited to the Protein Data Bank (pdb-id 1XPN).
FAST-NMR
The 1D 1H NMR line-broadening samples consist of 100 μM of each compound and 25 μM protein in a 20 mM D-Bis-Tris buffer with 11.1 μM TMSP as a NMR reference in 100% D2O at pH 7.0 (uncorrected). The NMR spectra were collected on a Bruker 500 MHz Avance spectrometer equipped with a triple-resonance, Z-axis gradient cryoprobe, a BACS-120 sample changer, and Icon NMR software for automated data collection. 1H NMR spectra were collected with 128 transients at 298 K with solvent presaturation of the residual HDO, a sweep-width of 6009 Hz and 32 K data points and a total acquisition time of 6 min. The NMR spectra were processed automatically using a macro in the ACD/1D NMR manager (Advanced Chemistry Development, Toronto, ON). The NMR data was Fourier transformed, zero-filled, phased, and baseline corrected. Each spectrum was referenced with the TMSP peak set to 0.0 ppm and peak-picked. Compounds were identified as binding by a visual comparison of the free and protein spectra for all 113 mixtures.
Only the 20 compounds that exhibited line broadening in the 1D 1H experiment were used in the 2D 1H-15N HSQC experiments to verify specific binding and identify the ligand binding site. Twenty-one 2D 1H-15N HSQC spectra were collected with 16 transients at 298 K with a sweep-width of 6009 Hz and 1 K data points in the direct dimension and 1612 Hz and 256 data points in the indirect dimension for a total acquisition time of 2.5 h. Each NMR sample consists of 100 μM protein and 400 μM compound in a Bis-Tris buffer in 95% H2O, 5% D2O at pH 7.0 (uncorrected). The spectra were processed using NMRPipe94 on a Intel Xeon 3.06 GHz dual processor Linux Workstation. Chemical shift differences were identified by comparing a reference 2D 1H-15N HSQC spectrum of the free protein to the spectrum of the compound-protein samples. CSPs were then mapped onto the surface of the PA1324 structure (PDB ID: 1XPN) and visualized using VMD-XPLOR.95 Only one compound, suramin, exhibited a definitive binding site. A series of titration experiments were done with increasing amount of suramin to determine the binding site. The PA1324 concentration was approximately constant at 100 μM in a single tube whereas the suramin concentrations were increased from 0 to 5 μM, 10 μM, 15 μM, 20 μM, 25 μM, 50 μM, 100 μM, 200 μM, 300 μM, 400 μM, 500 μM, and 600 μM. A titration experiment was also done with a heterogeneous sample of heparin (Sigma Aldrich, St. Louis, MO) to verify suramin binding to PA1324 was analogous to PA1324 binding heparin. Because the heterogeneous heparin sample has a molecular weight range of 6–12 kDa, an average MW of 9 kDa was used for determining the heparin concentration for the titration experiment. The PA1324 was again held constant at 100 μM whereas the heparin concentrations were increased from 0 to 5 μM, 10 μM, 15 μM, 20 μM, 25 μM, and 50 μM. At higher heparin concentrations, PA1324 began to precipitate out of solution. The addition of suramin and heparin caused chemical shift changes and peak broadening, where some peaks were broaden beyond detection at higher ligand concentrations. KD were measured by fitting chemical shift changes [Δ(ppm)] versus ligand concentration (L) using a standard Langmuir binding isotherm:
(i)
The chemical shift changes were measured as:
(ii)
where ΔNH is the difference between free and bound 1H amide chemical shifts (ppm) and Δ15N is the difference between free and bound 15N chemical shifts (ppm). The chemical shift changes from the suramin and heparin titrations for eleven and eight PA1324 residues, respectively, were superimposed based on normalizing the average chemical shift changes. A KD was measured by simultaneously fitting a single binding curve to this data using KaleidaGraph (Synergy Software, Reading, PA). A KD of 152 ± 18 μM with an R2 of 0.89 was determined for suramin binding to PA1324 and a KD of 51 ± 14 μM with an R2 of 0.84 was determined for heparin binding to PA1324 [Fig. 2(S)].
AutoDock96 was used to model the interaction between suramin and PA1324. Amides found to undergo CSPs following suramin binding were used to define a binding site region encompassing all the perturbed residues. A grid was manually defined within AutoDock to encompass the determined ligand-binding site by adjusting the x,y,z coordinates for the center of the grid box to position the grid in the binding pocket. The grid size is determined by the number of points in the x,y,z dimensions and is just visibly large enough to fit the entire ligand. The co-structures are selected based on a consistency with the observed CSPs in the 2D 1H-15N HSQC spectra.
CPASS database and software was developed to aid in the functional annotation of hypothetical proteins by using a protein-ligand co-structure.14 The CPASS database contains ∼27,000 ligand-defined active-sites identified from structures deposited in the PDB. CPASS returns a similarity score (0–100%) and an interactive 3D graphical display of the structural alignment for each active-site comparison. The CPASS software runs on 16-node Beowulf Linux cluster with a simple web-based interface. Each comparison averages ∼40s requiring ∼18 h to complete a comparison against the entire database.
Acknowledgements
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
A portion of the research was performed using the Environmental Molecular Sciences Laboratory (EMSL), a national scientific user facility sponsored by the Department of Energy's Office of Biological and Environmental Research and located at Pacific Northwest National Laboratory. Montelione laboratory and other members of the NESG Consortium for shared NMR technologies.
References
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 11
- 12
- 13
- 14
- 15
- 16
- 17
- 18
- 19
- 20
- 21
- 22
- 23
- 24
- 25
- 26
- 27
- 28
- 29
- 30
- 31
- 32
- 33
- 34
- 35
- 36
- 37
- 38
- 39
- 40
- 41
- 42
- 43
- 44
- 45
- 46
- 47
- 48
- 49
- 50
- 51
- 52
- 53
- 54
- 55
- 56
- 57
- 58
- 59
- 60
- 61
- 62
- 63
- 64
- 65
- 66
- 67
- 68
- 69
- 70
- 71
- 72
- 73
- 74
- 75
- 76
- 77
- 78
- 79
- 80
- 81
- 82
- 83
- 84
- 85
- 86
- 87
- 88
- 89
- 90
- 91
- 92
- 93
- 94
- 95
- 96
Supporting Information
- Top of page
- Abstract
- Introduction
- Results
- Discussion
- Materials and Methods
- Acknowledgements
- References
- Supporting Information
Additional Supporting Information may be found in the online version of this article.
| Filename | Format | Size | Description |
|---|---|---|---|
| PRO_62_sm_SuppInfo.pdf | 293K | Supporting Information. |
Please note: Wiley-Blackwell are not responsible for the content or functionality of any supporting materials supplied by the authors. Any queries (other than missing material) should be directed to the corresponding author for the article.

1469-896X/asset/olbannerleft.gif?v=1&s=d218899ae53b2862ab119790ed504b8d72122fb3)
1469-896X/asset/olbannerright.gif?v=1&s=59470eb9a1d9b7b13b1be75e9445e6c46ee2214f)

