Abstract
- Top of page
- Abstract
- Methods
- General Computational Strategy
- Results and Discussions
- Conclusions
- Acknowledgments
- References
- Supporting Information
Yersinia pestis causes diseases ranging from gastrointestinal syndromes to bubonic plague and could be misused as a biological weapon. As its protein tyrosine phosphatase YopH has already been demonstrated as a potential drug target, we have developed two series of forty salicylic acid derivatives and found sixteen to have micromolar inhibitory activity. We designed these ligands to have two chemical moieties connected by a flexible hydrocarbon linker to target two pockets in the active site of the protein to achieve binding affinity and selectivity. One moiety possessed the salicylic acid core intending to target the phosphotyrosine-binding pocket. The other moiety contained different chemical fragments meant to target a nearby secondary pocket. The two series of compounds differed by having hydrocarbon linkers with different lengths. Before experimental co-crystal structures are available, we have performed molecular docking to predict how these compounds might bind to the protein and to generate structural models for performing binding affinity calculation to aid future optimization of these series of compounds.
Yersinia pestis can cause human diseases such as gastrointestinal syndromes and bubonic plague (1–3) and could be misused as a biological weapon (4). There is already evidence that blocking the protein tyrosine phosphatase YopH of this bacterium can be an effective therapeutic strategy. For example, altering the gene of YopH to a non-functional one removed the bacterium’s pathogenicity (5–7). Mutating the essential catalytic cysteine residue of YopH to alanine also abolished its protein tyrosine phosphatase activity and dampened the pathogenic effects of the bacterium (8,9). Consequently, potent and selective YopH inhibitors are expected to serve as novel anti-plague agents.
Several YopH inhibitors have already been identified over the last few years: Sun et al. (4) developed p-nitrocatechol sulfate (pNCS) and determined its co-crystal structure with YopH. Phan et al. (10) designed a hexapeptide mimic, Ac-DADE-F2Pmp-L-NH2, of the protein’s natural substrate (F2Pmp stands for difluoro-substituted phosphonomethylphenylalanine, which is a phosphotyrosine analog.) and determined its co-crystal structure with the protein. Liang et al. (11) identified aurintricarboxylic acid as a potent inhibitor of YopH and it displayed 6- to 120-fold selectivity in favor of YopH over a panel of mammalian protein tyrosine phosphatases. Tautz et al. (12) screened the DIVERSet™ library (ChemBridge, Inc. San Diego, CA, USA) of drug-like compounds and identified furanyl salicylate compounds as potent inhibitors of YopH. Hu and Stebbins (13) performed molecular docking and 3D-QSAR studies to rationalize the binding of derivatives of α-ketocarboxylic acids and squaric acid to YopH and to provide 3D-QSAR models to guide future refinement of this class of compounds.
In spite of these encouraging developments, the search for additional drug leads remains vital as many factors can prevent existing drug leads from passing through a series of stringent preclinical and clinical evaluations to become successful drugs. In this regard, most YopH inhibitors reported in the literature display unfavorable pharmacological properties and are not cell permeable. Moreover, multidrug-resistant strains of Yersinia pestis can emerge (14,15). To develop YopH inhibitors that carry sufficient polar and non-polar interactions with the active site and yet possess favorable pharmacological properties, we decided to capitalize our previous findings that the natural product salicylic acid can serve as a pTyr surrogate (16) and that naphthyl and polyaromatic salicylic acid derivatives exhibit enhanced affinity for protein tyrosine phosphatase relative to the corresponding single ring compounds (11,16). Therefore, in this work, we synthesized a new class of benzofuran salicylic acids and found many of them to demonstrate μm activity.
Our initial design principle assumed the benzofuran salicylic acid core to bind to the phosphotyrosine-binding pocket. By introducing an additional chemical entity, linked to the core by a flexible hydrophobic linker, we hoped to target a neighboring pocket simultaneously to improve potency and selectivity. This article presents two series of these compounds differing by having different length of the linker connecting the two chemical moieties (B and D series shown in Figure 1).
To investigate whether these compounds are likely to bind the way that we expected, we performed molecular docking using a flexible ligand-flexible protein model we developed recently. The method improved docking by going beyond the rigid-protein approximation to account for induced-fit effects so that it could dock a wider range of ligands properly to a protein. The model used molecular dynamics simulation as a sampling tool. However, instead of running simulations at a constant temperature, it employed a simulated annealing cycling protocol to improve sampling efficiency. The protein was not completely flexible but with harmonic constraints applied to the α carbons to keep its structure near a suitable reference structure such as one obtained from X-ray crystallography. However, all other atoms, including all the side chains, were unrestrained (17,18). Although not yet a completely flexible protein model, this model avoided artifacts resulting from non-optimal energy and solvation models by focusing on exploring the conformational space near a known experimental structure. Previously, we showed that this model successfully docked several small organic ligands to protein kinases and the protein tyrosine phosphatase YopH (17,18); a completely rigid-protein model, on the other hand, failed for some ligands studied (18).
In this application, we further leveraged this approach by performing docking in two stages to improve speed without significantly sacrificing reliability. As we were studying relatively similar compounds within a chemical series, we assumed docking several representative ligands in stage 1 could generate all the major docking modes accessible by every compound in the series. Then, in stage 2, we allowed each compound in the series to refine its structure around these major docking modes by performing less expensive docking on a focused subset of configurational space. We then applied the resulting docking poses to compute binding affinity to check further whether they yielded results consistent with experimental IC50. In the future, one can use the best resulting structural models for performing binding affinity calculation on new derivatives to suggest new compounds that are worthwhile to make and test experimentally.
Results and Discussions
- Top of page
- Abstract
- Methods
- General Computational Strategy
- Results and Discussions
- Conclusions
- Acknowledgments
- References
- Supporting Information
Figure 3A–F plot RMSDheavy versus energy for the six YopH–ligand systems selected in stage 1 docking, each docking covered a total of 80 ns of simulation time. Here, RMSDheavy represented the RMSD of all heavy atoms of the ligand between a structure near a local minimum (a structure obtained below 5 K) and one that had the lowest energy. The plot for ligand D09 gave one deep-energy well significantly separated from the next lowest-energy well. On the other hand, the other ligands yielded two or more deep-energy wells with relatively similar energies, making it difficult to identify a single best docking mode. We therefore performed the clustering described above to identify several major docking modes for stage 2 docking that included all ligands within a series and used the resulting structures to perform binding affinity calculation to examine which model produced results most consistent with measured IC50.
The clustering described earlier produced four major docking modes denoted by B-Model I to IV for the B series and D-Model I to IV for the D series in Table 2. We labeled the structural models such that B-Model I was most similar to D-Model I, B-Model II was most similar to D-Model II, B-Model III was most similar to D-Model III, and B-Model IV was most similar to D-Model IV. However, remember from Figure 1 that ligands in the two series differed by the length of the linker connecting the two chemical moieties intended to target two different pockets. Therefore, one would expect minor structural differences between B-Model I and D-Model I, B-Model II and D-Model II etc. Also, recall that we obtained these models in two rounds of clustering. The first round clustered every structure near local energy minima obtained from the SA docking using a cutoff of 2 Å. The second round merged the twenty lowest-energy clusters obtained from round 1 into a smaller number of larger clusters using a larger cutoff distance of 5 Å. We found four clusters that were adopted by five of the six ligands and we chose these clusters as the four major docking mode used for stage 2 docking.
Table 2. Twenty best structural clusters grouped into four larger clusters | Model | Ligand |
|---|
| B11 | B16 | B17 | D09 | D03 | D14 |
|---|
|
| B-Model Ia/D-Model I | 14th/1b | 2nd/6 | 1st/3 | 1st/11 | 1st/3 | 6thc/5 |
| B-Model II/D-Model II | 1st/7 | 4th/4 | 2nd/9 | 14th/2 | 2nd/5 | 8th/5 |
| B-Model III/D-Model III | 4th/8 | 1st/6 | 4th/7 | 6th/2 | 6th/4 | 14th/2 |
| B-Model IV/D-Model IV | 11thc/4 | 11th/4 | 19th/1 | | 3rd/8 | 3rd/2 |
| Others | | | | 2d/5 | | 3d/6 |
Table 2 shows how the twenty lowest-energy clusters obtained in round 1 were grouped to form the four clusters in round 2. For example, the notation 1st/11 in D-Model I for ligand D09 means that eleven of the twenty lowest-energy clusters obtained from round 1 were merged into one cluster and the lowest-energy structure came from the 1st small cluster obtained in round 1. (We labeled the clusters from 1st to 20th in increasing energy.) The structure of the 1st cluster was used to represent D-Model I and as a template to construct all the ligands for stage 2 docking.
Five (B11, B16, B17, D03, and D14) of these six ligands all yielded the above four docking modes. D09, on the other hand, did not assume Model IV. Instead, it adopted another docking mode in which the ligand was sharply bended at the linker with the chemical moieties on its two ends tightly packed against each other (shown in “Others” in Table 2). We did not include this docking mode in stage 2 docking because this docking mode occurred infrequently and at higher energies and were thus less likely to be the correct docking mode. Ligand D14 also took on a similar sharply bended docking mode but again was ignored for stage 2 docking for the same reason. Table 3 summarizes the docking mode adopted by the lowest-energy structure of each ligand. The three ligands in the B series took on different docking modes as their lowest-energy docking structures. On the other hand, the lowest-energy structures for the three ligands in the D series all accepted Model I.
Table 3. The major docking modes adopted by the lowest-energy structures of the six ligands used in stage 1 docking | Ligand | B11 | B16 | B17 | D03 | D09 | D14 |
| Model | B-Model II | B-Model III | B-Model I | D-Model I | D-Model I | D-Model I |
Figure 4 displays the structures of these docking modes. In each model, two (for D-Model IV only) or three different ligands that docked similarly were overlaid. For example, B-model I in Figure 4A shows the lowest-energy structures from the 14th cluster for ligand B11, from the 2nd cluster for ligand B16, and from the 1st cluster for ligand B17. On the other hand, D-model I in Figure 4E displays the lowest-energy structures from the 1st cluster for ligand D09, the 1st cluster for ligand D03, and the 6th cluster for ligand D14. The figure shows that six binding modes (B/D-Model I, B/D-Model II, and B/D Model IV) had the salicylic acid core docked to the phosphotyrosine-binding pocket. They differed mainly in the non-salicylic portion of the compounds. On the other hand, B/D Model III differed by having the non-salicylic acid core docked inside the phosphotyrosine-binding pocket, with the salicylic acid core exposed to the protein surface.
Figure 5 uses the surface representation to show the different ways that ligand B11 might fit into the protein. The four lowest-energy structures in column 2 of Table 2 are shown. These structures show that one end of the bidentate ligand always occupied the phosphotyrosine-binding pocket. On the other hand, the other end of the ligand fitted into three different secondary pockets. A closer comparison of these structures revealed that these pockets changed somewhat depending on which ligand was bound to them – a flexible-protein model, as done here, is essential to capture such effects.
Stage 2 docking assumed that all the compounds within a series bound in one of these four docking modes with minor adjustment of the protein structures at the protein–ligand interface. D-Model I appeared to be the most likely docking mode for the D series, as it was the lowest-energy docking pose for all the three ligands in the D series on which extensive flexible ligand-flexible protein docking was performed in stage 1 (Table 3). On the other hand, the three ligands in the B series found different major docking modes as their lowest-energy docking pose, suggesting the possibility that not all compounds in the B series might bind with the same major docking mode.
To check this further, we used each structural model to perform binding affinity calculation to find out which model gave results most consistent with experimental IC50. Table 4 gives the calculated binding affinity using the ε(r) = 4r or the GBMV solvation model for the twenty ligands in the B series and for each of the four different structural models: B-Model I-IV obtained in stage 1. The results also included a mixed structural model in which not all ligands within the series were required to dock to the same major docking mode. Instead, each ligand selected the docking mode that gave the most favorable binding affinity. In this preliminary evaluation of the performance of the different docking modes, we simply used one cutoff value of the binding affinity to divide the compounds into actives and non-actives. Because there were nine known actives for the B series, we first classified the nine compounds with the most favorable predicted binding affinity as actives and the rest as non-actives. The sensitivity, specificity, accuracy, and enrichment factor in the table then indicate that B-Model I performed the best for both solvation models (with the best accuracy, for example). However, the mixed structural model was not far behind for both solvation models. B-Model IV also scored quite well for the ε(r) = 4r solvation model but did not score as well with the GBMV model. Based on these data, we therefore concluded that B-Model I was most likely, although the mixed structural model could be a possibility as well. For B-Model I, the salicylic acid core bound to the phosphotyrosine-binding pocket as intended.
Table 4. Binding affinity of ligands in the B series obtained from the ε(r) = 4r and the GBMV solvation models for the four major docking modes identified from stage 1 docking | Ligands | ε(r) = 4r model | GBMV model |
|---|
| I | II | III | IV | Mixa | I | II | III | IV | Mixa |
|---|
|
| B01 | −66.08 | −68.36 | −67.93 | −69.30 | −69.30 | −49.75 | −41.72 | −42.66 | −53.80 | −53.80 |
| B02 | −67.98 | −68.76 | −66.91 | −69.16 | −69.16 | −47.59 | −42.07 | −41.31 | −51.27 | −51.27 |
| B03 | −69.95 | −73.18 | −72.38 | −71.60 | −73.18 | −53.30 | −44.88 | −46.78 | −48.23 | −53.30 |
| B04 | −64.95 | −65.61 | −65.05 | −69.58 | −69.58 | −50.50 | −40.59 | −43.15 | −50.40 | −50.50 |
| B05 | −69.88 | −72.59 | −70.78 | −72.73 | −72.73 | −50.39 | −42.57 | −48.43 | −50.70 | −50.70 |
| B06 | −73.54 | −76.71 | −72.70 | −75.12 | −76.71 | −51.48 | −39.61 | −48.07 | −52.41 | −52.41 |
| B07 | −70.89 | −74.17 | −70.89 | −73.38 | −74.17 | −50.95 | −44.44 | −53.45 | −52.49 | −53.45 |
| B08 | −80.83 | −74.16 | −72.45 | −79.39 | −80.83 | −54.00 | −41.40 | −47.71 | −56.04 | −56.04 |
| B09 | −86.87 | −73.29 | −78.64 | −74.86 | −86.87 | −54.98 | −44.79 | −45.07 | −56.20 | −56.20 |
| B10 | −81.33 | −74.26 | −71.13 | −78.41 | −81.33 | −57.49 | −46.95 | −39.28 | −51.72 | −57.49 |
| B11 | −75.68 | −78.52 | −74.27 | −77.50 | −78.52 | −53.81 | −45.54 | −53.01 | −50.95 | −53.81 |
| B12 | −76.83 | −74.03 | −75.86 | −76.85 | −76.85 | −54.54 | −45.61 | −46.00 | −59.52 | −59.52 |
| B13 | −69.81 | −74.14 | −75.09 | −75.29 | −75.29 | −53.37 | −45.20 | −48.36 | −51.17 | −53.37 |
| B14 | −79.53 | −76.35 | −75.32 | −79.78 | −79.78 | −54.17 | −42.63 | −55.08 | −56.08 | −56.08 |
| B15 | −78.79 | −77.50 | −73.73 | −75.70 | −78.79 | −56.11 | −48.59 | −48.14 | −50.97 | −56.11 |
| B16 | −74.46 | −76.23 | −76.62 | −76.02 | −76.62 | −52.20 | −46.71 | −57.20 | −50.48 | −57.20 |
| B17 | −77.91 | −79.59 | −76.44 | −78.71 | −79.59 | −54.66 | −43.51 | −55.56 | −51.19 | −55.56 |
| B18 | −74.03 | −73.87 | −75.37 | −77.30 | −77.30 | −51.86 | −39.21 | −54.03 | −51.87 | −54.03 |
| B19 | −69.75 | −73.28 | −71.39 | −73.61 | −73.61 | −53.76 | −42.13 | −42.17 | −50.77 | −53.76 |
| B20 | −64.09 | −64.14 | −62.45 | −66.51 | −66.61 | −47.76 | −37.56 | −36.28 | −38.66 | −47.76 |
| TP/(TP + TN) | 9/9 | 7/9 | 7/9 | 8/9 | 8/9 | 8/9 | 6/9 | 5/9 | 5/9 | 8/9 |
| Se (%) | 100 | 78 | 78 | 89 | 89 | 89 | 67 | 56 | 56 | 89 |
| Sp (%) | 100 | 82 | 82 | 91 | 91 | 91 | 73 | 64 | 64 | 91 |
| Acc (%) | 100 | 80 | 80 | 90 | 90 | 90 | 70 | 60 | 60 | 90 |
| EF | 2.22 | 1.73 | 1.73 | 1.98 | 1.98 | 1.98 | 1.48 | 1.23 | 1.23 | 1.98 |
Table 5 shows the corresponding results for the D series. Because the experimental results only showed seven actives, we classified the seven compounds predicted to have the most favorable binding affinity as actives. The results suggest that D-Model I performed the best for both solvation model. In contrast to the B series, the mixed structural model only scored well for the ε(r) = 4r solvation model but not for GBMV model. If we only trust docking models that performed well with both solvation models, D-Model I was the best choice for the D series.
Table 5. Binding affinity of ligands in the D series obtained by using the ε(r) = 4r and the GBMV solvation models for the four major docking modes identified from stage 1 docking | D-Series ligands | ε(r) = 4r implicit model | GBMV implicit model |
|---|
| I | II | III | IV | Mixa | I | II | III | IV | Mixa |
|---|
|
| D01 | −74.16 | −73.98 | −71.58 | −76.34 | −76.34 | −55.71 | −55.87 | −44.90 | −65.22 | −65.22 |
| D02 | −74.30 | −76.32 | −68.61 | −76.05 | −76.32 | −54.21 | −54.86 | −49.46 | −71.45 | −71.45 |
| D03 | −79.29 | −75.19 | −75.34 | −79.26 | −79.29 | −54.90 | −54.52 | −50.88 | −71.94 | −71.94 |
| D04 | −71.30 | −75.45 | −66.67 | −74.34 | −75.45 | −61.99 | −53.63 | −48.55 | −65.15 | −65.15 |
| D05 | −75.46 | −77.03 | −77.65 | −77.26 | −77.65 | −54.97 | −52.22 | −53.53 | −65.88 | −65.88 |
| D06 | −78.47 | −78.31 | −78.92 | −80.92 | −80.92 | −58.26 | −59.44 | −53.43 | −69.02 | −69.02 |
| D07 | −77.10 | −77.45 | −73.22 | −78.66 | −78.66 | −61.00 | −51.38 | −49.14 | −69.16 | −69.16 |
| D08 | −86.62 | −77.26 | −83.61 | −84.12 | −86.62 | −59.30 | −52.73 | −58.06 | −68.04 | −68.04 |
| D09 | −93.19 | −79.02 | −82.65 | −79.48 | −93.19 | −64.11 | −53.73 | −50.25 | −62.47 | −64.11 |
| D10 | −90.97 | −79.97 | −76.31 | −84.04 | −90.97 | −62.40 | −53.84 | −55.71 | −71.16 | −71.16 |
| D11 | −81.01 | −80.60 | −80.25 | −81.79 | −81.79 | −63.64 | −56.57 | −52.17 | −68.55 | −68.55 |
| D12 | −82.27 | −77.31 | −78.87 | −80.79 | −82.27 | −62.48 | −53.74 | −54.49 | −68.98 | −68.98 |
| D13 | −74.28 | −79.96 | −73.60 | −77.40 | −79.96 | −59.33 | −56.48 | −48.11 | −66.77 | −66.77 |
| D14 | −85.38 | −84.20 | −84.75 | −83.37 | −85.38 | −62.85 | −58.98 | −48.63 | −66.94 | −66.94 |
| D15 | −85.38 | −84.42 | −83.60 | −87.25 | −87.25 | −59.40 | −58.84 | −52.85 | −71.53 | −71.53 |
| D16 | −79.63 | −77.41 | −78.88 | −81.96 | −81.96 | −62.25 | −54.19 | −61.03 | −68.41 | −68.41 |
| D17 | −82.93 | −82.49 | −83.42 | −85.34 | −85.34 | −59.82 | −54.61 | −60.20 | −71.98 | −71.98 |
| D18 | −78.20 | −79.81 | −78.29 | −80.97 | −80.97 | −59.19 | −50.64 | −63.65 | −70.45 | −70.45 |
| D19 | −81.75 | −75.44 | −77.40 | −79.79 | −81.75 | −59.16 | −51.03 | −55.37 | −67.62 | −67.62 |
| D20 | −69.63 | −72.09 | −62.05 | −73.18 | −73.18 | −61.09 | −45.48 | −36.44 | −57.23 | −61.09 |
| TP/(TP + TN) | 4/7 | 2/7 | 2/7 | 3/7 | 4/7 | 6/7 | 1/7 | 3/7 | 2/7 | 2/7 |
| Se (%) | 57 | 29 | 29 | 43 | 57 | 86 | 14 | 43 | 29 | 29 |
| Sp (%) | 77 | 62 | 62 | 69 | 77 | 92 | 54 | 69 | 62 | 62 |
| Acc (%) | 70 | 50 | 50 | 60 | 70 | 90 | 40 | 60 | 50 | 50 |
| EF | 1.63 | 0.82 | 0.82 | 1.22 | 1.63 | 2.45 | 0.41 | 1.22 | 0.82 | 0.82 |
The above analysis relied on using one single cutoff value to classify compounds into actives and non-actives. To evaluate the models more thoroughly, we also generated ROC curves and calculated AUC which required repeating the calculations of sensitivity and specificity using different cutoff values for classifying compounds into actives and non-actives. Figure 6 shows these curves for different B Models. Table 6 gives the corresponding AUC. From these ROC plots and AUC values, we further confirmed that B-Model I performed the best for both solvation models. However, the mixed structural model had AUC almost as good. Therefore, our results did not significantly favor B-Model I over the mixed model. In future lead optimization, one may want to use both models to obtain consensus scoring and only suggests new derivatives that score well with both B-Model I and the mixed model for experimental testing.
Table 6. Area under curve (AUC) for receiver operating characteristics (ROC) plots | B or D Model | B Series | D Series |
|---|
| ε(r) = 4r | GBMV | ε(r) = 4r | GBMV |
|---|
| Model I | 1 | 0.97 | 0.71 | 0.81 |
| Model II | 0.88 | 0.85 | 0.52 | 0.53 |
| Model III | 0.87 | 0.67 | 0.55 | 0.52 |
| Model IV | 0.94 | 0.71 | 0.55 | 0.45 |
| Mixed | 0.97 | 0.99 | 0.69 | 0.45 |
Likewise, Figure 7 shows the ROC curves and Table 6 presents the corresponding AUC for compounds in the D series. This time, only D-Model I performed best with both solvation models. Therefore, this might be the most likely docking mode adopted by compounds in the D series.
Figure 8 shows how the ligand interacted with the protein in the lowest-energy structures for B-Model I and D-Model I as Model I was among the highest performance structural models for both series of compounds. Table S2 gives key protein-ligand interactions for these two predicted structures. The salicylic ring docked into the phosphotyrosine-binding pocket for both docking structures. In B-Model I (Figure 8A), the hydroxyl group formed hydrogen bonds with the main-chain NH groups of three P-loop residues (Arg-404, Ala-405 and Gly-406). One oxygen of the carboxylic group interacted with the guanidinium side chain of Arg-409 and the other oxygen hydrogen-bonded with the amino side chain of Gln-450 and the backbone NH group of Gln-357. The nitrogens in the triazole-ring interacted with the side-chain oxygen of Gln-357 and formed two hydrogen bonds with the terminal amino group of Lys-447. In addition, the oxygen and nitrogen from the amide hydrogen-bonded with the side-chain NH group of Arg-205 and the main-chain carbonyl group of Gln-446. One phenyl ring of the ligand situated in a pocket formed inside a bundle of three α-helices (α1, α6, and α7) and the other phenyl ring protruded into solution. The latter phenyl ring might be removed in the future to reduce the size of the compounds so that other more useful functional groups can be introduced elsewhere.
The structure of D-Model I demonstrated with ligand D09 (Figure 8B) had the phenylmorpholine ring tightly bound inside the bundle of three helices: α1, α6, and α7. The hydroxyl group and the carboxylate groups of the salicylic acid core formed extensive hydrogen bonds with the side chains of Arg-409, Lys-447 and the backbone NH groups of Gln-357, Arg-404 and Ala-405. Similar to B-Model I, the oxygen in the furan ring hydrogen bonded with NH2 of Arg-404. On the other hand, the three nitrogens of the triazole-ring formed hydrogen bonds with the side chains of Gln-357 and Lys-447. The amide interacted with Arg-205 and Gln-446 while the morpholine ring interacted with the side chains of Asp-448 and Arg-437.
Table 7 summarizes the AUC values obtained from the eight different rigid-protein docking models using Autodock described above. The best AUC values were smaller than those obtained from the flexible-receptor models, suggesting that the rigid-protein model did not perform as well as the flexible-protein model. It is therefore better to use the best docking modes obtained from flexible-receptor docking to guide future lead optimization.
Table 7. Area under curve (AUC) for ROC plots from rigid-protein docking using Autodock | B/D Model | B Series | D Series |
|---|
|
| Model I | 0.94 | 0.60 |
| Model II | 0.90 | 0.62 |
| Model III | 0.72 | 0.42 |
| Model IV | 0.57 | 0.51 |
| 1QZ0 | 0.75 | 0.37 |
| 1PA9 | 0.89 | 0.48 |
| 1YPT | 0.66 | 0.52 |
| Mixed | 0.77 | 0.49 |
Conclusions
- Top of page
- Abstract
- Methods
- General Computational Strategy
- Results and Discussions
- Conclusions
- Acknowledgments
- References
- Supporting Information
In this work, we have developed two series of forty compounds derived from the salicylic acid core and found 16 to have micromolar inhibition activity against YopH. The initial design strategy relied on introducing two chemical moieties, linked together by a flexible hydrocarbon chain, to target two pockets in the active site of the protein. We used the salicylic acid moiety to target the pocket that phosphotyrosine bind and tried different chemical entities on the opposite end to target a nearby secondary pocket.
To predict how these compounds bind to YopH, we have started with flexible ligand-flexible protein docking using two different solvation models: ε(r) = 4r and GBMV. The docking suggested four possible docking modes, three had the salicylic acid core bound to the phosphotyrosine-binding pocket and one had the other end of the ligands bound instead. As the docking models yielded similar energy for these docking modes, it was difficult to single out the best docking mode. We therefore used each one of these docking modes to perform binding affinity calculations to examine which docking modes gave results most consistent with experimental IC50. We also considered a mixed structural model in which not all ligands were required to bind to the same binding mode. Instead, the docking mode – of four major docking modes identified from flexible-receptor docking – that yielded the most favorable binding affinity was taken. Different performance analysis such as calculating accuracy and area under receiver operating characteristics curve suggested that compounds in the B series might prefer a binding mode (B-Model I) in which all ligands bound with the salicylic acid core situated in the phosphotyrosine-binding pocket. However, the mixed model that allowed different ligands to take on different ones of the four possible major binding modes was also possible. On the other hand, compounds in the D series preferred a binding mode similar to B-Model I – i.e., D-Model I – in which all ligands bound in roughly the same way to the protein with minor adjustments at the protein–ligand interface.
We had also performed rigid-receptor docking using Autodock but the performance was not as good as the molecular dynamics-based flexible-receptor docking. The two best docking models found for the B series (B-Model I and the mixed model) and the best model found for the D series found from flexible-receptor docking might thus be the best to use in the immediate future to guide future optimization of these two series of compound, before experimental structures of protein–ligand complexes are available.