Volume 7, Issue 10
Article
Free Access

Cluster analysis of consensus water sites in thrombin and trypsin shows conservation between serine proteases and contributions to ligand specificity

Paul C. Sanschagrin

Protein Structural Analysis and Design Laboratory, Department of Biochemistry, Michigan State University, East Lansing, Michigan 48824‐1319

Search for more papers by this author
Leslie A. Kuhn

Corresponding Author

E-mail address: kuhn@agua.bch.msu.edu

Protein Structural Analysis and Design Laboratory, Department of Biochemistry, Michigan State University, East Lansing, Michigan 48824‐1319

Department of Biochemistry, Michigan State University, East Lansing, Michigan 48824‐1319 web site: http://www.bch.msu.edu/labs/kuhnSearch for more papers by this author
First published: 31 December 2008
Citations: 51

Abstract

Cluster analysis is presented as a technique for analyzing the conservation and chemistry of water sites from independent protein structures, and applied to thrombin, trypsin, and bovine pancreatic trypsin inhibitor (BPTI) to locate shared water sites, as well as those contributing to specificity. When several protein structures are superimposed, complete linkage cluster analysis provides an objective technique for resolving the continuum of overlaps between water sites into a set of maximally dense microclusters of overlapping water molecules, and also avoids reliance on any one structure as a reference. Water sites were clustered for ten superimposed thrombin structures, three trypsin structures, and four BPTI structures. For thrombin, 19% of the 708 microclusters, representing unique water sites, contained water molecules from at least half of the structures, and 4% contained waters from all 10. For trypsin, 77% of the 106 microclusters contained water sites from at least half of the structures, and 57% contained waters from all three. Water site conservation correlated with several environmental features: highly conserved microclusters generally had more protein atom neighbors, were in a more hydrophilic environment, made more hydrogen bonds to the protein, and were less mobile. There were significant overlaps between thrombin and trypsin conserved water sites, which did not localize to their similar active sites, but were concentrated in buried regions including the solvent channel surrounding the Na+ site in thrombin, which is associated with ligand selectivity. Cluster analysis also identified water sites conserved in thrombin but not trypsin, and vice versa, providing a list of water sites that may contribute to ligand discrimination. Thus, in addition to facilitating the analysis of water sites from multiple structures, cluster analysis provides a useful tool for distinguishing between conserved features within a protein family and those conferring specificity.

Number of times cited according to CrossRef: 51

  • The role of hydration effects in 5-fluorouridine binding to SOD1: insight from a new 3D-RISM-KH based protocol for including structural water in docking simulations, Journal of Computer-Aided Molecular Design, 10.1007/s10822-019-00239-3, (2019).
  • Solvation Free Energy as a Measure of Hydrophobicity: Application to Serine Protease Binding Interfaces, Journal of Chemical Theory and Computation, 10.1021/acs.jctc.9b00742, (2019).
  • Predicting Displaceable Water Sites Using Mixed-Solvent Molecular Dynamics, Journal of Chemical Information and Modeling, 10.1021/acs.jcim.7b00268, 58, 2, (305-314), (2018).
  • Protein–ligand interfaces are polarized: discovery of a strong trend for intermolecular hydrogen bonds to favor donors on the protein side with implications for predicting and designing ligand complexes, Journal of Computer-Aided Molecular Design, 10.1007/s10822-018-0105-2, 32, 4, (511-528), (2018).
  • Placement of Water Molecules in Protein Structures: From Large-Scale Evaluations to Single-Case Examples, Journal of Chemical Information and Modeling, 10.1021/acs.jcim.8b00271, 58, 8, (1625-1637), (2018).
  • Exploiting the 2-Amino-1,3,4-thiadiazole Scaffold To Inhibit Trypanosoma brucei Pteridine Reductase in Support of Early-Stage Drug Discovery , ACS Omega, 10.1021/acsomega.7b00473, 2, 9, (5666-5683), (2017).
  • Structure-Based Design of Scaffolds Targeting PDE10A by INPHARMA-NMR, Journal of Chemical Information and Modeling, 10.1021/acs.jcim.7b00246, 57, 6, (1488-1498), (2017).
  • Solvent effects on ligand binding to a serine protease, Physical Chemistry Chemical Physics, 10.1039/C6CP07899K, 19, 17, (10753-10766), (2017).
  • The Roles of Water in the Protein Matrix: A Largely Untapped Resource for Drug Discovery, Journal of Medicinal Chemistry, 10.1021/acs.jmedchem.7b00057, 60, 16, (6781-6827), (2017).
  • Protein conformational flexibility modulates kinetics and thermodynamics of drug binding, Nature Communications, 10.1038/s41467-017-02258-w, 8, 1, (2017).
  • Identification of Conserved Water Sites in Protein Structures for Drug Design, Journal of Chemical Information and Modeling, 10.1021/acs.jcim.7b00443, 57, 12, (3094-3103), (2017).
  • Chroman-4-One Derivatives Targeting Pteridine Reductase 1 and Showing Anti-Parasitic Activity, Molecules, 10.3390/molecules22030426, 22, 3, (426), (2017).
  • Exploring tapping potential of solar energy: Prioritization of Indian states, Renewable and Sustainable Energy Reviews, 10.1016/j.rser.2015.12.056, 58, (397-406), (2016).
  • Profiling of Flavonol Derivatives for the Development of Antitrypanosomatidic Drugs, Journal of Medicinal Chemistry, 10.1021/acs.jmedchem.6b00698, 59, 16, (7598-7616), (2016).
  • Hydration of proteins and nucleic acids: Advances in experiment and theory. A review, Biochimica et Biophysica Acta (BBA) - General Subjects, 10.1016/j.bbagen.2016.05.036, 1860, 9, (1821-1835), (2016).
  • Role of Water Molecules and Hydration Properties in Modeling Ligand–Protein Interaction and Drug Design, In Silico Drug Discovery and Design, 10.1201/b18799, (393-410), (2015).
  • WATCLUST: a tool for improving the design of drugs based on protein-water interactions: Fig. 1., Bioinformatics, 10.1093/bioinformatics/btv411, 31, 22, (3697-3699), (2015).
  • Conserved water molecules in bacterial serine hydroxymethyltransferases, Protein Engineering Design and Selection, 10.1093/protein/gzv026, 28, 10, (415-426), (2015).
  • Including Explicit Water Molecules as Part of the Protein Structure in MM/PBSA Calculations, Journal of Chemical Information and Modeling, 10.1021/ci4001794, 54, 2, (462-469), (2014).
  • PyWATER: a PyMOL plug-in to find conserved water molecules in proteins by clustering, Bioinformatics, 10.1093/bioinformatics/btu424, 30, 20, (2978-2980), (2014).
  • A systematic method for analysing the protein hydration structure of T4 lysozyme, Journal of Molecular Recognition, 10.1002/jmr.2290, 26, 10, (479-487), (2013).
  • Analysis of water patterns in protein kinase binding sites, "Proteins: Structure, Function, and Bioinformatics", 10.1002/prot.23032, 79, 7, (2109-2121), (2011).
  • Probing the Dynamic Nature of Water Molecules and Their Influences on Ligand Binding in a Model Binding Site, Journal of Chemical Information and Modeling, 10.1021/ci200052j, 51, 10, (2581-2594), (2011).
  • A GRID-Derived Water Network Stabilizes Molecular Dynamics Computer Simulations of a Protease, Journal of Chemical Information and Modeling, 10.1021/ci200138u, 51, 11, (2860-2867), (2011).
  • Consideration of Water and Solvation Effects in Virtual Screening, Virtual Screening, undefined, (263-289), (2011).
  • Ligand−Protein Cross-Docking with Water Molecules, Journal of Chemical Information and Modeling, 10.1021/ci900345h, 50, 3, (415-421), (2010).
  • Computational approaches to identifying and characterizing protein binding sites for ligand design, Journal of Molecular Recognition, 10.1002/jmr.984, 23, 2, (209-219), (2009).
  • Molecular Mechanism of the Hydration of Candida antarctica Lipase B in the Gas Phase: Water Adsorption Isotherms and Molecular Dynamics Simulations, ChemBioChem, 10.1002/cbic.200900544, 10, 18, (2913-2919), (2009).
  • CONSERVED WATER MOLECULES IN X-RAY STRUCTURES HIGHLIGHT THE ROLE OF WATER IN INTRAMOLECULAR AND INTERMOLECULAR INTERACTIONS, Journal of Bioinformatics and Computational Biology, 10.1142/S0219720008003588, 06, 04, (775-788), (2008).
  • Ligand−Protein Docking with Water Molecules, Journal of Chemical Information and Modeling, 10.1021/ci700285e, 48, 2, (397-408), (2008).
  • Conserved Water Molecules Stabilize the Ω-Loop in Class A β-Lactamases, Antimicrobial Agents and Chemotherapy, 10.1128/AAC.01035-07, 52, 3, (1072-1079), (2008).
  • Cluster analysis of hydration waters around the active sites of bacterial alanine racemase using a 2‐ns MD simulation, Biopolymers, 10.1002/bip.20893, 89, 3, (210-219), (2007).
  • NAD+‐dependent DNA ligase (Rv3014c) from Mycobacterium tuberculosis: Novel structure‐function relationship and identification of a specific inhibitor, "Proteins: Structure, Function, and Bioinformatics", 10.1002/prot.21457, 69, 1, (97-111), (2007).
  • Exploring structurally conserved solvent sites in protein families, "Proteins: Structure, Function, and Bioinformatics", 10.1002/prot.21014, 64, 2, (404-421), (2006).
  • Protein Surface Dynamics: Interaction with Water and Small Solutes, Journal of Biological Physics, 10.1007/s10867-005-0171-2, 31, 3-4, (433-452), (2005).
  • Molecular Dynamics of a Protein Surface: Ion-Residues Interactions, Biophysical Journal, 10.1529/biophysj.105.058917, 89, 2, (768-781), (2005).
  • Optimal Clustering for Detecting Near-Native Conformations in Protein Docking, Biophysical Journal, 10.1529/biophysj.104.058768, 89, 2, (867-875), (2005).
  • Did Protein Kinase Regulatory Mechanisms Evolve Through Elaboration of a Simple Structural Component?, Journal of Molecular Biology, 10.1016/j.jmb.2005.06.057, 351, 5, (956-972), (2005).
  • A new explicit hydration penalty score for ligand–protein interactions, Chemical Physics Letters, 10.1016/j.cplett.2004.10.019, 399, 1-3, (271-275), (2004).
  • Molecular Dissection of Na + Binding to Thrombin , Journal of Biological Chemistry, 10.1074/jbc.M401756200, 279, 30, (31842-31853), (2004).
  • WaterScore: a novel method for distinguishing between bound and displaceable water molecules in the crystal structure of the binding site of protein-ligand complexes, Journal of Molecular Modeling, 10.1007/s00894-003-0129-x, 9, 3, (172-182), (2003).
  • Anwendung und Grenzen kristallographischer Daten im strukturbezogenen Liganden‐ und Wirkstoff‐Design, Angewandte Chemie, 10.1002/ange.200200539, 115, 24, (2822-2841), (2003).
  • Application and Limitations of X‐ray Crystallographic Data in Structure‐Based Ligand and Drug Design, Angewandte Chemie International Edition, 10.1002/anie.200200539, 42, 24, (2718-2736), (2003).
  • Electrostatic properties in the catalytic site of papain: A possible regulatory mechanism for the reactivity of the ion pair, "Proteins: Structure, Function, and Bioinformatics", 10.1002/prot.10368, 52, 2, (236-253), (2003).
  • Virtual screening with solvation and ligand-induced complementarity, Virtual Screening: An Alternative or Complement to High Throughput Screening?, 10.1007/0-306-46883-2, (171-190), (2002).
  • Decomposition of Protein Tryptophan Fluorescence Spectra into Log-Normal Components. III. Correlation between Fluorescence and Microenvironment Parameters of Individual Tryptophan Residues, Biophysical Journal, 10.1016/S0006-3495(01)75825-0, 81, 3, (1735-1758), (2001).
  • The Identification of Tryptophan Residues Responsible for ATP-Induced Increase in Intrinsic Fluorescence of Myosin Subfragment 1, Journal of Biomolecular Structure and Dynamics, 10.1080/07391102.2000.10506651, 18, 1, (113-125), (2000).
  • Exploring potential solvation sites of proteins by multistart local minimization, Optimization in Computational Chemistry and Molecular Biology, 10.1007/978-1-4757-3218-4_14, (243-261), (2000).
  • Continuum electrostatic analysis of preferred solvation sites around proteins in solution, "Proteins: Structure, Function, and Bioinformatics", 10.1002/(SICI)1097-0134(20000201)38:2<176::AID-PROT6>3.0.CO;2-O, 38, 2, (176-188), (2000).
  • Unexpected crucial role of residue 225 in serine proteases, Proceedings of the National Academy of Sciences, 10.1073/pnas.96.5.1852, 96, 5, (1852-1857), (1999).
  • The C-terminal Sequence Encodes Function in Serine Proteases, Journal of Biological Chemistry, 10.1074/jbc.274.40.28063, 274, 40, (28063-28066), (1999).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.