Cluster analysis of consensus water sites in thrombin and trypsin shows conservation between serine proteases and contributions to ligand specificity
Abstract
Cluster analysis is presented as a technique for analyzing the conservation and chemistry of water sites from independent protein structures, and applied to thrombin, trypsin, and bovine pancreatic trypsin inhibitor (BPTI) to locate shared water sites, as well as those contributing to specificity. When several protein structures are superimposed, complete linkage cluster analysis provides an objective technique for resolving the continuum of overlaps between water sites into a set of maximally dense microclusters of overlapping water molecules, and also avoids reliance on any one structure as a reference. Water sites were clustered for ten superimposed thrombin structures, three trypsin structures, and four BPTI structures. For thrombin, 19% of the 708 microclusters, representing unique water sites, contained water molecules from at least half of the structures, and 4% contained waters from all 10. For trypsin, 77% of the 106 microclusters contained water sites from at least half of the structures, and 57% contained waters from all three. Water site conservation correlated with several environmental features: highly conserved microclusters generally had more protein atom neighbors, were in a more hydrophilic environment, made more hydrogen bonds to the protein, and were less mobile. There were significant overlaps between thrombin and trypsin conserved water sites, which did not localize to their similar active sites, but were concentrated in buried regions including the solvent channel surrounding the Na+ site in thrombin, which is associated with ligand selectivity. Cluster analysis also identified water sites conserved in thrombin but not trypsin, and vice versa, providing a list of water sites that may contribute to ligand discrimination. Thus, in addition to facilitating the analysis of water sites from multiple structures, cluster analysis provides a useful tool for distinguishing between conserved features within a protein family and those conferring specificity.
Citing Literature
Number of times cited according to CrossRef: 51
- Vijaya Kumar Hinge, Nikolay Blinov, Dipankar Roy, David S. Wishart, Andriy Kovalenko, The role of hydration effects in 5-fluorouridine binding to SOD1: insight from a new 3D-RISM-KH based protocol for including structural water in docking simulations, Journal of Computer-Aided Molecular Design, 10.1007/s10822-019-00239-3, (2019).
- Johannes Kraml, Anna Sophia Kamenik, Franz Waibl, Michael Schauperl, Klaus R. Liedl, Solvation Free Energy as a Measure of Hydrophobicity: Application to Serine Protease Binding Interfaces, Journal of Chemical Theory and Computation, 10.1021/acs.jctc.9b00742, (2019).
- Sarah E. Graham, Richard D. Smith, Heather A. Carlson, Predicting Displaceable Water Sites Using Mixed-Solvent Molecular Dynamics, Journal of Chemical Information and Modeling, 10.1021/acs.jcim.7b00268, 58, 2, (305-314), (2018).
- Sebastian Raschka, Alex J. Wolf, Joseph Bemister-Buffington, Leslie A. Kuhn, Protein–ligand interfaces are polarized: discovery of a strong trend for intermolecular hydrogen bonds to favor donors on the protein side with implications for predicting and designing ligand complexes, Journal of Computer-Aided Molecular Design, 10.1007/s10822-018-0105-2, 32, 4, (511-528), (2018).
- Eva Nittinger, Florian Flachsenberg, Stefan Bietz, Gudrun Lange, Robert Klein, Matthias Rarey, Placement of Water Molecules in Protein Structures: From Large-Scale Evaluations to Single-Case Examples, Journal of Chemical Information and Modeling, 10.1021/acs.jcim.8b00271, 58, 8, (1625-1637), (2018).
- Pasquale Linciano, Alice Dawson, Ina Pöhner, David M. Costa, Monica S. Sá, Anabela Cordeiro-da-Silva, Rosaria Luciani, Sheraz Gul, Gesa Witt, Bernhard Ellinger, Maria Kuzikov, Philip Gribbon, Jeanette Reinshagen, Markus Wolf, Birte Behrens, Véronique Hannaert, Paul A. M. Michels, Erika Nerini, Cecilia Pozzi, Flavio di Pisa, Giacomo Landi, Nuno Santarem, Stefania Ferrari, Puneet Saxena, Sandra Lazzari, Giuseppe Cannazza, Lucio H. Freitas-Junior, Carolina B. Moraes, Bruno S. Pascoalino, Laura M. Alcântara, Claudia P. Bertolacini, Vanessa Fontana, Ulrike Wittig, Wolfgang Müller, Rebecca C. Wade, William N. Hunter, Stefano Mangani, Luca Costantino, Maria P. Costi, Exploiting the 2-Amino-1,3,4-thiadiazole Scaffold To Inhibit Trypanosoma brucei Pteridine Reductase in Support of Early-Stage Drug Discovery , ACS Omega, 10.1021/acsomega.7b00473, 2, 9, (5666-5683), (2017).
- Luca Codutti, Manuela Grimaldi, Teresa Carlomagno, Structure-Based Design of Scaffolds Targeting PDE10A by INPHARMA-NMR, Journal of Chemical Information and Modeling, 10.1021/acs.jcim.7b00246, 57, 6, (1488-1498), (2017).
- Srinivasa M. Gopal, Fabian Klumpers, Christian Herrmann, Lars V. Schäfer, Solvent effects on ligand binding to a serine protease, Physical Chemistry Chemical Physics, 10.1039/C6CP07899K, 19, 17, (10753-10766), (2017).
- Francesca Spyrakis, Mostafa H. Ahmed, Alexander S. Bayden, Pietro Cozzini, Andrea Mozzarelli, Glen E. Kellogg, The Roles of Water in the Protein Matrix: A Largely Untapped Resource for Drug Discovery, Journal of Medicinal Chemistry, 10.1021/acs.jmedchem.7b00057, 60, 16, (6781-6827), (2017).
- M. Amaral, D. B. Kokh, J. Bomke, A. Wegener, H. P. Buchstaller, H. M. Eggenweiler, P. Matias, C. Sirrenberg, R. C. Wade, M. Frech, Protein conformational flexibility modulates kinetics and thermodynamics of drug binding, Nature Communications, 10.1038/s41467-017-02258-w, 8, 1, (2017).
- Marko Jukič, Janez Konc, Stanislav Gobec, Dušanka Janežič, Identification of Conserved Water Sites in Protein Structures for Drug Design, Journal of Chemical Information and Modeling, 10.1021/acs.jcim.7b00443, 57, 12, (3094-3103), (2017).
- Flavio Di Pisa, Giacomo Landi, Lucia Dello Iacono, Cecilia Pozzi, Chiara Borsari, Stefania Ferrari, Matteo Santucci, Nuno Santarem, Anabela Cordeiro-da-Silva, Carolina Moraes, Laura Alcantara, Vanessa Fontana, Lucio Freitas-Junior, Sheraz Gul, Maria Kuzikov, Birte Behrens, Ina Pöhner, Rebecca Wade, Maria Costi, Stefano Mangani, Chroman-4-One Derivatives Targeting Pteridine Reductase 1 and Showing Anti-Parasitic Activity, Molecules, 10.3390/molecules22030426, 22, 3, (426), (2017).
- Amritpal Singh, Gaurav Vats, Dinesh Khanduja, Exploring tapping potential of solar energy: Prioritization of Indian states, Renewable and Sustainable Energy Reviews, 10.1016/j.rser.2015.12.056, 58, (397-406), (2016).
- Chiara Borsari, Rosaria Luciani, Cecilia Pozzi, Ina Poehner, Stefan Henrich, Matteo Trande, Anabela Cordeiro-da-Silva, Nuno Santarem, Catarina Baptista, Annalisa Tait, Flavio Di Pisa, Lucia Dello Iacono, Giacomo Landi, Sheraz Gul, Markus Wolf, Maria Kuzikov, Bernhard Ellinger, Jeanette Reinshagen, Gesa Witt, Philip Gribbon, Manfred Kohler, Oliver Keminer, Birte Behrens, Luca Costantino, Paloma Tejera Nevado, Eugenia Bifeld, Julia Eick, Joachim Clos, Juan Torrado, María D. Jiménez-Antón, María J. Corral, José M Alunda, Federica Pellati, Rebecca C. Wade, Stefania Ferrari, Stefano Mangani, Maria Paola Costi, Profiling of Flavonol Derivatives for the Development of Antitrypanosomatidic Drugs, Journal of Medicinal Chemistry, 10.1021/acs.jmedchem.6b00698, 59, 16, (7598-7616), (2016).
- Lada Biedermannová, Bohdan Schneider, Hydration of proteins and nucleic acids: Advances in experiment and theory. A review, Biochimica et Biophysica Acta (BBA) - General Subjects, 10.1016/j.bbagen.2016.05.036, 1860, 9, (1821-1835), (2016).
- Alfonso García-Sosa, Role of Water Molecules and Hydration Properties in Modeling Ligand–Protein Interaction and Drug Design, In Silico Drug Discovery and Design, 10.1201/b18799, (393-410), (2015).
- Elias D. López, Juan Pablo Arcon, Diego F. Gauto, Ariel A. Petruk, Carlos P. Modenutti, Victoria G. Dumas, Marcelo A. Marti, Adrian G. Turjanski, WATCLUST: a tool for improving the design of drugs based on protein-water interactions: Fig. 1., Bioinformatics, 10.1093/bioinformatics/btv411, 31, 22, (3697-3699), (2015).
- Teresa Milano, Martino Luigi Di Salvo, Sebastiana Angelaccio, Stefano Pascarella, Conserved water molecules in bacterial serine hydroxymethyltransferases, Protein Engineering Design and Selection, 10.1093/protein/gzv026, 28, 10, (415-426), (2015).
- Yong-Liang Zhu, Paul Beroza, Dean R. Artis, Including Explicit Water Molecules as Part of the Protein Structure in MM/PBSA Calculations, Journal of Chemical Information and Modeling, 10.1021/ci4001794, 54, 2, (462-469), (2014).
- H. Patel, B. A. Gruning, S. Gunther, I. Merfort, PyWATER: a PyMOL plug-in to find conserved water molecules in proteins by clustering, Bioinformatics, 10.1093/bioinformatics/btu424, 30, 20, (2978-2980), (2014).
- J. Kysilka, J. Vondrášek, A systematic method for analysing the protein hydration structure of T4 lysozyme, Journal of Molecular Recognition, 10.1002/jmr.2290, 26, 10, (479-487), (2013).
- Caterina Barillari, Anna L. Duncan, Isaac M. Westwood, Julian Blagg, Rob L. M. van Montfort, Analysis of water patterns in protein kinase binding sites, "Proteins: Structure, Function, and Bioinformatics", 10.1002/prot.23032, 79, 7, (2109-2121), (2011).
- Daniel Cappel, Rickard Wahlström, Ruth Brenk, Christoph A. Sotriffer, Probing the Dynamic Nature of Water Molecules and Their Influences on Ligand Binding in a Model Binding Site, Journal of Chemical Information and Modeling, 10.1021/ci200052j, 51, 10, (2581-2594), (2011).
- Hannes G. Wallnoefer, Klaus R. Liedl, Thomas Fox, A GRID-Derived Water Network Stabilizes Molecular Dynamics Computer Simulations of a Protease, Journal of Chemical Information and Modeling, 10.1021/ci200138u, 51, 11, (2860-2867), (2011).
- Johannes Kirchmair, Gudrun M. Spitzer, Klaus R. Liedl, Consideration of Water and Solvation Effects in Virtual Screening, Virtual Screening, undefined, (263-289), (2011).
- Ramasamy Thilagavathi, Ricardo L. Mancera, Ligand−Protein Cross-Docking with Water Molecules, Journal of Chemical Information and Modeling, 10.1021/ci900345h, 50, 3, (415-421), (2010).
- Stefan Henrich, Outi M. H. Salo‐Ahen, Bingding Huang, Friedrich F. Rippmann, Gabriele Cruciani, Rebecca C. Wade, Computational approaches to identifying and characterizing protein binding sites for ligand design, Journal of Molecular Recognition, 10.1002/jmr.984, 23, 2, (209-219), (2009).
- Ricardo J. F. Branco, Marianne Graber, Vinciane Denis, Jürgen Pleiss, Molecular Mechanism of the Hydration of Candida antarctica Lipase B in the Gas Phase: Water Adsorption Isotherms and Molecular Dynamics Simulations, ChemBioChem, 10.1002/cbic.200900544, 10, 18, (2913-2919), (2009).
- EVGENIY AKSIANOV, OLGA ZANEGINA, ALEXANDER GRISHIN, SERGEY SPIRIN, ANNA KARYAGINA, ANDREI ALEXEEVSKI, CONSERVED WATER MOLECULES IN X-RAY STRUCTURES HIGHLIGHT THE ROLE OF WATER IN INTRAMOLECULAR AND INTERMOLECULAR INTERACTIONS, Journal of Bioinformatics and Computational Biology, 10.1142/S0219720008003588, 06, 04, (775-788), (2008).
- Benjamin C. Roberts, Ricardo L. Mancera, Ligand−Protein Docking with Water Molecules, Journal of Chemical Information and Modeling, 10.1021/ci700285e, 48, 2, (397-408), (2008).
- Fabian Bös, Jürgen Pleiss, Conserved Water Molecules Stabilize the Ω-Loop in Class A β-Lactamases, Antimicrobial Agents and Chemotherapy, 10.1128/AAC.01035-07, 52, 3, (1072-1079), (2008).
- Hung‐Chung Huang, Daniel Jupiter, Meikang Qiu, James M. Briggs, Vincent VanBuren, Cluster analysis of hydration waters around the active sites of bacterial alanine racemase using a 2‐ns MD simulation, Biopolymers, 10.1002/bip.20893, 89, 3, (210-219), (2007).
- Sandeep Kumar Srivastava, Divya Dube, Vandana Kukshal, Ashok Kumar Jha, Kanchan Hajela, Ravishankar Ramachandran, NAD+‐dependent DNA ligase (Rv3014c) from Mycobacterium tuberculosis: Novel structure‐function relationship and identification of a specific inhibitor, "Proteins: Structure, Function, and Bioinformatics", 10.1002/prot.21457, 69, 1, (97-111), (2007).
- Christopher A. Bottoms, Tommi A. White, John J. Tanner, Exploring structurally conserved solvent sites in protein families, "Proteins: Structure, Function, and Bioinformatics", 10.1002/prot.21014, 64, 2, (404-421), (2006).
- Ran Friedman, Esther Nachliel, Menachem Gutman, Protein Surface Dynamics: Interaction with Water and Small Solutes, Journal of Biological Physics, 10.1007/s10867-005-0171-2, 31, 3-4, (433-452), (2005).
- Ran Friedman, Esther Nachliel, Menachem Gutman, Molecular Dynamics of a Protein Surface: Ion-Residues Interactions, Biophysical Journal, 10.1529/biophysj.105.058917, 89, 2, (768-781), (2005).
- Dima Kozakov, Karl H. Clodfelter, Sandor Vajda, Carlos J. Camacho, Optimal Clustering for Detecting Near-Native Conformations in Protein Docking, Biophysical Journal, 10.1529/biophysj.104.058768, 89, 2, (867-875), (2005).
- Natarajan Kannan, Andrew F. Neuwald, Did Protein Kinase Regulatory Mechanisms Evolve Through Elaboration of a Simple Structural Component?, Journal of Molecular Biology, 10.1016/j.jmb.2005.06.057, 351, 5, (956-972), (2005).
- Ricardo L. Mancera, A new explicit hydration penalty score for ligand–protein interactions, Chemical Physics Letters, 10.1016/j.cplett.2004.10.019, 399, 1-3, (271-275), (2004).
- Agustin O. Pineda, Christopher J. Carrell, Leslie A. Bush, Swati Prasad, Sonia Caccia, Zhi-Wei Chen, F. Scott Mathews, Enrico Di Cera, Molecular Dissection of Na + Binding to Thrombin , Journal of Biological Chemistry, 10.1074/jbc.M401756200, 279, 30, (31842-31853), (2004).
- Alfonso T. García-Sosa, Ricardo L. Mancera, Philip M. Dean, WaterScore: a novel method for distinguishing between bound and displaceable water molecules in the crystal structure of the binding site of protein-ligand complexes, Journal of Molecular Modeling, 10.1007/s00894-003-0129-x, 9, 3, (172-182), (2003).
- Andrew M. Davis, Simon J. Teague, Gerard J. Kleywegt, Anwendung und Grenzen kristallographischer Daten im strukturbezogenen Liganden‐ und Wirkstoff‐Design, Angewandte Chemie, 10.1002/ange.200200539, 115, 24, (2822-2841), (2003).
- Andrew M. Davis, Simon J. Teague, Gerard J. Kleywegt, Application and Limitations of X‐ray Crystallographic Data in Structure‐Based Ligand and Drug Design, Angewandte Chemie International Edition, 10.1002/anie.200200539, 42, 24, (2718-2736), (2003).
- Laurent E. Dardenne, Araken S. Werneck, Marçal de Oliveira Neto, Paulo M. Bisch, Electrostatic properties in the catalytic site of papain: A possible regulatory mechanism for the reactivity of the ion pair, "Proteins: Structure, Function, and Bioinformatics", 10.1002/prot.10368, 52, 2, (236-253), (2003).
- Volker Schnecke, Leslie A. Kuhn, Virtual screening with solvation and ligand-induced complementarity, Virtual Screening: An Alternative or Complement to High Throughput Screening?, 10.1007/0-306-46883-2, (171-190), (2002).
- Yana K. Reshetnyak, Yuly Koshevnik, Edward A. Burstein, Decomposition of Protein Tryptophan Fluorescence Spectra into Log-Normal Components. III. Correlation between Fluorescence and Microenvironment Parameters of Individual Tryptophan Residues, Biophysical Journal, 10.1016/S0006-3495(01)75825-0, 81, 3, (1735-1758), (2001).
- Ya. K. Reshetnyak, O. A. Andreev, J. Borejdo, D. D. Toptygin, L. Brand, E. A. Burstein, The Identification of Tryptophan Residues Responsible for ATP-Induced Increase in Intrinsic Fluorescence of Myosin Subfragment 1, Journal of Biomolecular Structure and Dynamics, 10.1080/07391102.2000.10506651, 18, 1, (113-125), (2000).
- Sheldon Dennis, Carlos J. Camacho, Sandor Vajda, Exploring potential solvation sites of proteins by multistart local minimization, Optimization in Computational Chemistry and Molecular Biology, 10.1007/978-1-4757-3218-4_14, (243-261), (2000).
- Sheldon Dennis, Carlos J. Camacho, Sandor Vajda, Continuum electrostatic analysis of preferred solvation sites around proteins in solution, "Proteins: Structure, Function, and Bioinformatics", 10.1002/(SICI)1097-0134(20000201)38:2<176::AID-PROT6>3.0.CO;2-O, 38, 2, (176-188), (2000).
- E. R. Guinto, S. Caccia, T. Rose, K. Futterer, G. Waksman, E. Di Cera, Unexpected crucial role of residue 225 in serine proteases, Proceedings of the National Academy of Sciences, 10.1073/pnas.96.5.1852, 96, 5, (1852-1857), (1999).
- Maxwell M. Krem, Thierry Rose, Enrico Di Cera, The C-terminal Sequence Encodes Function in Serine Proteases, Journal of Biological Chemistry, 10.1074/jbc.274.40.28063, 274, 40, (28063-28066), (1999).




