Computational chemistry approach to protein kinase recognition using 3D stochastic van der Waals spectral moments
Article first published online: 31 JAN 2007
DOI: 10.1002/jcc.20649
Copyright © 2007 Wiley Periodicals, Inc.
Additional Information
How to Cite
González-díaz, H., Saíz-urra, L., Molina, R., González-díaz, Y. and Sánchez-gonzález, A. (2007), Computational chemistry approach to protein kinase recognition using 3D stochastic van der Waals spectral moments. Journal of Computational Chemistry, 28: 1042–1048. doi: 10.1002/jcc.20649
Publication History
- Issue published online: 1 MAR 2007
- Article first published online: 31 JAN 2007
- Manuscript Accepted: 10 JUL 2006
- Manuscript Revised: 20 JUN 2006
- Manuscript Received: 27 APR 2006
Funded by
- Xunta de Galicia
- Programa Isidro Parga Pondal
- PGIDIT research projects. Grant Numbers: PGIDT05BTF20302PR-2, PXIB20304PR
- Axuda para a incorporacion de investigadores tecnologos/visitantes da CONSELLEREIA DE INOVACION, INDUSTRIA E COMERCIO IN8061 2005/63-0
Keywords:
- protein structure-function relationships;
- kinases;
- Markov models;
- moments;
- van der Waals interactions
Abstract
Three-dimensional (3D) protein structures now frequently lack functional annotations because of the increase in the rate at which chemical structures are solved with respect to experimental knowledge of biological activity. As a result, predicting structure-function relationships for proteins is an active research field in computational chemistry and has implications in medicinal chemistry, biochemistry and proteomics. In previous studies stochastic spectral moments were used to predict protein stability or function (González-Díaz, H. et al. Bioorg Med Chem 2005, 13, 323; Biopolymers 2005, 77, 296). Nevertheless, these moments take into consideration only electrostatic interactions and ignore other important factors such as van der Waals interactions. The present study introduces a new class of 3D structure molecular descriptors for folded proteins named the stochastic van der Waals spectral moments (oβk). Among many possible applications, recognition of kinases was selected due to the fact that previous computational chemistry studies in this area have not been reported, despite the widespread distribution of kinases. The best linear model found was Kact = −9.44°β0(c) +10.94°β5(c) −2.40°β0(i) + 2.45°β5(m) + 0.73, where core (c), inner (i) and middle (m) refer to specific spatial protein regions. The model with a high Matthew's regression coefficient (0.79) correctly classified 206 out of 230 proteins (89.6%) including both training and predicting series. An area under the ROC curve of 0.94 differentiates our model from a random classifier. A subsequent principal components analysis of 152 heterogeneous proteins demonstrated that βk codifies information different to other descriptors used in protein computational chemistry studies. Finally, the model recognizes 110 out of 125 kinases (88.0%) in a virtual screening experiment and this can be considered as an additional validation study (these proteins were not used in training or predicting series). © 2007 Wiley Periodicals, Inc. J Comput Chem 2007

1096-987X/asset/JCC_centre.gif?v=1&s=b0d6b2f567f5e92bfd33499dcef2c15d54c9375d)
1096-987X/asset/cover.gif?v=1&s=4429aac2462ebd499c13b3d7fe983679c5767778)
