Testing similarity measures with continuous and discrete protein models
Article first published online: 20 NOV 2002
Copyright © 2002 Wiley-Liss, Inc.
Proteins: Structure, Function, and Bioinformatics
Volume 50, Issue 1, pages 144–157, 1 January 2003
How to Cite
Wallin, S., Farwer, J. and Bastolla, U. (2003), Testing similarity measures with continuous and discrete protein models. Proteins, 50: 144–157. doi: 10.1002/prot.10271
- Issue published online: 20 NOV 2002
- Article first published online: 20 NOV 2002
- Manuscript Accepted: 15 AUG 2002
- Manuscript Received: 15 MAR 2002
- protein folding;
- distance measure;
- protein structure comparison;
- root-mean-square deviation
There are many ways to define the distance between two protein structures, thus assessing their similarity. Here, we investigate and compare the properties of five different distance measures, including the standard root-mean-square deviation (cRMSD). The performance of these measures is studied from different perspectives with two different protein models, one continuous and the other discrete. Using the continuous model, we examine the correlation between energy and native distance, and the ability of the different measures to discriminate between the two possible topologies of a three-helix bundle. Using the discrete model, we perform fits to real protein structures by minimizing different distance measures. The properties of the fitted structures are found to depend strongly on the distance measure used and the scale considered. We find that the cRMSD measure very effectively describes long-range features but is less effective with short-range features, and it correlates weakly with energy. A stronger correlation with energy and a better description of short-range properties is obtained when we use measures based on intramolecular distances. Proteins 2003;50:144–157. © 2002 Wiley-Liss, Inc.