Research Article
Protein fragment clustering and canonical local shapes
Article first published online: 30 JAN 2003
DOI: 10.1002/prot.10309
Copyright © 2003 Wiley-Liss, Inc.
Issue
1097-0134/asset/cover.gif?v=1&s=d817e79b67ba6cacf8bdcce1a819c04de300a7e3)
Proteins: Structure, Function, and Bioinformatics
Volume 50, Issue 4, pages 580–588, 1 March 2003
Additional Information
How to Cite
Hunter, C. G. and Subramaniam, S. (2003), Protein fragment clustering and canonical local shapes. Proteins, 50: 580–588. doi: 10.1002/prot.10309
Publication History
- Issue published online: 30 JAN 2003
- Article first published online: 30 JAN 2003
- Manuscript Accepted: 19 SEP 2002
- Manuscript Received: 2 MAY 2002
Funded by
- National Science Foundation. Grant Numbers: NSF-DBI-9974555, NSF-DBI-0003231
- Abstract
- Article
- References
- Cited By
Keywords:
- protein structure;
- protein fragments;
- local structure;
- fragment clustering
Abstract
A novel clustering method is used to cluster protein fragments by shape. The centroids (mean fragments from each cluster) form a basis set of structural motifs. A database of 156,643 seven-residue fragments is used, and eight different basis sets with varying levels of resolution are generated. Coarse basis sets contain tens of centroids and provide meaningful local shapes, which are more detailed than the traditional secondary structure categories. High-resolution basis sets contain thousands of centroids and can be used to model tertiary structure of longer segments. The basis sets generated fit nontraining set proteins with the expected accuracy. Proteins 2003;50:580–588. © 2003 Wiley-Liss, Inc.

1097-0134/asset/PROT_centre.gif?v=1&s=77b56b1f2cdaba74cb3bb149bd9b029cd8803cdb)