FOR THE RECORD
Signal peptide prediction based on analysis of experimentally verified cleavage sites
Article first published online: 1 JAN 2009
DOI: 10.1110/ps.04682504
Copyright © 2004 The Protein Society
Additional Information
How to Cite
Zhang, Z. and Henzel, W. J. (2004), Signal peptide prediction based on analysis of experimentally verified cleavage sites. Protein Science, 13: 2819–2824. doi: 10.1110/ps.04682504
Publication History
- Issue published online: 1 JAN 2009
- Article first published online: 1 JAN 2009
- Manuscript Accepted: 14 JUL 2004
- Manuscript Revised: 7 JUL 2004
- Manuscript Received: 9 FEB 2004
Keywords:
- signal peptide;
- cleavage site;
- experimental verification;
- SWISS-PROT annotation;
- computational prediction
Abstract
A number of computational tools are available for detecting signal peptides, but their abilities to locate the signal peptide cleavage sites vary significantly and are often less than satisfactory. We characterized a set of 270 secreted recombinant human proteins by automated Edman analysis and used the verified cleavage sites to evaluate the success rate of a number of computational prediction programs. An examination of the frequency of amino acid in the N-terminal region of the data set showed a preference of proline and glutamine but a bias against tyrosine. The data set was compared to the SWISS-PROT database and revealed a high percentage of discrepancies with cleavage site annotations that were computationally generated. The best program for predicting signal sequences was found to be SignalP 2.0-NN with an accuracy of 78.1% for cleavage site recognition. The new data set can be utilized for refining prediction algorithms, and we have built an improved version of profile hidden Markov model for signal peptides based on the new data.

1469-896X/asset/olbannerleft.gif?v=1&s=d218899ae53b2862ab119790ed504b8d72122fb3)
1469-896X/asset/olbannerright.gif?v=1&s=59470eb9a1d9b7b13b1be75e9445e6c46ee2214f)
1469-896X/asset/cover.gif?v=1&s=0ff483325083001100314f63c633a394aff24478)