As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re-organized for online delivery, but are not copy-edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.


Table S1. Distribution of cleavage sites among types of protein secondary structure predicted by six different bioinformatics methods. (a) Number of cleavage sites. (b) Total number of amino acids of particular secondary structure type. (c) Cleavage rate per peptide bond.

Table S2. Comparison of frequencies of cleavage sites between central and peripheral parts of b-strands predicted by six different bioinformatics methods.

Table S3. Significance of associations (-log(p-value)) of proteolytic events and sequence-derived protein structural features predicted by different bioinformatics methods.

Table S4. Prediction capabilities of sequence-derived structural features estimated by (a) Area under ROC Curve (AUC), (b) F-score, (c) Sensitivity, (d) Specificity metrics. AUC values, which are less than 0.5, are inverted (1-AUC) and maked by dark background. Binary features contain additional column with metrics calculated for their confidence values.

Table S5. Area under ROC Curve estimates calculated separately for substrates of a four protease types.

Please note: Wiley Blackwell is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.