SEARCH

SEARCH BY CITATION

FilenameFormatSizeDescription
men12240-sup-0001-FigS1-S4-TableS1-S9.pdfapplication/PDF871K

Fig. S1 Abundance of taxonomic orders in the iBOL data release package 3.75 – v1 (N = 86 306). Each insect order is followed by the number of sequences represented in the iBOL data set.

Fig. S2 Proportion of correctly classified queries during ‘leave-one-out cross-validation’ (LOOCV) testing of the GenBank-barcode trained classifier. No bootstrap support cut-off was used to filter results.

Fig. S3 Proportion of correctly classified queries during ‘leave-one-out cross-validation’ (LOOCV) testing of the GenBank-family trained classifier. No bootstrap support cut-off was used to filter results.

Fig. S4 Country of origin for sequences in the iBOL data release package 3.75 – v1. The top ten most abundant countries are shown for a) Lepidoptera (N = 8647) and b) Diptera (N = 46 233). Each country is followed by the number of sequences represented in the iBOL data set.

Table S1 List of Mantodea genera included during testing and their GenBank Accessions nos.

Table S2 Number of unique taxa in three insect COI training sets.

Table S3 Number of taxa represented by a single sequence (singletons) in three insect COI training sets (and proportion of total taxa from Table S2 in parentheses).

Table S4 Proportion of sequences misclassified at the genus rank for each insect order after performing ‘leave-one-out cross-validation’ (LOOCV) testing with the GenBank trained classifier.

Table S5 Bootstrap support cut-offs that result in at least 99% correctly classified queries during leave-one-out cross-validation of the GenBank-barcode trained insect COI classifier.

Table S6 Bootstrap cut-offs that result in at least 99% correctly classified queries during leave-one-out cross-validation of the GenBank-family trained insect COI classifier.

Table S7 Comparison of order rank taxonomic assignments using three versions of the classifier vs. known taxonomic assignments based on morphological characters.

Table S8 Naïve Bayesian classifier automated taxonomic assignments of Mantodea sequences (N = 82) verified to match known taxonomic assignments from the iBOL data release package 3.75 - v1 using bootstrap support cut-offs from Tables 3, S5, and S6.

Table S9 Lepidoptera (N = 8647) and Diptera (N = 46 233) sequences from iBOL data release package 3.75 - v1 that were originally taxonomically assigned to the order rank but are putatively refined to the family and genus ranks using the naïve Bayesian classifier using bootstrap support cut-offs from Table 3, S5, and S6.

Please note: Wiley Blackwell is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.