†Present Address: Hawkesbury Institute for the Environment, University of Western Sydney, Richmond, NSW, 2753, Australia Correspondence site: http://www.respond2articles.com/MEE/
Accounting for uncertainty in species delineation during the analysis of environmental DNA sequence data
Article first published online: 31 MAY 2011
DOI: 10.1111/j.2041-210X.2011.00122.x
© 2011 The Author. Methods in Ecology and Evolution © 2011 British Ecological Society
Additional Information
How to Cite
Powell, J. R. (2012), Accounting for uncertainty in species delineation during the analysis of environmental DNA sequence data. Methods in Ecology and Evolution, 3: 1–11. doi: 10.1111/j.2041-210X.2011.00122.x
- †
†Present Address: Hawkesbury Institute for the Environment, University of Western Sydney, Richmond, NSW, 2753, Australia Correspondence site: http://www.respond2articles.com/MEE/
Publication History
- Issue published online: 1 FEB 2012
- Article first published online: 31 MAY 2011
- Received 2 December 2010; accepted 21 April 2011 Handling Editor: Emmanuel Paradis
Keywords:
- 454 pyrosequencing;
- biodiversity;
- bioinformatics;
- coalescent;
- GMYC model;
- speciation
Summary
1. Defining species boundaries represents a significant challenge in biodiversity studies, especially as these studies increasingly rely on high-throughput DNA sequencing technologies. A promising approach for delineating species in environmental sequence data combines phylogenetics and coalescence theory to estimate species boundaries from distributions of lineage birth rates within multispecies coalescent trees.
2. Existing methods for interpreting these models utilize hypothetico-deductive reasoning to identify thresholds associated with a mixed speciation-coalescent model that fits the data better than a null model. Here, I describe an alternative approach that ranks and assigns weights to models based on their fit to the data using information criteria and then uses model averaging to estimate parameters and species probabilities.
3. This approach is applied to data from two independent studies that address (i) patterns of cospeciation in an aphid–bacterial symbiosis and (ii) diversity of bacterial communities associated with the human gut. In both of these cases, accounting for uncertainty during model selection allowed greater flexibility to detect variable (with respect to time) speciation-coalescent thresholds among lineages.
4. The precision of the predicted species boundaries varied among the studies, and the variance-to-mean ratio for richness estimates ranged from 0.023 to 0.079. Sample-based estimates of gut bacteria richness revealed that accounting for uncertainty during species delineation increased the variance in the estimates of population means (by individual from which the samples were taken or by sex of the individuals) by up to 7.5%.
5. In ecological and evolutionary studies, conclusions are highly dependent on the classification system that is adopted; given the uncertainty in species boundaries observed here, ignoring this source of error (as is common practice) likely results in inflated type I error rates. The approach described here represents an objective, theory-based method for predicting species boundaries and explicitly incorporates uncertainty in the classification system into biodiversity estimation, thus allowing researchers to better address the causes and consequences of biodiversity.

2041-210X/asset/olbannerleft.gif?v=1&s=1e80b9367ecb31f1b5cc2830ff8e98a4f5dd6f23)
2041-210X/asset/olbannerright.gif?v=1&s=0a5762ce8b20769bd5f856575232909b64428b79)
