The influence of family groups on inferences made with the program Structure
Article first published online: 29 OCT 2008
DOI: 10.1111/j.1755-0998.2008.02355.x
© 2008 The Authors. Journal compilation © 2008 Blackwell Publishing Ltd
Additional Information
How to Cite
ANDERSON, E. C. and DUNHAM, K. K. (2008), The influence of family groups on inferences made with the program Structure. Molecular Ecology Resources, 8: 1219–1229. doi: 10.1111/j.1755-0998.2008.02355.x
Publication History
- Issue published online: 29 OCT 2008
- Article first published online: 29 OCT 2008
- Received 16 July 2008; revision accepted 6 August 2008
- Abstract
- Article
- References
- Cited By
Keywords:
- assignment test;
- family structure;
- multilocus genotype;
- sibling inference;
- unsupervised clustering
Abstract
Unsupervised clustering algorithms, like the program Structure, are increasingly used to infer the presence of population structure from a sample of genotyped individuals. We evaluate the extent to which the presence of related individuals can lead such algorithms to the false inference that there is population structure. First, we demonstrate this problem using a real data set from a rainbow trout (Oncorhynchus mykiss) population. Then we perform an extensive series of simulations involving the program Structure. Our simulations encompass both a simple scenario with fixed numbers of full- and half-siblings in the sample, and a more complicated scenario in which we investigate 360 combinations of population divergence, fraction of population sampled, variance in family size, mating system and number of loci. We find that the inclusion of family members in a sample may produce very strong evidence of population structure, even when population structure is absent. This problem becomes more pronounced when more loci are genotyped, and it is particularly likely in studies of monogamous species, especially if variance in family size is high and a large fraction of a small population has been sampled. Researchers working in such situations should test observed clusters for the presence of family members to distinguish family-induced structure from real population structure. Additionally, this work shows that Structure's ability to estimate the number of subpopulations may be influenced by a number of factors, and therefore should be interpreted guardedly.

1755-0998/asset/olbannercenter.gif?v=1&s=461bba13dc862037c63fd362e625b54da4c20f13)
