A means of estimating the completeness of haplotype sampling using the Stirling probability distribution


I present a tool for use in phylogeography that helps estimate the completeness of haplotype sampling from two quantities: the number of individuals analysed and the number of different haplotypes they show. By applying the Stirling probability distribution and Bayes’ theorem, one obtains a posterior probability distribution of the total number of haplotypes, including those not yet observed. This allows one to judge whether the data are complete enough for further analysis. A program for calculating the posterior probabilities is available at http://www.botanik.univie.ac.at/plantchorology/haplo.htm.
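
The calculation described above can be illustrated with a minimal sketch. It is not the program referred to in the abstract. It rests on assumptions that the abstract does not state: all haplotypes are taken to be equally frequent, so that the probability of seeing k distinct haplotypes among n individuals when h exist in total is the classical occupancy form S(n, k) h! / ((h - k)! h^n), with S(n, k) a Stirling number of the second kind; and the prior on h is taken to be uniform between k and a hypothetical upper bound `h_max`. The function names are likewise illustrative.

```python
from fractions import Fraction
from math import factorial


def stirling2(n, k):
    """Stirling number of the second kind S(n, k), by the standard recurrence."""
    if k < 0 or k > n:
        return 0
    row = [1] + [0] * k  # row[j] holds S(i, j); start from S(0, 0) = 1
    for i in range(1, n + 1):
        # Update from high j to low so row[j - 1] is still the previous row.
        for j in range(min(i, k), 0, -1):
            row[j] = j * row[j] + row[j - 1]
        row[0] = 0  # S(i, 0) = 0 for i >= 1
    return row[k]


def likelihood(k, n, h):
    """P(k distinct haplotypes | n individuals, h haplotypes in total).

    Assumption: all h haplotypes are equally frequent.
    """
    if not 1 <= k <= min(n, h):
        return Fraction(0)
    falling = factorial(h) // factorial(h - k)  # h (h-1) ... (h-k+1)
    return Fraction(stirling2(n, k) * falling, h ** n)


def posterior(n, k, h_max):
    """Posterior over the total number of haplotypes h, via Bayes' theorem.

    Assumption: uniform prior on h = k, ..., h_max (h_max is hypothetical).
    """
    if not 1 <= k <= n or h_max < k:
        raise ValueError("need 1 <= k <= n and h_max >= k")
    weights = {h: likelihood(k, n, h) for h in range(k, h_max + 1)}
    total = sum(weights.values())  # the uniform prior cancels on normalising
    return {h: float(w / total) for h, w in weights.items()}


if __name__ == "__main__":
    # Example: 20 individuals showing 3 different haplotypes.
    post = posterior(n=20, k=3, h_max=50)
    for h in range(3, 7):
        print(h, post[h])
```

Under these assumptions, a sample of 20 individuals showing only 3 haplotypes places most of the posterior mass on h = 3, which would suggest that sampling is close to complete. The result depends on the chosen prior and on the equal-frequency assumption, so it should be read as an illustration rather than as a reproduction of the program's output.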