Statistical inferences in phylogeography



    1. Departments of Integrative Biology and Statistics, University of California, Berkeley, 4096 VLSB, Berkeley, CA 94720, USA
    2. Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 Kbh O, Denmark
    1. School of Biological Sciences, University of Reading, PO Box 68, Whiteknights, Reading RG6 6BX, UK
Rasmus Nielsen


In conventional phylogeographic studies, historical demographic processes are inferred from the geographical distribution of individuals represented on an inferred gene tree. However, interpreting gene trees in this context can be difficult, because the same demographic or geographical process can, by chance, produce many different genealogies. Likewise, the same gene tree can arise under different demographic models. This problem has led to the emergence of many statistical methods for making phylogeographic inferences. A popular approach based on nested clade analysis is challenged by the fact that part of the interpretation of the data is left to the subjective choices of the user, and it has been argued that the method performs poorly in simulation studies. More rigorous statistical methods based on coalescence theory have been developed; however, these methods may also be challenged by computational problems or poor model choice. In this review, we describe the development of statistical methods in phylogeographic analysis and discuss some of the challenges facing these methods.