Cattle
Traditionally, taurine cattle (Bos taurus) and zebu (Bos indicus) are considered as separate species despite their complete interfertility (Lenstra & Bradley 1999). One of the first contributions of DNA research to a reconstruction of the domestication of cattle was a comparison of the mitochondrial DNA (mtDNA) of taurine and indicine cattle (Bradley et al. 1996). The divergence of their control regions implied separate domestications, which most likely started c. 8000 years bc in Southwestern Asia and the Indus valley respectively (Zeder et al. 2006).
Zebus were probably imported into Africa after the Arabian invasions in the 7th century (Bradley et al. 1998). Interestingly, the discovery that African zebus carry taurine mtDNA implies that African zebus were the result of crossing zebu bulls with taurine cows (Bradley et al. 1998). The resulting distribution of taurine, indicine and mixed phenotypes correlates with the Y-chromosomal INRA124 microsatellite alleles (Hanotte et al. 2000), satellite DNA polymorphism and AFLP patterns (Nijman et al. 1999). Microsatellite genotypes allowed a reconstruction of zebu migration routes (Hanotte et al. 2002). In West Africa, zebu introgression is counteracted by the tsetse resistance of the native taurine breeds (Freeman et al. 2004, 2006b; Ibeagha-Awemu et al. 2004).
In Asia, zebu and taurine cattle dominate in the south and the north respectively. This again established central hybrid zones in China (Cai et al. 2006, 2007; Lai et al. 2006; Zhang et al. 2007a) and Central Asia (Kantanen et al. 2009). More to the north, indicine mtDNA was found in Mongolia (20%), but Japanese and Korean cattle are completely taurine (Mannen et al. 2004). Kikkawa et al. (2003) described male taurine introgression in zebus from Bangladesh and Nepal. Interestingly, of six Nepalese zebus, five carried the expected zebu mtDNA, but one animal originated via the maternal lineage from yak (Bos grunniens).
Following the European discovery of America in 1492, cattle were brought over from Spain and Portugal. Later, Indian zebu cattle were imported to Central and South America because of their adaptation to hot and dry conditions. Because mainly bulls were imported and crossed with Creole cattle, the Brahman zebu breed carries taurine mtDNA, while Brazilian Nellore and Gir carry both taurine and indicine haplotypes (Meirelles et al. 1999). The humpless Creole cattle are thought to be descendants of Iberian imports, but depending on the breed 40–100% of the bulls harbour the zebu Y-chromosome (Giovambattista et al. 2000; Ginja et al. 2010). For Argentinean and Bolivian Creole cattle, autosomal microsatellites indicate 2–5% zebu admixture (Liron et al. 2006b). A network analysis of microsatellite-based genetic distances and model based clustering showed an intermediate position of five Brazilian Creole breeds between modern taurine and Brazilian zebu breeds with 10–20% zebu introgression (Egito et al. 2007).
Crosses of zebu and taurine with banteng (Bos javanicus), which are wild cattle from Southeast Asia, yield fertile female and sterile male offspring (Lenstra & Bradley 1999). Domestic cattle in Southeast Asia and Indonesia are thought to be of hybrid origin via crossing of zebu with Bali cattle, which is a domestic form of the banteng. Indeed, Kikkawa et al. (2003) and Mohamad et al. (2009) found banteng mtDNA in Indonesian zebus, most notably in the Madura (56%) and Galekan (94%) breeds. The mixed species origin of Indonesian zebus was confirmed by microsatellite analysis (Mohamad et al. 2009). Analysis of mtDNA, Y-chromosomal DNA and microsatellites indicated a purely banteng origin of Indonesian Bali cattle. However, mtDNA and nuclear DNA in a Bali cattle population kept in Malaysia was of mixed zebu-banteng origin (Nijman et al. 2003).
The wild ox or aurochs (Bos primigenius), which is the ancestor of both taurine and indicine cattle, lived in the European forests until its extinction in 1627, so hybridization with domestic cattle originating from Southwestern Asia (Troy et al. 2001) is an obvious possibility. In 59 fossil aurochs bones, Edwards et al. (2007b) found one mtDNA haplogroup (P) in all except one sample, which had a different haplotype (E). Both P and E are distinct from the taurine haplogroup T. This would exclude a recruitment of aurochs cows for use as livestock, but exceptions seem to confirm the rule: the P haplotype is present in less than 0.1% of modern cattle samples (Achilli et al. 2009; Stock et al. 2009), while the related Q and R haplotypes are also found sporadically (Achilli et al. 2008, 2009).
However, the extent to which aurochs contributed to modern cattle via male introgression is not yet clear. Götherström et al. (2005) defined a Y1 haplotype in most North-European breeds and a Y2 haplotype in most other European cattle and in Southwest Asia. Y1 was also found in fossil aurochs remains, but this was not in agreement with later findings (Svensson & Götherström 2008). Bollongino et al. (2008) found Y2 haplotypes in several European samples for which the aurochs origin was verified via the mtDNA P-haplotype, which raises the possibility that Y2 carrying bulls have also descended from aurochs bulls.
Mitochondrial DNA, as well as nuclear polymorphisms, have revealed several other aspects of the early differentiation of taurine cattle. The predominance of one taurine mtDNA haplogroup (T1) in Africa (Troy et al. 2001) and a new haplogroup in Eastern Asia (T4: Mannen et al. 2004; Kantanen et al. 2009) suggested two other regions of domestication. However, complete mtDNA sequences showed that T1 and T4 are closely related to the major T3 haplogroup, so their predominance probably reflects founder effects in Africa and Eastern Asia respectively (Achilli et al. 2009).
The T3 mtDNA haplogroup is predominant in most European breeds and Northern Asia (Kantanen et al. 2009) and is one of the four major haplogroups (T, T1, T2 and T3) in Southwestern Asia. By contrast, in the African taurine cattle haplogroup T1 is dominant, which is rare in Southwestern Asia. These observations are in line with a Southwest-Asian origin of European cattle, confirming the paleontological evidence of a gradual introduction of domestic cattle in Europe from Southwestern Asia (Zeder et al. 2006). There are two interesting exceptions to the T3 dominance in Europe. First, four ancient breeds from Tuscany have almost the same mtDNA diversity as found in Southwestern Asia, suggesting an ancient maternal origin and a direct link between Tuscan and Western-Asian cattle (Pellecchia et al. 2007). For the Chianina breed this was confirmed by microsatellite data (European Cattle Genetic Diversity Consortium, unpublished results). Microsatellites also indicated that two other Tuscan breeds, the Maremmana in the south and the Cabannina in the north have been subject to Podolian and Brown Mountain breed introgression respectively. Cattle east of the Appennines and on Sicily are of the Podolian type and were most likely introduced during the Middle Ages (Felius 1995).
Autosomal protein polymorphisms (Medjugorac et al. 1994), microsatellite data (Cymbron et al. 2005; Li et al. 2007; Medugorac et al. 2009) and AFLP fingerprinting (Negrini et al. 2007) are in line with a demic expansion of agriculture from southeastern to northwestern Europe. Cymbron et al. (2005) observed that the correlations between genetic and geographical distances are different for Mediterranean and Northern breeds; it is proposed that this reflects the separate Neolithic migrations along the Mediterranean coasts and the Danube respectively. A larger set of microsatellite data (Lenstra et al. 2006b; Lenstra 2008) indeed indicates a separate position of Mediterranean cattle, but divides the Transalpine cattle into two different clusters of breeds: Central-European (Alpine, Southern-French) and Northern European. The separate position of Central-European cattle was also indicated by AFLP data (Negrini et al. 2007). Strikingly, the Northern-European cluster largely coincides with a high diversity of milk protein genes (Beja-Pereira et al. 2003), the distribution of the human lactase persistence alleles and the location of Neolithic cattle farming sites. This led to the suggestion of a gene-culture co-evolution between cattle and humans (Beja-Pereira et al. 2003).
Molecular data have also generated information on the history of individual breeds. A major determinant of the genetic constitution of a breed is its degree of isolation from other breeds. For instance, the Jersey is a typical island breed that has been kept isolated since 1789. This has led to a limited degree of inbreeding (Chikhi et al. 2004), but has also preserved unique features. Inbreeding has gone further in two Balearic Island breeds, in a Betizu subpopulation (Martin-Burriel et al. 2007), and in the Spanish Lidia (fighting cattle: Cañón et al. 2008). The most extreme inbreeding has been observed in English Chillingham cattle, which have become almost completely homogeneous by strict isolation of one herd for hundreds of years (Visscher et al. 2001). Often, but not always, genetic isolation has led to phenotypic uniqueness. This has also been the case for the Italian Chianina (see above) and is an obvious argument for conservation.
At the other end of the scale are the several breeds that have been shaped by gene flow from other breeds. For instance, Northern-Russian cattle have been influenced heavily by modern commercial cattle (Li et al. 2007; Kantanen et al. 2009). On a comparable scale, several Scandinavian breeds have been upgraded by the Scottish Ayrshire (Tapio et al. 2006a). A rustic Spanish breed, Serrana di Teruel, was clearly influenced by brown mountain cattle (Martin-Burriel et al. 2007). Several other introgressions have been indicated by a Europe-wide microsatellite dataset (European Cattle Genetic Diversity Consortium, unpublished results). This has been rather extreme for the Portuguese Minhota, which has been upgraded with German Yellow bulls (Felius 1995) to the point that it has become virtually identical to the German breed.
Genotypes from 30 microsatellites for 69 European breeds were used for testing formal criteria for conservation (Lenstra & the European Cattle Genetic Diversity Consortium 2006a). The popular Weitzman method, based on genetic distances, favours highly inbred populations even if these have been derived recently from other populations. Ranking of conservation priorities on the basis of marker-estimated kinships was less influenced by inbreeding, and favoured Mediterranean breeds (Lenstra & the European Cattle Genetic Diversity Consortium 2006a). These breeds indeed have a relatively high degree of molecular diversity, which next to phenotypic uniqueness is an obvious argument for conservation. Moreover, the Busa and Anatolian breeds were considered to be valuable genetic resources on the basis of their high genetic diversity (Medugorac et al. 2009). Conservation priorities of Nordic cattle were analysed by Bennewitz et al. (2006) and Tapio et al. (2006a).
Yak
Yak (Poephagus grunniens) is a bovine species that can hybridize with taurine and zebu cattle and produce fertile females but sterile males (Lenstra & Bradley 1999). It is a unique livestock species on the Qinghai-Tibetan Plateau of western China, in the Mongolian and Russian steppes and in other Himalayan countries (Wiener et al. 2003; Wiener & Jianlin 2005). The state of development of molecular markers and genetic research on the yak was reviewed by Jianlin (2003).
Recently, the genetic diversity of yak has been examined. mtDNA cytochrome b and D-loop sequences revealed two halogroups within domestic yak, which diverged at least 100 000 years ago (Guo et al. 2006; Lai et al. 2007). Haplotypes of both groups were found in a single, small, wild yak population, thus indicating that the domestic Chinese yak were derived from a single wild gene pool. A domestication event was estimated to have taken place around the early Holocene, within 10 000 years before present (YBP) in Qinghai and Tibet. No pattern of phylogeographical distribution of major clades in Chinese yak sampled from different localities in south-western and north-western China was found (Guo et al. 2006; Lai et al. 2007). A study with intensive sampling of domestic yak from all the yak-keeping countries, including China, Bhutan, Nepal, India, Pakistan, Kyrgyzstan, Mongolia and Russia, revealed a third, less frequent, haplogroup (Qi et al. 2008). Geographical clines in the haplogroup diversity indicated that a single domestication on the Eastern Qinghai-Tibetan Plateau was followed by a westward migration passing through the Himalayan and Kunlun mountain ranges, and northward migration through South Gobi and the Gobi Altai mountains to Mongolia and Siberia.
Cross-species amplification of 136 bovine microsatellite markers revealed a high success rate up to 95% (Minqiang et al. 2003; Nguyen et al. 2005; Xuebin et al. 2005). Several of these are included in the list of markers recommended by the ISAG/FAO working group for yak (Hoffmann et al. 2004). Using 15 microsetellites, Xuebin et al. (2005) found high genetic diversity within the Mongolian and Russian yak populations. The Gobi Altai, south Gobi and north Hangai populations in Mongolia are closely related, as are the Hovsgol and the Buryatia populations in Mongolia and Russia respectively. These groups of populations should therefore be considered as distinct genetic entities for conservation and breeding programmes.
Cross-species amplification of bovine Y-chromosome specific markers now allows the analysis of paternal lineages (Xuebin et al. 2002). In addition, a complete yak mtDNA genome sequence (Gu et al. 2007) and several bovine SNPs that are also polymorphic in yak will contribute further to the understanding of the genetic constitution of yak populations.
Water buffalo
The domestic water buffalo Bubalus bubalis is thought to have been domesticated in the Indus and Yangtze valley civilizations 5000 years ago (Cockrill 1981). Domestication was also proposed to have occurred in China as early as 7000 years ago (Chen & Li 1989). However, this was not in agreement with mtDNA sequences of ancient remains of the endogenous Bubalus mephistopheles, which did not establish a link with the modern domestic water buffalo (Yang et al. 2008). Representations of buffalo appear on seals of the Indus valley and Mesopotamia from the third millennium bc (Zeuner 1963). The ancestral wild water buffalo Bubalus arnee was common across the Indian subcontinent, but numbers have decreased because of environmental pressures and hybridization with domestic populations. The wild form is now listed as endangered and is thought to survive only in a few areas of India, Nepal, Bhutan and Thailand (Scherf 2000).
Water buffalo have historically been divided into swamp and river buffalo based on morphological, behavioural and geographical criteria. The two types also differ in chromosome number: swamp 2n = 48, river 2n = 50 (Ulbrich & Fischer 1967; Fischer & Ulbrich 1968), because of a telomere-centromere tandem fusion between two chromosomes in river buffalo (Di Berardino & Iannuzzi 1981). River and swamp buffalo will only mate if reared together from calfhood and while first generation hybrids are fertile it has not been confirmed whether fertility persists in subsequent generations (Fischer & Ulbrich 1968). They are sometimes referred to as different subspecies; river as Bubalus bubalis bubalis and swamp as Bubalus bubalis carabenesis. Swamp buffalo bear a closer morphological resemblance to wild buffalo than do river buffalo.
Swamp buffalo are found throughout Southeast Asia and China. There are no recognized breeds, although some geographical populations have local names and have been shown to differ in morphology and environmental adaptation (Chen & Zu 2004). River buffalo are mainly found in the Indian subcontinent and westwards through Southwestern Asia and Mediterranean countries. Buffalo have recently been introduced to Africa, South America and Australia. Well-recognized and morphologically defined river buffalo breeds exist in India and Pakistan, but 70% of river buffalo do not belong to any named breed and are classified as non-descript (Arora et al. 2004). The geographical ranges of river and swamp buffalo overlap in East India and Bangladesh. Sri Lankan buffalo are morphologically similar to swamp buffalo but analyses of chromosome number, microsatellites and mtDNA identify them as river buffalo (Barker et al. 1997b; Lau et al. 1998). Genetic differentiation of both river and swamp populations is of the same order of magnitude as that between well-recognized breeds of other domestic species (Barker et al. 1997b).
Estimates of the time of divergence of river and swamp buffalo vary widely, but all predate the domestication of buffalo. The estimates range between 10 000–15 000 YBP (Barker et al. 1997a), 28 000–87 000 YBP (Lau et al. 1998), more than 700 000 YBP (Tanaka et al. 1995), 1 million YBP (Amano et al. 1994) and 1.7 million YBP (Tanaka et al. 1996). A study of 30 microsatellites found a river-swamp differentiation of 30.2% (Zhang et al. 2007b), and studies of mtDNA found an average sequence divergence of 8.6% for D-loop and 2.6% for cytochrome b (Kumar et al. 2007a), of the same order as the differentiation between Bos taurus and Bos indicus in cattle.
Initial analyses of a short region of the mitochondrial D-loop found haplotypes shared between river and swamp buffalo, which is consistent with a single domestication event (Lau et al. 1998; Kierstein et al. 2004). However, studies of longer regions of the D-loop and of cytochrome b all support the hypothesis of separate domestications of river and swamp buffalo, probably in the Indus and Yangtze valley civilizations in the second millennium bc (Kumar et al. 2007b; Lei et al. 2007). Research into Chinese swamp buffalo populations revealed two maternal lineages: Swamp A and B. The more common lineage A was found in 81.5% of samples, but lineage B was present in five of the seven populations sampled, with no clear geographical pattern of A and B distribution. The estimated time of divergence of the two lineages was 18 000 YBP, and while both show indications of population expansion, lineage A appears to be a more recent expansion (Lei et al. 2007).
Microsatellite analyses in buffalo have focused on the defined river breeds of India and local swamp populations of China. Most genetic diversity in buffalo lies within breeds, and estimates of the percentage of diversity between populations vary between 2.8% in Chinese swamp populations (Zhang et al. 2007b), 3.4–9.69% in Indian river breeds and local populations (Kumar et al. 2006; Vijh et al. 2008), and 5.7% in Italian, Greek and Egyptian river breeds (Moioli et al. 2001). Most of these values are low compared with other species [7.11% for cattle (MacHugh et al. 1998), 8% in horses (Cañón et al. 2000), 13% in pigs (Martinez et al. 2000)]. This may be because buffalo have not undergone the same degree of isolation and rigorous selection and widespread use of artificial insemination in the creation of established breeds. Mean expected heterozygosity also varies between studies; it was 0.535 in Chinese swamp populations (Zhang et al. 2007b), 0.506 in Southeast-Asian swamp populations (Barker et al. 1997a), 0.71–0.78 and 0.63–0.73 in Indian river populations (Kumar et al. 2006; Vijh et al. 2008), and 0.577–0.605 in river buffalo of Mediterranean countries (Moioli et al. 2001).
The previous systems of grouping buffalo breeds based on morphology and geography (Cockrill 1981) do not correlate well with genetic diversity patterns. amova analysis of microsatellite data found that, when Indian breeds were divided by either geography or morphology, <1% of the genetic diversity lay between groups (Kumar et al. 2006). However, a DA distance tree and principal component and structure analyses of microsatellite genotypes in Chinese swamp buffalo populations revealed several geographical clusters: in the upper and middle reaches of the Yangtze valley, in the lower reaches of the valley, in Southern China and in Southwestern China. The first two components of the principal component analysis also divided populations on North/South and East/West axes (Zhang et al. 2007b). The genetic distances between Chinese populations also correlated with the geographical distance between them; one study of Indian populations found no such correlation and another only found it after the removal of a population and several loci, which were out of Hardy–Weinberg equilibrium (Kumar et al. 2006; Zhang et al. 2007b; Vijh et al. 2008).
Most analyses of river buffalo have focused on the minority that form recognized breeds. Inclusion of two local non-descript populations in a microsatellite study showed similar levels of within- and between-population diversity as the recognized breeds (Vijh et al. 2008). Such local populations may be valuable reservoirs of genetic diversity, which is threatened by modern breeding practices. Murrah buffalo are a popular breed, and increased use of Murrah sperm for artificial insemination is decreasing the genetic diversity between buffalo populations (Sethi 2001). In a multidimensional scaling analysis of microsatellite data, Murrah buffalo cluster with several other breeds of northern, central and western India, possibly because of this ongoing admixture. The Toda breed is reared by the Toda tribe in the Nilgiri hills of South India and is both culturally and religiously significant to the tribe, and is also endangered as a result of its low numbers. Microsatellite and mtDNA studies identified the Toda breed as genetically distinct from other recognized breeds and in need of conservation, but Vijh et al. (2008) found that the geographically close local population of Kalasthi buffalo cluster with the Toda breed, demonstrating the importance of considering local populations as well as breeds when deciding on conservation priorities.
Sheep
Sheep (Ovis aries) were domesticated in Southwestern Asia about 12 000 years ago and thus represent one of the earliest livestock animals (Zeder et al. 2006). As with other domestic animals (Bruford et al. 2003), relationships with ancestor species have been investigated via comparison of mtDNA data. Hiendleder et al. (2002) found two haplogroups A and B, which both were different from the sequences in any extant Ovis species. The European mouflon (Ovis musimon) carries haplogroup B, but this is a feral form of early European domesticates. Most likely, sheep descend from one or more Asiatic mouflon (Ovis orientalis) populations (Hiendleder et al. 2002).
Several reports have further analysed the geographical distribution of haplotypes. The most relevant information has been summarized by Meadows et al. (2007). The main haplotypes A and B are both found in Asia, while B dominates in Europe (see also Bruford et al. 2003; Meadows et al. 2005). A high frequency of A in New Zealand resulted from early imports of Indian animals into Australia (Hiendleder et al. 2002). Haplotype C is less frequent, but has been found in Portugal, Turkey, the Caucasus and China (Tapio et al. 2006b). Haplotype D, present in Rumanian Karachai and Caucasian animals, is possibly related to the A haplotype. Haplotype E, which is intermediate between A and C, is also rare and has only been found in two Turkish animals.
This mtDNA diversity with distinct haplogroups is comparable with what is observed in goats and cattle, although the divergence of sheep haplogroups is less pronounced than the taurine–zebu divergence (Bruford et al. 2003). Furthermore, in contrast to the taurine cattle haplotypes, the sheep haplogroups hardly correlate with geographical origin. Different lineages might reflect multiple regions of origin, but another obvious possibility is a coexistence of different maternal lineages in the predomestic population.
By contrast, little variation has been observed in the paternal lineage. One SNP in the Y-chromosomal SRY mutation has a high frequency in European breeds (Meadows et al. 2004) and is probably of European origin. The microsatellite SRYM18 defines other haplotypes (Meadows et al. 2006), but except for the major haplotypes, these were of low frequency and dispersed over different continents.
A recent study of retrovirus integrations (Chessa et al. 2009) has provided additional information on the introduction of sheep into Europe. A high frequency of one integration or the lack of other integrations indicated an early arrival of the primitive sheep populations (European mouflons, North-Atlantic Island breeds). Another informative retrovirus copy is present in most other European breeds and probably indicates the later arrival of wool-producing sheep. This study also indicated an interesting genetic link of English Jacob sheep with Asian or African populations.
Although several groups have studied the diversity of sheep as revealed by microsatellites, this has provided relatively little insight into the relationship between breeds. One drawback is the unfortunate use of different microsatellite panels, which precludes the combination of datasets (for an overview, see http://www.globaldiv.eu/docs/Microsatellite%20markers.pdf). Another drawback is that there is only little phylogeographical structure; this in contrast to the clear correlation of genetic and geographical structure observed in cattle and goats. In a study of 20 European breeds, amova analysis showed that only 1% of the variation is between regions and less than 3% is between seven types of breed (Lawson Handley et al. 2007). In Baltic breeds, Tapio et al. (2005a) found a general lack of differentiation at the breed level.
On the other hand, with eight microsatellites, Buchanan et al. (1994) observed a clustering of three English breeds relative to Merino-type breeds and to Awassi. So far, most data on phylogeographical relationships of breeds came from the EU Econogene project, which analysed 57 breeds with 31 microsatellites (Peter et al. 2007). Separate positions were observed for three clusters of breeds: Southwest-Asian, Southeast-European and Central- and Western-European. Within the last group, there was a weak differentiation of Merino and Alpine breeds. There was also a clear decline of the heterozygosity and allelic richness from Southwestern Asia and Southeastern Europe to the west and the north-west (Peter et al. 2007), reflecting repeated founder effects during the gradual introduction of domestic sheep into Europe.
In another study, independent coordination analysis suggested a separate position of Northern-European short-tailed sheep, which could be divided into a north-western, northeastern and a heterogeneous Swedish-Norwegian cluster (Tapio et al. 2005b). Santos-Silva et al. (2008) studied the relationships of Portuguese sheep, which were clearly different from the imported Assaf breed. Cinkulov et al. (2008) analysed genetic differentiation of the Pramenka, an indigenous mountain sheep breed of the Balkans. Furthermore, Gizaw et al. (2007) observed a partial differentiation of three breed groups that had been successively introduced to Ethiopia: thin tailed, short- and long-fat-tailed and thick-rumped breeds.
The differentiation of European and Asian sheep and the weak geographical structure of European sheep were confirmed by analysis of a 1536-SNP dataset (Kijas et al. 2009). This study also showed a difference between Asian and African populations and a separate position of the North-Atlantic Soay sheep.
The diversity pattern of European sheep breeds, which is clearly more panmictic than observed for cattle and goats, probably reflects a history of cross-breeding promoted by commercial interests (Lenstra 2005). From the 17th century onwards, Merino sheep from Spain were exported to several European countries (Wood & Orel 2001), while English or Texel rams were also popular sires.
Goats
Goats (Capra hircus) were domesticated about 10 000 years ago in Southwestern Asia, thus in the same period and in the same region as sheep. Although the species are of a similar size, goats found their own use because of their adaptation to marginal conditions. Goats most likely descend from the wild bezoar, Capra aegagrus, (Naderi et al. 2007, 2008). The information available on mtDNA haplogroups has been summarized by Naderi et al. (2007). More than 90% of goats worldwide carry haplogroup A. Haplogroup B has so far been found mainly in Asia and South Africa, C in Southern Europe, D in Asia, F only in the Sicilian Girgentata breed, and G in Southwestern Asia and Northern Africa. Subgroup B1 is restricted to China and Mongolia. Another subgroup of B is reported to be specific to the Canary Islands, which is possibly due to their genetic isolation since their arrival 3000 years ago (Amills et al. 2004). Data on African goats are relatively scarce.
Haplogroups A–G are all present in the bezoar goat (Naderi et al. 2008). The distribution of the haplogroups suggested that eastern Anatolia and possibly Northern and Central Zagros were the most important domestication centres. The diversity of the C-haplogroup indicated a second domestication on the Central Iranian plateau and in the Southern Zagros, but this domestication centre probably did not contribute significantly to the current domestic goat gene pool.
Mitochondrial DNA haplotypes suggested a genetic link between Southwest-Asian and Iberian goats (Pereira et al. in press) and between Southern/Central American goats and Canarian goats (Amills et al. 2009), both via maritime transport.
The prevalent notion that the geographical structure of goats is weaker than for cattle and sheep (Luikart et al. 2001) rests mainly on the worldwide prevalence of haplogroup A. However, the dispersal of A haplotypes seems to be predomestic, and Y-chromosomal data show considerable geographical partitioning. Three Y-chromosomal haplotypes belong to two haplogroups, Y1 and Y2 (Lenstra 2005; Pereira et al. in press). Y2 has not been found in Switzerland and Germany and is scarce in Italy, while it is predominant elsewhere.
Microsatellites also reveal a high degree of geographical structuring, although incompatibility of datasets again limits the scope of most studies to the regional scale. Barker et al. (2001) found a clear correlation of tree topology and genetic distance for Southeast Asian goats. The largest dataset described so far (Cañón et al. 2006) comprises 45 breeds from Europe and the Middle East. Four discrete groups were found: Middle East, central Mediterranean, western Mediterranean and central/northern Europe. Again there was a decline in allelic richness from south-east to north-west, presumably the result of founder effects that also explain the distribution of Y-chromosomal alleles (Cañón et al. 2006). Geographical structuring of microsatellite genotypes was also reported for goat populations from Burkina-Faso (Traoréet al. 2009), India (Rout et al. 2008) and northern Vietnam (Berthouly et al. 2009). Conservation value of Swiss goat breeds on the basis of microsatellite diversity was explored by Glowatzki-Mullis et al. (2008).
The clear phylogeographical structure of European goats probably reflects the style of husbandry. In contrast to the situation of sheep and cattle and with the exception of the widespread use of Swiss dairy animals, goats are of more limited economic importance, and breeding has remained largely a local affair.
In the Econogene dataset, Western Europe was only partially represented. Comparison with Asian and African breeds will probably define additional clusters of breeds. We conclude that further molecular analyses of autosomal and Y-chromosomal diversity of goats offer excellent perspectives to retrieve the history of their domestication and subsequent migrations.
Camelids
The Camelidae family comprises four domesticated species belonging to three genera. The Bactrian camel (Camelus bactrianus) is found throughout Central Asia, and the distribution of the dromedary (Camelus dromedarius) ranges from Central Asia and Southwestern Asia to Northern Africa. The llama (Lama glama) and alpaca (Vicugna pacos) are found in the Andean mountains in South America (Jianlin 2005a,b). All species of the family have the same conservative karyotype (2n = 74) and can produce fertile hybrids between species, both within and even between genera (Skidmore et al. 1999; Potts 2004; Mengoni Goñalons & Yacobaccio 2006; Wheeler et al. 2006). mtDNA sequences and nuclear microsatellite markers support a clear genetic differentiation of wild guanaco (Lama guanicoe) from wild vicuña (Vicugna vicugna) (Stanley et al. 1994; Kadwell et al. 2001; Palma et al. 2002). They also provide genetic evidence for two geographically isolated wild subspecies of the guanaco (L. g. cacsilensis and L. g. guanicoe) (Palma et al. 2002; Gonzalez et al. 2006) and the vicuña (V. v. vicugna and V. v. mensalis) (Palma et al. 2002; Marín et al. 2007). Independent domestications of the llama from L. g. cacsilensis and the alpaca from V. v. mensalis have subsequently been demonstrated (Palma et al. 2002; Wheeler et al. 2006). These occurred 4000–4500 YBP for llama in the South-Central Andes (Mengoni Goñalons & Yacobaccio 2006) or 6000–7000 YBP for both alpaca and llama in the Central Andes (Wheeler et al. 2006). mtDNA analyses recognize the extant wild Bactrian camel as a separate lineage (Jianlin et al. 1999; Ji et al. 2009). Combined mtDNA and microsatellite data further support the recognition of the wild Bactrian camel as a separate subspecies (Camelus gobi or Camelus bactrianus gobi), and suggest different ancestors and separate domestication events for the dromedary and the Bactrian camel (H. Jianlin et al. unpublished data). For the Bactrian camel, this took place 4000 YBP in the eastern part of Central Asia (Mason 1984; Peters & von den Driesch 1997; FAO 2007b). For the dromedary, this occurred 4500–5000 YBP in the Southern Arabian Peninsula (Mason 1984; Peters 1997).
Jianlin (2005a) have reviewed the development of camelid microsatellite markers. Recently, Matéet al. (2005) reported an additional four microsatellite markers. Twenty-five markers are included in the current list of markers recommended by the ISAG/FAO working group for both the New and Old World camelids (Hoffmann et al. 2004). So far, these markers have been used only for studies with a regional scope. Jianlin et al. (2004) suggested that the domestic Bactrian camels from China and Mongolia should be considered as distinct populations in conservation and breeding programmes. Nolte et al. (2005) found no evidence for loss of genetic diversity within, and a very low differentiation among 16 southern African dromedary populations. Mburu et al. (2003) identified two separate genetic entities present in Kenyan dromedaries, namely the Somali dromedary and a group including the Gabbra, Rendille and Turkana populations. Vijh et al. (2007) indicated that there were two distinct genetic clusters in the Indian dromedaries, with the Mewari breed being differentiated from the Bikaneri, Kutchi and the Jaisalmeri breeds.
For the New World camelids, Rieder et al. (2000) found high genetic variation at six microsatellite loci within Swiss New World camelid breeds. Sarno et al. (2001) observed much less variation in an island guanaco population than in the mainland population and a significant genetic differentiation between the two populations in southern Chile. Bustamante et al. (2002) and Matéet al. (2005) reported a high level of genetic diversity in Argentine llamas and guanacos, indicating the Patagonian guanaco to be an important genetic resource for conservation or economic utilization programmes. Sarno et al. (2004) detected higher levels of microsatellite allelic diversity in V. v. mensalis than in V. v. vicugna in Bolivia and Chile.
The structure and organization of the D-loop region of four South American camelid species in Argentina were reported by Matéet al. (2004, 2007), and a high degree of heteroplasmy was found. Complete mtDNA sequences and structure are available for alpacas (Ursing et al. 2000; Arnason et al. 2004), a dromedary (16 643 bp) and domestic and wild Bactrian camels (Cui et al. 2007). Genomic sequence data with 2× coverage (O’Brien et al. 2008) and the identification of 1516 microsatellite loci (Reed & Chaves 2008) as well as 750 000 SNP markers of alpaca will facilitate further studies of the diversity of camelids.
Pigs
Molecular data have shed light on pig domestication by tracing mtDNA. Initial mtDNA studies showed that European and Chinese pigs were domesticated independently from European and Asian subspecies of wild boar (Giuffra et al. 2000), but later studies suggested at least seven domestication events across Eurasia (Larson et al. 2005) and East Asia (Wu et al. 2007). These studies also suggested the occurrence of introgression of Asian domestic pigs into some European breeds during the 18th and 19th centuries. Larson et al. (2007) demonstrated that domestic pigs of Near Eastern ancestry were introduced into Europe during the Neolithic, and that the European wild boar was also domesticated by this time. Once domesticated, European pigs rapidly replaced the introduced domestic pigs of Near Eastern origin throughout Europe.
Y-chromosomal variation demonstrated the existence of two highly divergent and ancient lineages, with an estimated divergence time of c. 0.33 Myr, i.e. in the order of the species age (O. Ramírez, personal communication). A recent study (Ramirez et al. 2009) based on microsatellite, mtDNA and Y-chromosomal data has confirmed the divergence of East-Asian and European pigs. In both regions, wild and domestic populations were found to be related to each other. All three marker types showed that Southwest Asian, African and American pigs were most closely related to the European population, but East-Asian mtDNA and Y-chromosomal haplotypes occurred in East-African and Nicaraguan populations. Anglosaxon, African local pigs and especially the international breeds (e.g. Large White, Landrace and Pietrain) are of mixed European-Asian origin. The almost complete predominance of the HY1 Y-chromosomal haplotype in Europe, including in the international breeds, and in Southwestern Asia, argues against male-mediated introgression and suggests that Chinese introgression in British breeds was mainly maternal.
Fang et al. (2009) investigated genetic variation in the melanocortin receptor 1 (MC1R) gene among 15 wild and 68 domestic pigs from both Europe and Asia to address why coat colour is so much more variable in domestic animals than in their wild ancestors. They found that all mutations were silent in wild animals, suggesting purifying selection, but nine of ten mutations found in domestic pigs resulted in altered protein sequence, suggesting that early farmers intentionally selected for novel coat colour.
Across the world, nearly 400 breeds have been exploited, the largest number of breeds being found in Asia and Europe. In a collaborative EU project (PigBioDiv1), 58 European populations, including local breeds, national varieties of international breeds, privately owned commercial populations, and the Chinese Meishan breed as an outgroup were genotyped for 50 microsatellites and 148 AFLP markers. Data from 11 breeds included in the PiGMaP study (Laval et al. 2000) were also included. The microsatellite data showed that the individual breed contributions to between-breed diversity ranged from 0.04% to 3.94% of the total European between breed-diversity, and that the local breeds accounted for 56% of the total, followed by commercial lines and international breeds (Ollivier et al. 2005). They also applied a cryopreservation potential criterion as proposed by Weitzman (1993), taking into account the risks of extinction. SanCristobal et al. (2006a), analysing the same data, showed a clear structure of the European pig breeds with a FST value of 0.21. With the exception of five local breeds, the between-breed general structure exhibited a star-like tree with no visible phylogenetic relationship between the local and the main international breeds. Even the inclusion of the Chinese Meishan breed as an outgroup did not allow the tree of European breeds to be rooted. SanCristobal et al. (2006b) proposed that AFLPs produce diversity patterns similar to microsatellites and can be combined with microsatellite data. However, Foulley et al. (2006) highlighted the problems arising in the analysis of these types of markers and suggested that AFLPs are more sensitive than microsatellites to selection and/or other forces.
Amaral et al. (2008) evaluated LD and haplotype block structure in 15–25 individuals from each one of 10 European and 10 Chinese breeds genotyped for 1536 SNPs in three genomic regions. The LD extends up to 2 cM in Europe and up to 0.05 cM in China. The authors suggest two possible explanations: either European ancestral stock has a higher level of LD, or modern breeding programmes have increased the extent of LD in Europe. The haplotypic diversity has also been studied in other material, focusing on the IGF2 gene (Ojeda et al. 2008).
The ongoing project PigBioDiv2 covers 50 Chinese breeds and mtDNA and Y-chromosomal regions in addition to the microsatellite data of the European breeds. Trait gene loci and markers will be analysed to seek insight into the functional differences between breeds. The first results on microsatellites using pooled DNA samples indicate that the Chinese breeds show a higher degree of genetic variability than the European breeds both within and between breeds (Megens et al. 2008).
Detailed studies have also been carried out for local breeds in several countries. The important local Iberian breed was analysed using microsatellite markers by Fabuel et al. (2004), who compared different methods, such as Weitzman, cluster analysis or optimal contributions, to establish conservation priorities. Moreover, Alves et al. (2003) showed that, unlike other European breeds, the Iberian breed has not been introgressed with Asian mtDNA. Finally, an allelic richness analysis indicated that the desirable integration of allelic richness into the diversity theory currently poses some unsolved difficulties (Rodrigáñez et al. 2008).
Horses
A total of over 100 distinct equine mtDNA haplotypes have been described in multiple studies focusing on the domestication of horses in general, or on the origin of specific breeds (Bowling et al. 2000; Vila et al. 2001; Jansen et al. 2002; Kavar et al. 2002; reviewed in Kavar & Dovc 2008). Joint analyses showed that, in contrast to the double broomstick topology of mtDNA networks of the principal livestock species (Troy et al. 2001; Bruford et al. 2003), the equine mtDNA network shows a typical star-like branching structure (Jansen et al. 2002; Kavar & Dovc 2008). In a dataset comprising extant horse breeds, as well as wild horses from 12 000 to 28 000 years ago, an unexpectedly high genetic divergence between clades was found (Vila et al. 2001). Analyses of DNA from horses of Chinese tombs from the 3rd century bc (Keyser-Tracqui et al. 2005) and from the Bronze Age (Lei et al. 2009) showed that the high mtDNA diversity is of ancient origin. Clearly, the divergence of horse mtDNA must have predated domestication, which on the basis of archaeological evidence has been dated at around 6000 YBP in a broad region of the Eurasian Steppe. This is consistent with the notion that capture and exploitation of wild mares took place independently in multiple locations over a broad time span (Lister 2001; Vila et al. 2001; Hill et al. 2002; Kavar & Dovc 2008). Apparently, the know-how required for domestication, rather than the domestic animals themselves, spread from one region to the next, challenging the suggestion that the domestication process was confined to a restricted area. However, the horse domestication scenario has recently been complicated by the analysis of matrilines from Lusitano and Soraia populations, which suggests a role of the Iberian Peninsula as a glacial refugium and a possible second centre of horse domestication (Lopes et al. 2005). Interestingly, analysis of fossil remains showed that domestication of horses from 5000 YBP onward was followed by the spread of mutations resulting in a large variety of coat colours (Ludwig et al. 2009).
amova analysis of 72 populations from Europe, Southwestern Asia, Eastern Asia and Africa revealed a non-random distribution of diversity among populations and a clear, although weak, geographical partitioning of mtDNA variation (McGahern et al. 2006). In a few instances, mtDNA has provided evidence for the origins of specific horse breeds. Luis et al. (2006) found Iberian haplotypes in New World breeds with a high frequency, which is in line with historic evidence for the origin of American horses, while Yang et al. (2002) identified Mongolian haplotypes in the Korean Cheju breed. The association of haplogroup F with Eastern-Asia was proposed as an argument for a Chinese domestication of the haplogroup (Lei et al. 2009).
Analysis of Y-chromosomal data supported a strong sex-bias in the domestication process. Lindgren et al. (2004) screened 14.3 kb of non-coding Y chromosome sequence in 52 male horses of 15 different breeds and did not identify a single segregating site. Even though their observations cannot exclude the possibility that Y-chromosomal variation was low before domestication took place, their results strongly suggest that only a few stallions have contributed genetically to the domestic horse.
Several studies have compared horse breeds or assessed the genetic structure of single breeds on the basis of microsatellites. Most of these targeted local breeds and used their own marker panel, meaning that data from different studies cannot be compared directly. Thus, for many breeds, data on genetic diversity are available, but insights into breed relationships are still fragmentary.
The so far unrealized potential of a standardized microsatellite panel for the elucidation of breed relationships is illustrated by three well-supported clusters of two riding breeds (Arabian, Hanoverian), two ‘primitive’ breeds (Exmoor and Sorraia) and six German cold-blooded breeds (Aberle et al. 2004). Similarly, Bigi et al. (2007) found, using only 12 markers, significant clustering of the Thoroughbred and Anglo-Arabian breeds and of Haflinger, Italian heavy draught and Bodaglino. Based on 17 protein and 12 microsatellite markers, Luis et al. (2007) reported eight breed groups among 33 breeds, of which four groups were well supported (Andalusian with Lusitano; Friesian with two pony breeds; Morgan, Standardbred, Rocky Mountain and American Saddlebred; Irish Draught, Quarter Horse, Hanoverian, Holsteiner and Thoroughbred). Microsatellites have also been used to assess possible origins of specific horse breeds. For instance, Kakoi et al. (2007) found evidence for a Mongolian origin of Japanese breeds. Evidence for a relationship of Mongolian and Norwegian breeds on the basis of 26 microsatellites was only incomplete (Bjornstad et al. 2003), but is consistent with the morphological appearance of the Nordic breeds.
Chickens
Among poultry species, chickens are the most important and provide an important source of human food. The red jungle fowl (Gallus gallus) is believed to be the progenitor of the domesticated chicken and has its widest distribution in East Asia, from Pakistan through China, Eastern India, Burma, most of Indo-China, and on the islands of Sumatra, Java and Bali (Crawford 1990). As in other livestock species, sequence variation in mtDNA, in particular in the highly polymorphic control region, has been used to study domestication events and relationships in the chicken. First results with representatives of each of the four wild Gallus species, domestic chickens from Indonesia and two commercial breeds, has suggested that domestic chickens descend from only one species, Gallus gallus, and that a single domestication event took place in Thailand and its adjacent regions (Fumihito et al. 1996). Subsequent studies of samples from various regions in Europe and Asia suggested multiple origins of domestic chickens in South and Southeast Asia, which is consistent with archaeological data (West & Zhou 1988; Liu et al. 2006; Oka et al. 2007). Moreover, whole mtDNA sequences and two nuclear markers revealed that, besides Gallus gallus, Gallus sonneratii and Gallus lafayettii might have also contributed to the genetic make-up of contemporary domesticated chickens, although to a lesser extent (Nishibori et al. 2005). Recently Eriksson et al. (2008) provided further evidence of a hybrid origin of the domestic chickens. They studied sequence variation of the BCDO2 gene in domestic chickens and closely related wild species. BCDO2 encodes beta-carotene dioxygenase 2, which cleaves colourful carotenoids to colourless apocarotenoids and is an obvious candidate gene for skin colour. Sequence comparison revealed that yellow skin, a common feature of many breeds of domestic chicken, does not originate from the red jungle fowl (Gallus gallus), but is most likely from the grey jungle fowl (Gallus sonneratii), a wild relative of domestic fowl found in India. A study of African domestic chickens revealed the presence of two maternal lineages among Zimbabwean, Sudanese and Malawian chickens, one of Southeast Asian and the other of presumably Indian origin (Muchadeyi et al. 2008). mtDNA analyses also showed that modern Chilean breeds, presumed to be of Polynesian origin (pre-Columbian), are actually of Indo-European and Asian origin. Ancient mtDNA haplotypes found in pre-Columbian archaeological chicken remains on Easter Island support the theory of early Polynesian/Pacific chicken transport. Either these haplotypes never reached South America, or they were subsequently displaced by new introductions (Gongora et al. 2008).
Since domestication, chickens have been distributed throughout various countries, continents and cultures. As a result of many years of adaptation and breeding, a wide range of chicken breeds exist today. These encompass more or less unselected indigenous chickens and ecotypes from various regions in the world, standardized fancy breeds selected for morphological traits and maintained for leisure activities, and experimental and commercial lines. An increasing number of local chicken breeds are under threat of extinction, and valuable genotypes and traits may be at risk of being lost (Blackburn 2006).
Insight into the extent of diversity of chicken breeds worldwide has been gained using microsatellites in numerous studies (Wimmers et al. 2000; Berthouly et al. 2008; Chen et al. 2008), including the European research project AVIANDIV and follow-up studies (Rosenberg et al. 2001; Hillel et al. 2003; Granevitze et al. 2007, 2009). Overall, results suggest that Jungle Fowl populations and traditional unselected breeds are widely heterogeneous populations, which include a large portion of the total genetic diversity. Within commercial chickens, broiler lines were slightly more polymorphic than layers. Among the layers, the white layers were less polymorphic than the brown layers. In recent years, there has been concern about reduced genetic variability in commercial white egg layers that have originated from a sole breed, the Single Comb White Leghorn. Although findings of the AVIANDIV project support this concern to some extent, commercial lines still exhibit a considerable amount of variation at microsatellite loci.
Hillel et al. (2007) undertook a large-scale analysis of 2000 individuals from 65 populations representing different chicken types from various geographical regions. Individuals were genotyped at 29 microsatellite loci. Model-based clustering (as implemented in Structure (Pritchard et al. 2000)) indicated that the 65 populations split into groups corresponding to their geographical origin and cultivation history, i.e. Asia, Europe and Africa (Hillel et al. 2007). Using the same dataset, Granevitze et al. (2007) showed that the degree of polymorphism varies between clusters. The relatively low genetic diversity observed in the native European breeds, mainly standardized fancy breeds, was presumably resulting from positive assortative mating and small effective flock size. By contrast, native populations from Africa and Asia had high genetic diversity and did not show a typical population structure. Differentiation was only observed between populations from distant areas and countries (Muchadeyi et al. 2007; Mwacharo et al. 2007; Berthouly 2008; Chen et al. 2008). Rosenberg et al. (2001) demonstrated that it was possible to assign individuals to their correct breeds with 90% efficiency based on only 12 microsatellite markers genotyped in 30 animals from 20 diverse chicken populations. By increasing the number of loci used to 24, accuracy was close to 97% (Rosenberg et al. 2001). Furthermore, small-scale studies analysing only a few local Italian or Japanese fancy breeds showed that these breeds can be genetically identified and that they generally display low genetic diversity (Tadano et al. 2007, 2008; Zanetti et al. 2007).
For 28 of the 30 FAO-recommended microsatellite markers, data for around 100 breeds are currently published. Additional studies, including of West African, South African and Vietnamese breeds, are underway (M. Tixier-Boichard and H. Jianlin, personal communication; S. Weigend, unpublished data). Although merging microsatellite datasets generated in different laboratories is often problematic, Berthouly et al. (2008) succeeded, after calibrating 14 of 22 markers, in combining genotypes from different laboratories.
SNPs now have also become a well-established genetic marker system. In the AVIANDIV project, one SNP per 50 bp was found on average in a subset of ten highly diverse chicken populations (10 individuals per population, Schmid et al. 2005). This frequency is higher than that found by comparing different domestic breeds (Wong et al. 2004). The high frequency found in the AVIANDIV project presumably reflects the wide genetic spectrum of chicken breeds collected. SNP arrays with over 2500 informative SNPs in commercial chicken lines and other resource populations indicated that individual commercial breeding lines have lost 50% or more of their genetic diversity. Only a limited fraction of this loss can be recovered by combining all stocks of commercial poultry (Muir et al. 2008). However, it appears that modern breeding was not the primary source of this loss of alleles, and that many alleles were lost prior to the formation of the current industry.
Andreescu et al. (2007) assessed the extent of LD in nine commercial broiler breeding populations using genotype data for 959 and 398 SNPs on chromosomes 1 and 4 respectively. Results showed that in these lines LD did not extend much beyond approximately 0.5 cM, which is shorter than previously reported for other livestock species. However, it seems to be much larger in White Leghorn-based breeds. Within 1 cM, LD tended to be consistent across related populations. Calculating the correlation of LD between neighbouring SNPs within and between populations closely matched the line relationships based on marker allele frequencies. Thus, there are indications that this approach is equivalent to estimating kinship coefficients, and it might also be of interest for other livestock species. As in other farm animal species, initiatives are underway to develop genomic 60 K SNP arrays for chickens.