Nei's gene diversity and the index of average differences are identical measures of diversity within populations
Article first published online: 29 OCT 2003
Volume 52, Issue 5, pages 533–535, October 2003
How to Cite
Kosman, E. (2003), Nei's gene diversity and the index of average differences are identical measures of diversity within populations. Plant Pathology, 52: 533–535. doi: 10.1046/j.1365-3059.2003.00923.x
- Issue published online: 29 OCT 2003
- Article first published online: 29 OCT 2003
- Accepted 12 June 2003
- diversity indices;
- population genetics
It has been established that Nei's measure of the average genetic diversity per locus, HS, and the measure of average differences between isolates with respect to simple mismatch dissimilarity, are identical measures of diversity within populations. The Müller index of diversity can be considered as the correction of Nei's measure for small samples.
Various indices are used for analyses and comparisons of diversity within plant pathogen populations. These include Nei's measure of the average gene diversity per locus, HS (Nei, 1973); the measure of average differences (dissimilarities) within populations, ADW (McCain et al., 1992); and the Müller index, Mu (Müller et al., 1996). The Nei index is the most frequently used in applications, although the average dissimilarity between isolates is also employed quite often (Adhikari et al., 1999; Kolmer & Liu, 2000; Menzies et al., 2003). Sometimes these two indices are used together (Gale et al., 2002) to enhance the significance of conclusions. The objective of this short communication is to prove that the three above-mentioned indices are actually the same measure of diversity within populations in the case of binary data. This statement also holds true in the case of multiple states of multiallelic loci. The proof for multiallelic loci is similar to that for binary data, but is rather lengthy – it is not included in this letter, but is available from the author on request.
Consider a sample from population P, which consists of n individuals tested on k differentiating factors and represented by binary patterns. The frequency of appearance 1 at the sth differentiating factor for population P is denoted by qs. For example, if the differentiating factors represent a typical set of differential host lines used in virulence tests, qs would be the frequency of virulence in population P on the sth differential line.
The indices of average differences within populations depend on the measure of dissimilarity between isolates. Consider the simple mismatch coefficient of dissimilarity
where xi and xj are isolates from population P(i, j = 1, … , n) and d(xi, xj) is the number of characters for which two isolates xi and xj respond differently. The index of average differences with respect to the simple mismatch coefficient m is defined as:
The product of ADWm by the number of differentiating characters k is also a usable index (Table 1 in Gale et al., 2002):
This is the average number of characters for which two arbitrary isolates from population P respond differently.
Müller's mean dissimilarity index Mu (Müller et al. (1996) was defined as the mean number of virulence loci differences between all pairs of different isolates in a sample. Its normalized version (dividing by the total number of loci) has the form:
where 1 ≤ i < j ≤ n, that is, only one of the pairs of isolates (xi, xj) and (xj, xi) is considered, and [n(n − 1)]/2 is the total number of such pairs.
The Müller diversity Mu within population P can be expressed by the index of average differences within population ADWm with respect to the simple mismatch dissimilarity:
Nei's measure of the average gene diversity per locus HS (Nei, 1973) is determined by the formula:
where k is the total number of loci (differentiating factors), HSs = 1 − − (1 − qs)2, and qs is the frequency of one of the two alleles at the sth diallelic locus (or virulence frequency, or band frequency, or frequency of appearance 1 at the sth differentiating factor).
It was mentioned by Manisterski et al. (2000) that the Müller index is a function of virulence frequencies, and the corresponding formula was presented. It will be demonstrated here that the measure of average differences with respect to the simple mismatch coefficient ADWm equals the Nei gene diversity parameter HS, and the Müller index Mu could be considered as the correction of Nei's measure HS for small samples.
Dissimilarity ds between two isolates xi and xj with regard to any character s can be measured as follows: ds(xi, xj) = 0 if xi and xj respond identically on s, and ds(xi, xj) = 1 if xi and xj respond differently on s. The dissimilarity d between these isolates equals to the following sum:
There are nqs and n(1 − qs) isolates with positive (1) and negative (0) responses, respectively, on the differential s. Then (nqs)2 pairs of isolates respond positively on s, [n(1 − qs)]2 pairs of isolates respond negatively on s, and n2 – (nqs)2 –[n(1 – qs)]2 = 2n2qs(1 − qs) pairs of isolates respond differently on s, where n2 is the total number of pairs. Thus:
and the following equalities are fulfilled:
This means that Nei's measure of the average gene diversity per locus, HS, and the index of average differences with respect to the simple mismatch coefficient, are identical measures of diversity within populations. In the case of diallelic loci (binary data) the maximum value of the HS and ADWm indices equals 0·5. The Müller index Mu could be considered as the correction of Nei's measure HS for small samples because:
A more accurate unbiased estimate of HS for a small sample size is given by the formula:
Let population P consist of six isolates (n = 6) tested on eight differentials (k = 8):
The Nei index scores the following value of within-population diversity:
The matrix of dissimilarities between isolates according to the simple mismatch coefficient m has the following form:
The index of average differences with respect to the simple mismatch coefficient equals the sum of all entries of the dissimilarity matrix divided by the total number of entries:
and the Müller index equals the sum of all entries above the diagonal of the dissimilarity matrix divided by the number of entries above the diagonal:
Thus the results obtained by Nei's measure of the average gene diversity per locus, HS, the measure of average differences within population, ADWm, and the Müller index are absolutely comparable. The simultaneous application of these measures is unnecessary.
This study was partially supported by the Leiberman–Okinow foundation.
- 1999. Genotypic and pathotypic diversity in Xanthomonas oryzae pv. oryzae in Nepal. Phytopathology 89, 687–94. , , ,
- 2002. Population analysis of Fusarium graminearum from wheat fields in eastern China. Phytopathology 92, 1315–22. , , , , ,
- 2000. Virulence and molecular polymorphism in international collections of the wheat leaf rust fungus Puccinia triticina. Phytopathology 90, 427–36. , ,
- 2000. Comparative analysis of indices in the study of virulence diversity between and within populations of Puccinia recondita f. sp. tritici in Israel. Phytopathology 90, 601–7. , , , ,
- 1992. Inter- and intrapopulation isozyme variation in collections from sexually reproducing populations of the bean rust fungus, Uromyces appendiculatus. Mycologia 84, 329–40. , , ,
- 2003. Use of inter-simple sequence repeats and amplified fragment length polymorphisms to analyze genetic relationships among small grain-infecting species of Ustilago. Phytopathology 93, 167–75. , , , , ,
- 1996. Analysis of diversity in populations of plant pathogens: the barley powdery mildew pathogen across Europe. European Journal of Plant Pathology 102, 385–95. , , , ,
- 1973. Analysis of gene diversity in subdivided populations. Proceedings of the National Academy of Sciences, USA 70, 3321–3. ,
- 1978. Estimation of average heterozygosity and genetic distance from a small number of individuals. Genetics 89, 583–90. ,