How should a dendrogram‐based measure of functional diversity function? A rejoinder to Petchey and Gaston
Abstract
This paper is a response to Petchey and Gaston's criticism of our previous paper on the measurement of dendrogram‐based functional diversity. In contrast to their suggestions, we insist that Euclidean distance is unsuitable to the analysis of variables with mixed scale types and maintain our earlier view that both the distance function and the clustering method can influence the dendrogram‐based measure of functional diversity. We propose an extension of Gower's formula to accommodate nominal traits with non‐exclusive categories and emphasize the necessity of the methodological standardization of functional diversity measures.
Petchey and Gaston (this issue) comment on dendrograms and derived functional diversity (FD) measures, as a response to our criticism (Podani and Schmera 2006) of their earlier work summarized in Petchey and Gaston (2002). The purpose of these authors is “… to refute, clarify, and provide more appropriate solutions to a number of issues …” we raised. We feel, however, that while some of the new proposals contain useful elements and are welcome, refutation of most of our points fails and the way they clarify the problems increases, rather than diminishes the methodological confusion. Therefore, some comments on their paper are in order.
Comparison of the incomparable
Petchey and Gaston are fair enough to acknowledge at least implicitly the inappropriateness of Euclidean distance in the analysis of functional variables with mixed scale types. Nevertheless, with the obvious purpose of defending their old and bad strategy, they redraw the dendrogram for the arctic vegetation data (Chapin et al. 1996) which is created in a manner still unknown to us. Then, they try to show that correct analysis by the Gower measure adds relatively “minor” information and that general conclusions are insensitive to the distance measure used. However, their attitude towards this issue is potentially dangerous and carries a bad message for the scientific community, for several reasons. 1) The fundamental problem is that results produced by an admittedly unsuitable method do not make sense, cannot justify any scientific argumentation and should not be compared to anything else to validate a bad approach. Ecology is often considered by many as “soft science”, and the criticism is right when such inadmissible evaluations appear in the ecological literature. 2) Technical details of the computations are presented neither in Petchey and Gaston (2002) nor in their recent commentary, so that the results cannot be reproduced and tested by anyone else which would otherwise be an essential criterion in scientific research. We do not know how they handled nominal and ordinal variables and missing values in the data when calculating Euclidean distances – which would be interesting to see even if such calculations can only be incorrect. We therefore repeat our warnings that nominal and ordinal variables cannot be standardized in the classical way and cannot be subjected to metric computations. Details of calculating the Gower formula are also missing, so the reader cannot appreciate how Petchey and Gaston handled the single presence–absence variable (double zeros considered or disregarded?) and how they incorporated the six ordinal variables present in the arctic vegetation dataset. The program package they used for clustering is not mentioned. Not surprisingly, we were therefore unable to reproduce exactly the results in Petchey and Gaston's Fig. 1a and 1b, although we tried several combinations of input options. 3) If we forget about the above reservations and insist on comparing the results, we find that the conclusions by Petchey and Gaston are not entirely right. The dendrograms of Fig. 1a and 1b, no matter how obtained, appear radically different in shape at first sight, although there are some agreements in minor details. The trends, displayed by the diagrams on the right of Fig. 1a and 1b, i.e. that FD, presumably standardized by the maximum, increases along with species richness, would be probably similarly monotonous for even more differing dendrograms. But the picture is in fact not as simple as suggested, because the Gower measure produced apparently smaller variances than Euclidean distance, an observation that escaped the attention of the authors (this figure was then corrected before printing). So, we do not understand the reply by Petchey and Gaston that the choice of the method “… may have little effect on quantitative patterns in FD.” In their example, both the dendrograms and the scattergrams exhibit striking differences so the only pattern they may refer to is the monotonous trend, which is less surprising anyway. 4) We cannot interpret the scattergrams of Fig. 1c simply because no captions to vertical axes are given, so the reader can only guess what is going on there. The top diagram appears to be a graphical comparison between the Gower formula and the Euclidean distance, this time without standardization by the maxima (around 15 and 10, respectively). If this is the case, then the differences between the two coefficients are not negligible at all. The diagram on the bottom applies standardization again, making its interpretation difficult not only because the caption on the vertical axis is missing. In general, comparisons are hard to follow when the scale oscillates between the raw and standardized ones! 5) The statement that results are insensitive “… in fact also to the clustering algorithm …” is not substantiated at all, the authors did not demonstrate this, leaving the readers totally unconvinced. In fact, the choice of the clustering method may or may not have substantial effects, which is never known a priori unless a mathematical proof is given. Table 2 in our paper (Podani and Schmera 2006) clearly shows that FD decreases when we shift from complete linkage (CL) to UPGMA and then to single linkage (SL).
The suggestion that the choice of distance coefficient (when measurement scales are uniform) and the clustering method is immaterial because it is the cophenetic correlation coefficient that needs to be maximized also deserves attention. Yes, it is quite possible that the highest cophenetic correlation is not the one produced by UPGMA for a given problem. This is not simply because there are some noted alternatives (centroid, median, SL, CL) but because there is an infinite series of distance optimizing clustering procedures available through the general strategy suggested by Lance and Williams (1966). Therefore, maximization would always require a complete screening of results derived by this general algorithm, of which UPGMA, SL and CL are only special cases, although absolute optima could not be found easily because the controlling parameter of this general procedure is continuous. That is, maximization of cophenetic correlation would overcomplicate the calculations unnecessarily. The same holds true for the choice of the distance coefficient, of which we can also imagine an infinite number of possibilities, such as those offered by the general formula of Faith (1984). That is, the theoretically well‐sound requirement of maximizing the cophenetic correlation coefficient is not feasible in practice and standard use of the same clustering method (such as UPGMA) with Euclidean distance (when appropriate) or Gower formula (when appropriate) is preferable. In their abstract, Petchey and Gaston reject our suggestion to use the above combination of distances and clustering procedure, whereas in the next sentence they argue that the methods “… may have little effect on quantitative patterns in FD”. But then, why worry so much about maximizing the cophenetic correlation?
Multiplicity of character states

The number of traits
The question whether the number of functional traits influences FD is considered irrelevant by Petchey and Gaston. This may be generally true that two different assemblages described in terms of different numbers of traits deserve no comparison, but our examples showed how FD changes when the same assemblage is analyzed with one trait removed at a time. We maintain that this is a meaningful ecological question, because in this way we can identify traits that are most influential in determining FD of a given community. Of course, our purpose with the comparison of the FD of Patagonian forbs with the FD of insectivorous birds was not to ask “… whether a kilometre is longer than an hour”, as Petchey and Gaston interpret the issue. In fact, we demonstrated via this example the lack of positive relationship of Gower‐distance‐based FD and the number of species, rather than the number of traits, so that Petchey and Gaston entirely misunderstood this part of our paper. (Actually, the Patagonian forbs data comprise 6 traits, the bird data include 24, and FD was higher for the bird data in which, however, the number of species is smaller than in the Patagonian forbs dataset.)
Species number
We are glad to see that the improved version of dendrogram‐based FD is freed from the problems the old concept suffered from. It is especially welcome that zero diversity is obtained for the single‐species case and we do not have to consider communities “without species”. The set monotonicity criterion is met by this new measure, but only if examined in “downwards” direction! There is a starting dendrogram derived for a given set of species, and species removals can only decrease – or leave unchanged – the total of remaining branch lengths (Ricotta 2005). This is an obvious advantage over our procedure which does not satisfy this requirement. However, the set monotonicity criterion assumes in general that diversity should not decrease by the addition of a new species to a given species set. As we understand, the new FD does not apply to species additions, and therefore cannot be adapted to situations in which new species appear in a community. Thus, the set monotonicity condition only partially holds. The price that must be paid to guarantee set monotonicity is high because fundamental species pools, which can only be reduced in size according to the new definition of FD, are not always readily available in ecological studies. Also, there is good reason to assume that the functionality of a community may change considerably when, for example, a new invasive species occurs. Recomputing dendrograms may therefore still be a useful strategy in more realistic situations without a predefined species pool such that both species extinction and immigration are allowed. Petchey and Gaston's argument that “… it is not the use of total branch length, but the recalculation of the functional dendrogram for each assemblage, that causes the violation of set monotonicity” by our method is merely a play with words, because total branch length is the statistic and the recalculation of the dendrogram is the process that causes change in this statistic, and these two are not separable in our case.
Conclusions
We maintain that measurement of functional diversity via dendrograms is a complex matter; data treatment, distance function and clustering method used are all potentially influential or even decisive, and that methodological standardization is still possible and necessary. It is difficult to find a procedure such that all mathematical requirements are met at the same time, especially if one thinks not only in terms of species extinction as the only factor affecting species richness. Surely enough, set monotonicity is not a condition that is superior to compatibility of distances with measurement scales. Correct answers to particular ecological questions require careful methodological thinking which does not mean that statistics is more important than ecology. If the ecologist gives more thought to statistical aspects, however, the statistical methods may only better serve the purposes of our field.
Acknowledgements
The financial support to J.P. in form of a Hungarian Research Grant, OTKA T43732 is gratefully acknowledged.
References
Citing Literature
Number of times cited according to CrossRef: 22
- Carlo Ricotta, Fabien Laroche, László Szeidl, Sandrine Pavoine, From alpha to beta functional and phylogenetic redundancy, Methods in Ecology and Evolution, 10.1111/2041-210X.13353, 11, 4, (487-493), (2020).
- Essohouna Modom Banla, Daniel Kwadjo Dzidzienyo, Mouhamadou Moussa Diangar, Leander Dede Melomey, Samuel Kwame Offei, Pangirayi Tongoona, Haile Desmae, Molecular and phenotypic diversity of groundnut (Arachis hypogaea L.) cultivars in Togo, Physiology and Molecular Biology of Plants, 10.1007/s12298-020-00837-8, 26, 7, (1489-1504), (2020).
- Sandrine Pavoine, Carlo Ricotta, A simple translation from indices of species diversity to indices of phylogenetic diversity, Ecological Indicators, 10.1016/j.ecolind.2019.01.052, 101, (552-561), (2019).
- Karen Neves, Mario R. Moura, Jonas Maravalhas, Renata Pacheco, Marcio R. Pie, Ted R. Schultz, Heraldo L. Vasconcelos, Functional richness shows spatial scale dependency in Pheidole ant assemblages from Neotropical savannas, Ecology and Evolution, 10.1002/ece3.5672, 9, 20, (11734-11741), (2019).
- Danish Ali Ahmed, Peter M. van Bodegom, Arnold Tukker, Evaluation and selection of functional diversity metrics with recommendations for their use in life cycle assessments, The International Journal of Life Cycle Assessment, 10.1007/s11367-018-1470-8, (2018).
- Karine Borges Machado, Fabrício Barreto Teresa, João Carlos Nabout, Assessing the spatial variation of functional diversity estimates based on dendrograms in phytoplankton communities, Acta Botanica Brasilica, 10.1590/0102-33062017abb0018, 31, 4, (571-582), (2017).
- Dénes Schmera, Jani Heino, János Podani, Tibor Erős, Sylvain Dolédec, Functional diversity: a review of methodology and current knowledge in freshwater macroinvertebrate research, Hydrobiologia, 10.1007/s10750-016-2974-5, 787, 1, (27-44), (2016).
- Mariana A. Tsianou, Athanasios S. Kallimanis, Different species traits produce diverse spatial functional diversity patterns of amphibians, Biodiversity and Conservation, 10.1007/s10531-015-1038-x, 25, 1, (117-132), (2015).
- Eric Marcon, Bruno Hérault, Decomposing phylodiversity, Methods in Ecology and Evolution, 10.1111/2041-210X.12323, 6, 3, (333-339), (2014).
- Chun-Huo Chiu, Anne Chao, Distance-Based Functional Diversity Measures and Their Decomposition: A Framework Based on Hill Numbers, PLoS ONE, 10.1371/journal.pone.0100014, 9, 7, (e100014), (2014).
- Sandrine Pavoine, Carlo Ricotta, TESTING FOR PHYLOGENETIC SIGNAL IN BIOLOGICAL TRAITS: THE UBIQUITY OF CROSS‐PRODUCT STATISTICS, Evolution, 10.1111/j.1558-5646.2012.01823.x, 67, 3, (828-840), (2012).
- Laura Pla, Fernando Casanoves, Julio Di Rienzo, Laura Pla, Fernando Casanoves, Julio Di Rienzo, Functional Diversity Indices, Quantifying Functional Biodiversity, 10.1007/978-94-007-2648-2_3, (27-51), (2012).
- E. Feoli, Diversity patterns of vegetation systems from the perspective of similarity theory, Plant Biosystems - An International Journal Dealing with all Aspects of Plant Biology, 10.1080/11263504.2012.740090, 146, 4, (797-804), (2012).
- Vojtěch Lanta, Jiří Doležal, Lenka Zemková, Jan Lepš, Communities of different plant diversity respond similarly to drought stress: experimental evidence from field non-weeded and greenhouse conditions, Naturwissenschaften, 10.1007/s00114-012-0922-4, 99, 6, (473-482), (2012).
- Karsten Wesche, Benjamin Krause, Heike Culmsee, Christoph Leuschner, Fifty years of change in Central European grassland vegetation: Large losses in species richness and animal-pollinated plants, Biological Conservation, 10.1016/j.biocon.2012.02.015, 150, 1, (76-85), (2012).
- E. Feoli, P. Ganis, R. Venanzoni, V. Zuccarello, Toward a framework of integrated knowledge of terrestrial vegetation system: The role of databases of phytosociological relevés, Plant Biosystems - An International Journal Dealing with all Aspects of Plant Biology, 10.1080/11263504.2011.602739, 145, sup1, (74-84), (2011).
- MARIA L. LONGHI, BEATRIX E. BEISNER, Patterns in taxonomic and functional diversity of lake phytoplankton, Freshwater Biology, 10.1111/j.1365-2427.2009.02359.x, 55, 6, (1349-1366), (2010).
- Sandrine Pavoine, Jeanne Vallet, Anne-Béatrice Dufour, Sophie Gachet, Hervé Daniel, On the challenge of treating various types of variables: application for improving the measurement of functional diversity, Oikos, 10.1111/j.1600-0706.2009.16668.x, 118, 3, (391-402), (2009).
- Owen L. Petchey, Kevin J. Gaston, Dendrograms and measures of functional diversity: a second instalment, Oikos, 10.1111/j.1600-0706.2009.17403.x, 118, 7, (1118-1120), (2009).
- Dénes Schmera, János Podani, Tibor Erős, Measuring the contribution of community members to functional diversity, Oikos, 10.1111/j.1600-0706.2009.17076.x, 118, 7, (961-971), (2009).
- Sandrine Pavoine, Jeanne Vallet, Anne‐Béatrice Dufour, Sophie Gachet, Hervé Daniel, On the challenge of treating various types of variables: application for improving the measurement of functional diversity, Oikos, 10.1111/j.1600-0706.2008.16668.x, 118, 3, (391-402), (2009).
- Maud Mouchet, François Guilhaumon, Sébastien Villéger, Norman W. H. Mason, Jean‐Antoine Tomasini, David Mouillot, Towards a consensus for calculating dendrogram‐based functional diversity indices, Oikos, 10.1111/j.0030-1299.2008.16594.x, 117, 5, (794-800), (2008).




