Climate change, adaptation, and phenotypic plasticity: the problem and the evidence

Many studies have recorded phenotypic changes in natural populations and attributed them to climate change. However, controversy and uncertainty has arisen around three levels of inference in such studies. First, it has proven difficult to conclusively distinguish whether phenotypic changes are genetically based or the result of phenotypic plasticity. Second, whether or not the change is adaptive is usually assumed rather than tested. Third, inferences that climate change is the specific causal agent have rarely involved the testing – and exclusion – of other potential drivers. We here review the various ways in which the above inferences have been attempted, and evaluate the strength of support that each approach can provide. This methodological assessment sets the stage for 11 accompanying review articles that attempt comprehensive syntheses of what is currently known – and not known – about responses to climate change in a variety of taxa and in theory. Summarizing and relying on the results of these reviews, we arrive at the conclusion that evidence for genetic adaptation to climate change has been found in some systems, but is still relatively scarce. Most importantly, it is clear that more studies are needed – and these must employ better inferential methods – before general conclusions can be drawn. Overall, we hope that the present paper and special issue provide inspiration for future research and guidelines on best practices for its execution.


Introduction
Adaptive evolution occurs when the genetic constitution of a population changes as a consequence of natural selection. Thus, to demonstrate that adaptation has occurred, evidence is needed for genetic change and that natural selection has been the causal force. Given that the genetic underpinnings of most traits are still not known (Ellegren and Sheldon 2008;Mackay et al. 2009;Anderson et al. 2013) and that the accurate measurement of natural selection is difficult (Kingsolver et al. 2001(Kingsolver et al. , 2012Kruuk et al. 2003;Hersch and Phillips 2004;Morrissey and Hadfield 2012), obtaining hard evidence to conclusively demonstrate adaptive evolution in the wild represents a major and ongoing challenge. Given these logistical difficulties, and following from the traditional view that natural selection is a powerful force (Endler 1986), a frequent default has been to infer adaptive evolution based simply on evidence that mean phenotypes have changed in ways that intuition suggests are adaptive. This is not enough.
One major reason the above intuition-based inferences are not sufficient is that phenotypic change can be the result of either genetic change or phenotypic plasticity, the latter occurring when individuals of a given genotype adjust their phenotype according to the conditions they experience (West-Eberhard 2003). Indeed, a number of phenotypic differences originally thought to be genetically based were subsequently attributed to phenotypic plasticity (e.g., James 1983;Charmantier et al. 2008;Teplitsky et al. 2008). In the wake of such demonstrations phenotypic plasticity is increasingly adopted as a parsimonious model that is to be rejected only if direct evidence is obtained for genetic change: That is, plasticity is treated as a null model. The reality, however, is that inferences regarding phenotypic plasticity also should be supported by a specific set of evidentiary criteria (see below).
Another major reason why the above intuition can fail is that phenotypic changes may or may not be adaptive. On the one hand, maladaptive or nonadaptive genetic changes can occur owing to genetic drift or gene flow, rather than natural selection. On the other hand, maladaptive or nonadaptive plastic changes might occur as a result of stress, nutrient limitation, and a host of other reasons (Gotthard and Nylin 1995;Grether 2005). In reality, then, genetic versus plastic changes and adaptive versus nonadaptive changes should be treated as alternative models to be compared based on the weight of evidence. In the present paper, we consider these issues in the context of contemporary global (climate) change.
The possibility of evolution in response to climate change started to gain attention in the late 1980s and early 1990s (e.g. , Holt 1990;Lynch and Lande 1993;B€ urger and Lynch 1995). Interest has since accelerated rapidlyin the last two decades more than 30 review and perspective articles have emerged on the topic (Meril€ a 2012), and 12 new reviews are published in this issue. At the same time, concerns have been voiced about the quality of inference: In particular, few studies have convincingly demonstrated that observed phenotypic shifts have a genetic basis (Gienapp et al. 2008;Meril€ a 2012). Likewise, despite the promise and enthusiasm associated with using molecular genetic and genomic approaches to detect climate-driven evolution (Hoffmann and Daborn 2007;Reusch and Wood 2008), few studies have been able to realize it (but see : Umina et al. 2005;Balany a et al. 2006) and fewer still have been able to link genetic changes to phenotypes and to confirm that the changes are adaptive.
In addition to the scarcity of evidence for genetic change, increasing support has emerged for its main alternative. That is, many studies have now reported a role for plasticity in shaping phenotypic responses to contemporary disturbances such as climate change (e.g., Charmantier et al. 2008;Gienapp et al. 2008;Hendry et al. 2008;Teplitsky et al. 2008). Partly for this reason, it has become increasingly common for authors to assume a role for plasticity unless direct evidence for genetic change has been obtained: that is, the null model approach mentioned above. In most cases, however, the lack of genetic evidence stems from the lack of actual tests for genetic change rather than concrete evidence that no genetic change has occurred. In such instances, all that can be stated with certainty is that the relative contributions of genetic change and plasticity are unknown. In addition, the general point made above that plastic responses might or might not be adaptive also applies to climate change. For instance, Teplitsky et al. (2008) provided evidence that climate-driven plastic decreases in the body size of red-billed gulls (Larus novaehollandiae; Fig. 1) were likely the result of environmental stress, rather than adaptive responses.
Beyond this ambiguity about genetic versus plastic changes and adaptive versus nonadaptive changes, another inferential difficulty comes in determining the specific environmental factor causing a particular phenotypic/genetic change (Endler 1986;Wade and Kalisz 1990;MacColl 2011). A well-known example of this problem is the difficulty in confidently ascribing phenotypic trends in fish stocks to the effects of fishing versus other environmental changes (e.g., Kuparinen and Meril€ a 2008). In the context of climate change, increasing temperatures are often coincident with many other environmental changes that shape phenotypic responses, whether genetic or plastic. Thus, the demonstration that a phenotypic change is genetic or plastic and adaptive or not does not necessarily provide confirmation that climate change, or more specifically climate warming, is the specific cause.
In the present paper, we outline approaches that have been used for making the above inferences in relation to climate change. Our major goal in doing so is to consider in detail what the various methods can and cannot tell us about proximate and ultimate causes. One of our key points is that evolution and plasticity (whether adaptive or not) should be treated as alternative models to be weighed against each other in light of data. That is, one alternative should not be considered a 'null' model to be assumed unless it can be rejected in favor of another.
After drafting the present paper, we provided it to experts who were well positioned to evaluate the strength of evidence for adaptive evolutionary responses to climate change in particular taxa (viz. fish, birds, mammals, amphibians and reptiles, marine animals and plants, terrestrial invertebrates, freshwater invertebrates, terrestrial plants, and marine phytoplankton) or in the context of particular topics (Bergmann's rule, theoretical models). The results of these assessments form the rest of this special issue (Boutin and Lane 2014;Charmantier and Gienapp 2014;Collins et al. 2014;Crozier and Hutchings 2014;Franks et al. 2014;Kopp and Matuszewskib 2014;Reusch 2014;Schilthuizen and Kellermann 2014;Stoks et al. 2014;Teplitsky and Millien 2014;Urban et al. 2014;Fig. 1). We therefore close with a brief synopsis of those contributions. We hope that the end result is a balanced account of what is currently known about nature of phenotypic responses to climate change.

Basic considerations
Studies seeking evolutionary inference face the three basic challenges introduced above. One challenge is to understand the proximate causes of phenotypic change: that is, to what degree do the observed shifts reflect genetic and/or plastic changes. Another challenge is to understand the ultimate cause at a basic level: that is, natural selection, sexual selection, genetic drift, or gene flow. The third challenge is to establish the ultimate cause at a specific level: that is, the precise environmental driver, such as commercial fishing, climate change, or pollution.
Each of the above inferences might be desired in synchronic (i.e., spatial) or allochronic (i.e., longitudinal or temporal) contexts. Synchronic studies compare different populations sampled at approximately the same time, whereas allochronic studies compare the same population sampled at different points in time (Hendry and Kinnison 1999). Most inferential methods are easier to implement in the synchronic case because representatives from the different populations can be reared/raised/grown together under specific conditions and because direct assessments of selection can be implemented in situ. Studies of contemporary climate change, however, are generally allochronic (i.e., we wish to infer whether a population in the present differs from the same population in the past), a context where inferential methods are harder to implement. In particular, individuals from different times usually cannot be reared/raised/grown together, and past selective regimes are hard to infer with any degree of precision. Similar difficulties attend other allochronic situations, such as evolution in response to commercial fisheries (e.g., Meril€ a 2007, 2008) and biological invasions (e.g., Bacigalupe 2009).
If responses to climate change could be assessed synchronically, the present paper would be unnecessary because the various methods have been much discussed and evaluated (e.g., Endler 1986; Kawecki and Ebert 2004;Blanquart et al. 2013). By contrast, methods for inference in allochronic contexts have not been treated in as much detail, including in relation to climate change. In the following sections, we first focus on proximate causes by outlining different approaches for inferring genetic versus plastic changes in allochronic studies. We then address the challenges encountered when attempting to infer ultimate causes in the same context: that is, whether natural selection is the general driver and whether climate change is the specific driver.
An addition point to bear in mindalthough not one we here consider in detailis that phenotypic constancy in the face of environmental change can also represent an important response to climate changeand can be considered in the contexts of 'phenotypic buffering' (in the context of plasticity -Reusch 2014) or 'genetic compensation' (in the context of genetic change - Conover and Schultz 1995;Grether 2005).

Genetic or plastic?
Approaches for inferring genetic change Six basic approaches might be considered for inferring genetic change: 'animal model' analyses, common-garden studies, comparison to model predictions, experimental evolution, space-for-time substitutions, and molecular genetic change (Table 1). Each of these methods has its own strengths and weaknesses, and some allow more robust inferences than do others.

Animal model analyses
This quantitative genetic approach is based on temporal changes in estimates of mean breeding values (i.e., 'genotypic values') for phenotypic traits. The strength of this approach is that it specifically quantifies genetic shifts while controlling for plasticity (e.g., R eale et al. 2003). A major limitation is that doing so requires pedigree data, which is usually available only in long-term studies of individually marked (or genotyped) vertebrate populations (e.g., R eale et al. 2003;Charmantier et al. 2008). Additional shortcomings can include problems in the estimation and interpretation of changes in mean breeding values. In particular, it sometimes can be difficult to ascertain whether observed trendsor the lack thereofare biologically meaningful as opposed to methodological artifacts (Postma 2006;Hadfield et al. 2010). When these problems can be overcome by suitable data and appropriate methods, the animal model remains one of the best approaches for evaluating the genetic basis of phenotypic change in a climate change context. In the present special issue, animal models are emphasized in the contributions of Charmantier and Gienapp (2014), Boutin and Lane (2014), and Teplitsky and Millien (2014).

Common-garden studies
In order to isolate genetic contributions to trait differences, individuals of known family background can be reared/raised/grown under common laboratory or field conditions (Conover and Schultz 1995;Blanquart et al. 2013). In allochronic contexts, this approach can be implemented in two basic procedures. First, populations for which common-garden studies were conducted in the past can be re-analyzed with new common-garden studies in the present. This approach was used to demonstrate climate-associated genetic shifts in the phenology of pitcher-plant mosquitoes (Wyeomyia smithii; Bradshaw and Holzapfel 2001). A difficulty in implementing such studies that the past common-garden environment needs to be very accurately replicated in the new experiment. Second, organisms with dormant stages, such as aquatic invertebrates with resting eggs (e.g., Daphnia, Cousyn et al. 2001) or plants with long-lasting seeds (e.g., Franks et al. 2007), can be resurrected from the past for direct comparison to individuals from the present. In either procedure, it is important to control for maternal effects and to recognize that genotype-by-environment interactions can make extrapolation to the natural environment difficult. In the present special issue, allochronic commongarden studies figure most heavily in the contributions of Stoks et al. (2014) and Franks et al. (2014).

Comparison to model predictions
This approach compares observed phenotypic trends to predictions from evolutionary models, such as the breeder's equation or its multivariate equivalent (Lynch and Walsh 1998). Specifically, selection and additive genetic (co)variances are measured and used to predict evolutionary responses. If observed trends match predicted trends, then adaptive genetic change is inferred (e.g., Swain et al. 2007;Crozier et al. 2011). Several concerns attend this method. First, it is difficult to accurately measure selection in nature (Kingsolver et al. 2001(Kingsolver et al. , 2012Kruuk et al. 2003;Hersch and Phillips 2004;Morrissey and Hadfield 2012). Second, it is even more difficult to accurately estimate additive genetic (co)variances (Lynch and Walsh 1998), not the least because they vary with environmental conditions (Hoffmann and Meril€ a 1999). Third, a large number of other assumptions attend these predictive models (Meril€ a et al. 2001;Morrissey et al. 2010). Given these difficulties, it is perhaps not surprising that predicted and observed evolutionary responses are often (although not always) poorly matched in natural populations (Meril€ a et al. 2001;Kruuk et al. 2008). In the present special issue, comparisons to model predictions are mentioned by Crozier and Hutchings (2014) and models of evolution in response to climate change are reviewed by Kopp and Matuszewskib (2014).

Experimental evolution
The basic idea here is to establish experimental populations exposed to different environmental conditions, such as warming temperature (e.g., Van Doorslaer et al. 2007), increasing CO 2 (e.g., Collins and Bell 2004), or acidification (e.g., Lohbeck et al. 2012). After a period of evolution under these conditions, the different populations can be compared under common conditions to quantify any genetically based phenotypic change, and they can be genotyped to uncover its genetic underpinnings (Kawecki et al. 2012). A major strength of this approach is that, with appropriate replication and controls, it can confirm evolution in response to a specific environmental factor (discussed in more detail later). One limitation of this approach is that logistical constraints often restrict it to small organisms with short generation times. Another limitation is that it cannot provide confirmation that evolution has actually occurred in natural (as opposed to experimental) populationsit is instead a 'proof-of-principle' inferential method. This last concern can be somewhat lessened by conducting experiments with natural populations in natural environmentssuch as in the case of experimental introductions (Reznick and Ghalambor 2005). In the present special issue, the utility of experimental evolution studies in the context of climate change is especially highlighted by Reusch (2014) and Collins et al. (2014).

Space-for-time substitution
Most environmental factors that vary in time also vary in space, andas explained aboveinferences in space (synchronic) are far easier than inferences in time (allochronic). A tempting approach therefore has been to find spatial associations between environmental factors and phenotypes that match the temporal associations for which inferences are desired. If common-garden or reciprocal transplant studies show a genetic basis for phenotypic differences in space, then it might seem reasonable to infer the same for the variation in time. The limitation, of course, is that demonstrating an effect in one situation does not necessarily mean that the same effect holds in another situation (as is the case for experimental evolution), and temporalas opposed to spatialenvironmental changes might be occurring faster than evolution can respond. Fukami and Wardle (2010) provide a good account of the assumptions and potential problems related to spacefor-time substitution in ecological studies, many of which are relevant also for evolutionary studies. At present, spacefor-time substitutions are the most commonand sometimes onlybasis for inference in some taxonomic groups, such as fish (Crozier and Hutchings 2014), marine animals and plants (Reusch 2014), and amphibians and reptiles (Urban et al. 2014), as well as in the context of Bergmann's Rule (Teplitsky and Millien 2014).

Molecular genetic approaches
A demonstration of shifts in allele frequency confirms that evolution has occurredbut genomes contain thousands to millions of loci, making shifts at some of them inevitable in all situations. A major difficulty thenmuch more so than when considering phenotypesis to determine the specific shifts that are relevant. To this end, investigators usually focus on loci known or thought to influence phenotypes or be under selection, or on closely linked neutral loci (Hoffmann and Daborn 2007;Reusch and Wood 2008). However, because the genetic basis of most traits is not known, it is difficult to know in advance which genes or genomic regions to consider. An alternative is to employ genome scans based on candidate genes or anonymous loci, but their relevance to adaptive phenotypic change often remains uncertain (e.g., Storz 2005). To date, the frequencies of some well-characterized genetic polymorphisms have been shown to shift with climate change (e.g., Umina et al. 2005;Balany a et al. 2006); but even in these cases, the relevant phenotypic traits and selective mechanisms are not well understood (see more about inferring selection below). In addition to searching for shifts in allele frequencies, comparative analyses of gene expression can be used to search for evidence of genetic effects related to population differentiation (e.g., Roberge et al. 2007), including climate change responses. Advances in genomic technologies, together with new insights into the genetic basis of traits (Olson-Manning et al. 2012;Wolkovich et al. 2012), hold promise for future efforts to infer molecular genetic changes driven by climate change (Franks and Hoffmann 2012). In the present special issue, genetic shifts are considered most often by Schilthuizen and Kellermann (2014) and Franks et al. (2014).

Approaches for inferring plastic change
We now focus on methods for inferring when plastic changes have contributed to temporal changes in mean phenotype. In doing so, it is important to remember that plasticity can have a genetic basis and can evolve (West-Eberhard 2003) and that both plastic and genetic effects can contribute to a phenotypic trend. The two alternatives are therefore not mutually exclusive but rather potential (co)contributors to a given change: 'cogradient variation' provides an illuminating example (Conover and Schultz 1995). Five basic approaches might be considered for inferring plasticity: 'animal model' analyses, common-garden studies, experimental manipulations, fine-grained population responses, and measures of individual plasticity (Table 1). As was the case for genetic inferences (above), each approach for inferring plasticity has strengths and weaknesses, and some allow more robust inferences than do others.

Animal model analyses
As described above, this quantitative genetic approach can determine the extent to which a phenotypic trend has a quantitative genetic basis. By extension, the portion of the phenotypic trend not explained by genetic change is likely to result from plasticity. Plasticity is thus inferred by excluding or removing genetic effects. In an early example of this approach from a climate change context, R eale et al. (2003) estimated that 62% of the change in parturition date in a population of red squirrels (Tamiasciurus hudsonicus) was due to plasticity. Even more dramatically, Charmantier et al. (2008) showed that phenological changes in Great tits (Parus major) in Oxford were the result of plasticity, rather than genetic change. Hence, with the caveats and limitations listed earlier, animal model approaches can evaluate plastic versus genetic contributions to phenotypic change. In the present special issue, this approach is considered in the most depth by Charmantier and Gienapp (2014) and Boutin and Lane (2014).

Common-garden studies
When a common-garden experiment, as described above, fails to confirm a genetic basis for trait differences, a common conclusion is that the differences observed in nature must result from plasticity. As is the case for animal models, inferences in this approach thus involve excluding or removing genetic effects. Limitations (see also above) include the infeasibility of allochronic common-garden studies for most organisms and the possibility of genotype-by-environment interactions that can make results in a specific common garden misleading with respect to nature (e.g., Winkler and Van Buskirk 2012). These concerns are lessened, although not eliminated, by the use of multiple testing environments, particularly those relevant to the natural situation. In the present special issue, common-garden approaches are particularly common among the studies reviewed by Franks et al. (2014) and Stoks et al. (2014).

Experimental manipulations
Whether in the laboratory or the field, the experimental manipulation of an environmental factor, such as temperature, can be used to quantify plasticity. The tempting inference then is that variation in the same factor in nature will produce the same phenotypic responseand so plasticity can explain similar environment-trait associations in nature. This indirect 'proof-of-principle' approach is limited in having to assume that the relevant environmental factors have been identified and applied at the correct life history stage. If not, failure to find sufficiently explanatory plastic responses could simply mean that the experimental conditions failed to accurately mimic the natural conditions. Alternatively, strong plastic responses in the laboratory might be swamped in nature by other effectsincluding 'countergradient' genetic differences (Conover and Schultz 1995). For these and probably other reasons, experimental studies often do a poor job of predicting observed trends in natural populations (Wolkovich et al. 2012). In the present special issue, papers paying particular attention to experimental manipulations include Franks et al. (2014), Collins et al. (2014), and Urban et al. (2014).

Fine-grained population responses
Environmental factors, such as temperature, do not change in a temporally constant fashion; they instead include a stochastic component that introduces substantial year-to-year variation around any coarse-scale temporal trend. In the present special issue, Crozier and Hutchings (2014) present temperature data that illustrate this phenomenon. This year-to-year environmental variation can be assessed for correlations with year-to-year variation in phenotypes so as to develop a yard-stick for plasticity (e.g., Phillimore et al. 2010)because genetic changes are not expected to be so fine-grained. An important limitation of this approach is the difficulty in verifying whether the change that took place really was the result of plasticity. Also, the relative importance of factors influencing fine-and coarse-grained variation could be quite different. In the present special issue, taxa where fine-grained population responses are frequently considered include birds (Charmantier and Gienapp 2014), mammals (Boutin and Lane 2014), and fish (Crozier and Hutchings 2014).

Individual plasticity in nature
When individual organisms live through multiple occurrences of a given event, investigators can measure within-individual among-event associations between environmental variables and phenotypes. For example, individual birds might breed earlier in warmer years. The resulting estimates of individual plasticity are considerably more natural than those derived from the experimental laboratory-based approach mentioned above. If extrapolation of individual plasticity to the population level can explain temporal trends in mean phenotypes, then plasticity might be considered a sufficient explanation for the population trend (Przybylo et al. 2000;Schiegg et al. 2002;Brommer et al. 2005;Nussey et al. 2005;Reed et al. 2006). A practical limitation of this approach is that it can be applied only in situations where phenotypes of individuals can be tracked across multiple events. Other limitations are that within-individual plasticity might (or might not) be constrained in relation to developmental plasticity, the causes of individual-level and population-level trends might be different, and confirmation of individual plasticity does not rule out the possibility that evolution has also occurred. Several papers in the present special issue discuss the vir-tues of measuring individual plasticity in nature (e.g., Boutin and Lane 2014; Charmantier and Gienapp 2014).

Comparison of approaches
Each of the above approaches has been used to infer genetic or plastic responses to climate change, but each also has a series of limitations. An important question thus becomes: 'Which approaches allow the strongest inferences regarding genetic or plastic effects?' These inferences can come in two basic forms: evidence for a particular effect (genetic change or plasticity) or evidence against a particular effect. The distinction between these forms of inference is important because evidence for or against a particular effect does not necessarily provide evidence for or against the other effect. In particular, evidence that genetic change has taken place does not by itself indicate that plasticity has been unimportant, just as evidence for plasticity does not by itself indicate that genetic change has been unimportant.
When it comes to genetically based phenotypic change, strong inferences are possible through the use of animal modelsalthough interpretational problems can occur (Postma 2006;Hadfield et al. 2010;Charmantier and Gienapp 2014). However, given that such analyses are possible for only a tiny fraction of the situations where inferences are desired, other approaches are necessary. Common-garden studies also allow strong inferences, although they are possible in only some organisms in allochronic contexts. Specifically, the demonstration of phenotypic differences in a common-garden environment confirms that genetic change has taken place, although genotype-by-environment interactions can complicate extrapolations to nature. Such interactions also mean that the failure to demonstrate differences in a common-garden environment is not strictly speaking bullet-proof evidence against genetic change. Molecular approaches can provide direct evidence that genetic change has occurred but, again, failure to find such differences does not mean that they are absent (they could be present at some nonsurveyed genomic regions) and links to adaptive phenotypes are often uncertain. The other approaches (comparison to model predictions, experimental evolution, space-for-time replacement) provide supporting evidence that amounts to 'proof-of-principle' but they cannot confirm that observed changes in nature have been genetic.
When it comes to plasticity, the problem is differentone cannot directly demonstrate that past changes were plastic. Stated another way, demonstrating that plasticity could achieve an observed phenotypic trend does not mean that plasticity actually did cause that trend. So the best approach here is to show that plasticity could explain the observed trend while also showing that genetic change can-not. Animal models provide a solid route to this inference because they can disprove genetic change in a phenotype, while also documenting sufficiently explanatory individual plasticity (R eale et al. 2003;Charmantier et al. 2008). Again, however, this approach will be accessible to only a small number of researchers working on a limited set of organisms. Reasonably strong inferences can also come from common-garden experiments showing that genetic change cannot explain a given phenotypic trend (with the above caveats), in combination with other approaches (experimental studies, fine-grained population responses, or individual plasticity) showing that plasticity could explain them. By themselves, however, these latter approaches fall into the 'proof-of-principle' category.
As the foregoing assessment makes clear, no single approach will be a panacea. The best route to strong and robust inference is therefore to use a combination of methods.

Is the change adaptive?
Establishing that an observed phenotypic shift has a genetic basis is a necessary condition for inferring adaptive evolution in response to climate change, but it is not entirely sufficient because genetic changes might not be adaptive. In particular, genetic drift (Lande 1976), gene flow (Garant et al. 2007), and inbreeding (Keller and Waller 2002) can all cause maladaptive genetic changes. Similarly, establishing that an observed phenotypic shift has a plastic basis does not confirm that the change is adaptiveit might instead be the result of environmental stress. Five basic approaches might be considered for inferring the adaptive nature of phenotypic changes: reciprocal transplants, phenotypic selection estimates, genotypic selection estimates, comparison to neutral expectations, and Q ST -F ST comparisons.

Reciprocal transplants
The most direct test for the adaptive significance of a phenotypic change is to reciprocally transplant individuals between environments (Endler 1986;Kawecki and Ebert 2004;Hereford 2009;Blanquart et al. 2013). This approach is obviously difficult in an allochronic context, but it can be implemented in some situations. In particular, adaptive evolution is confirmed if contemporary genotypes have higher fitness in the contemporary environment than do resurrected genotypes from past environments (e.g., Decaestecker et al. 2007). Given that resurrection is feasible for only a small subset of organisms, an alternative approach is to use laboratory or mesocosm experiments to test contemporary genotypes in simulated present and past environments. Higher fitness in the former implies that adaptation to present environments has reduced fitness (owing to trade-offs) in past environments. On the other hand, failure to find higher fitness in present than in past environments does not mean that adaptive evolution is absent because fitness trade-offs are not always evident between environments (Bennett and Lenski 2007;Blanquart et al. 2013). Note that this approach can be used (with the usual caveats) to assess the adaptive significance of genetic differences (by using individuals from common-garden experiments), plastic differences (by using individuals from experimental manipulations), and their combination (by using individuals captured from the wild).

Phenotypic selection estimates
If a given phenotypic change is adaptive, it was likely driven by selectionand so one inferential approach is to measure selection in nature. Specifically, adaptation can be inferred when selection acting during a phenotypic change would be predicted to favor that change (e.g., Swain et al. 2007;Crozier et al. 2011). Note, however, that measuring contemporary selection is not sufficient for inferring past selectionbecause selection can vary through time (Siepielski et al. 2009;Siepielski et al. 2011). Implementing this approach therefore requires long-term datasets of selection. An additional limitation is that the direction of phenotypic change could correspond to the estimated direction of selection without the two being causally linked (Meril€ a et al. 2001)although the estimation of selection on breeding values can reduce this concern (Hadfield et al. 2010; see above). Also, as noted earlier, accurate estimates of selection are extremely difficult to obtain in most natural populations.

Genotypic selection estimates
Analogous to the phenotypic selection approach, investigators can ask whether observed shifts in allele frequency correspond to selection on those alleles. This selection can be estimated either directly or indirectly. Directly, one can genotype individuals and then measure their fitness (e.g., Fournier-Level et al. 2011) or, if individuals cannot be tracked in this manner, allele frequency differences can be measured between age groups. These approaches are attended by many of the same issues that were discussed immediately above for phenotypic selection. Indirectly, one can test for genomic signatures of past selection (e.g., Storz 2005;Foll and Gaggiotti 2008). Limitations here are that it can be difficult to ascertain the relative contributions of selection as opposed to other factors (e.g., population structure and recombination), and when the past selection took place (Barrett and Hoekstra 2011;Nachman and Payseur 2012). An additional limitation of these genotypic approaches is that selection at the genotypic level does not necessarily reflect selection at the phenotypic level. For example, selection on polygenic traits can be very difficult to detect at the individual loci that influence those traits (McKay and Latta 2002; Le Corre and Kremer 2012).

Comparison to model predictions
A long tradition in evolutionary biologyparticularly in paleontologyhas been the use of null models of evolutionary change (e.g., Brownian motion or genetic drift) to infer the role (or not) of natural selection (review: Sheets and Mitchell 2001). Specifically, natural selection is typically inferred when the rate or directionality of change exceeds the confidence bounds of a null model. A limitation of this approach is its low inferential power even when strong external evidence supports the role of selection (Bell et al. 2006). More recently, increased inferential power has come from formally comparing alternative models with and without natural selection (e.g., Clegg et al. 2008;Hunt et al. 2008). This null model approach is not very common in the context of contemporary climate change, although Crozier et al. (2011) might be considered in this class.

Q ST -F ST comparisons
This approach is also rooted in null model thinking. Specifically, theory predicts that selection will cause patterns of quantitative trait variation to differ from patterns of neutral marker variation (Lande 1992;McKay and Latta 2002;Leinonen et al. 2008;Whitlock 2008). In particular, divergent selection is often inferred when the among-population proportion of genetically based trait variance (Q ST ) exceeds the among-population proportion of neutral marker divergence (F ST ). This approach has a number of limitations. Of particular concern, the accurate estimation of Q ST requires a common-garden study (e.g., Leinonen et al. 2008), which is difficult to implement in an allochronic context. Also in play is the previously noted problem that genotype-by-environment interactions can make inferences misleading with respect to natural populations. A partial solution to this problem can come from using natural populations to estimate the phenotypic Q ST analog (P ST : Leinonen et al. 2006), although interpretation remains difficult (Brommer 2011). Many additional concerns attend Q ST -F ST comparisons (Hendry 2002;Leinonen et al. 2008;Edelaar et al. 2011), although some of these can be reduced through careful planning or by adopting improved approaches (e.g., Ovaskainen et al. 2011;Karhunen et al. 2013). To date, however, Q ST -F ST comparisons have not been much used in the context of contemporary climate change.

Comparison of approaches
Reciprocal transplants are the most direct approach for inferring adaptive change and should be employed whenever possible. When this approach cannot be applied, the next resort should be methods described in the following sectionbecause these methods can reveal not only that adaptation has likely occurred (the present question) but also the specific environmental driver.
What is the specific selective force?
Even if the overall adaptive significance of a phenotypic trend has support from the above approaches, the specific environmental driver remains to be established (Wade and Kalisz 1990;MacColl 2011). This remaining question is of particular concern because climate change is often correlated with other environmental changes that might influence phenotypic change, such as changing rates of exploitation by humans, increasing or decreasing pollution, habitat loss and degradation, and increasing impact from invasive species. Inferring that climate change, or a specific aspect of climate change (e.g., temperature, pH), is the causal force thus requires additional effort. Three basic routes to such inferences are possible: common sense, phenotype-environment correlations, and experimental selection/evolution.

Common sense
The easiest, and certainly still the most common, approach for inferring a specific environmental driver is through existing knowledge and intuition. For example, if (1) organisms reproduce when particular resources become available in the spring, (2) climate change is causing those resources to become available earlier, and (3) the organisms are reproducing earlier, we might feel safe in assuming that climate change is the cause of the observed phenotypic change. Although intuition often might be correct, it is not inevitably or always simplistically so. For example, climate warming is generally assumed to advance the spring phenology of temperate organisms, whereas it actually retards the emergence of alpine ground squirrels (Urocitellus columbianus; Lane et al. 2012). At the end of the day, any adaptive 'story telling' (sensu Gould and Lewontin 1979) should be accompanied by formal hypothesis testing using the following methods.

Phenotype-environment correlations
Another common inferential approach is to quantify the association between trait change and specific aspects of climate change (e.g., Quinn and Adams 1996;Crozier et al. 2011). If a tight correlation is found, especially across multiple independent locations/populations, support for that environmental driver is enhanced. Ideally, multiple potential drivers would be simultaneously assessed so as to determine their independent and interactive effects. A difficulty in implementing this approach is that evolution, and sometimes plasticity, cannot perfectly track environmental change (Quinn and Adams 1996;Both and Visser 2001), and so the failure to find a tight trait-environment association does not necessarily mean that the environmental factor is unimportant. This problem can be partially circumvented by also adding a complementary space-for-time replacement (e.g., Phillimore et al. 2010). If, for instance, temperature is suspected to be the primary factor driving temporal changes in phenotype, similar associations might be expected across spatial temperature gradients. Of course, we return here to the above-mentioned concern that temporal and spatial drivers might not be the same (see also Fukami and Wardle 2010).

Experimental selection/evolution
Experimental studies are often deemed necessary to conclusively isolate the role of a particular environmental factor in shaping a particular ecological or evolutionary response (Bender et al. 1984). The best manifestation in the context of climate change would be to experimentally manipulate a specific candidate driver in nature, such as through artificial warming of the environment experience by a natural population. With appropriate replication and controls, and following the confirmation of genetic or plastic change and its adaptive significance (as above), exceptionally strong evidence thus would be provided for causal effects. This approach is obviously difficult to apply in nature, and so most such studies instead use controlled laboratory or mesocosm settings (e.g., Van Doorslaer et al. 2007). Although such studies are extremely informative, their relevance to nature is sometimes uncertain. And, of course, experimental manipulations cannot conclusively confirm the specific driver of phenotypic/genetic changes in other (nonexperimental) populationsalthough ambiguity is lessened if responses are consistent across replicates/contexts and are very similar to those in natural populations.

Comparison of approaches
As opposed to the previous questions, it is not particularly helpful to here rank the different approaches for the quality of inference they can providebecause each has major inferential limitations. Experimental studies are obviously the best way to demonstrate causation but these are difficult to implement in natural populations and can only conclusively demonstrate causation within the experiment itself. The other approaches are more relevant to natural populations, but are correlational. Thus, the most convincing cases are built based on combined evidence from observational and experimental studies.
Where are we now and what comes next?
Our goal in preparing the present paper was to provide a thorough and honest appraisal of the different approaches for inferring genetic and plastic responses to climate change. Having done so, we asked experts to use our appraisal as a basis for evaluating the evidence for such changes in specific taxa and contexts. In the rest of this special issue, these assessments are provided for bird breeding and migration timing (Charmantier and Gienapp 2014), mammals (Boutin and Lane 2014), fish (Crozier and Hutchings 2014), amphibians and reptiles (Urban et al. 2014), terrestrial plants (Franks et al. 2014), marine plants and animals (Reusch 2014), terrestrial arthropods (Schilthuizen and Kellermann 2014), and freshwater arthropods (Stoks et al. 2014). We also asked for a similar assessment for Bergmann's rule in a climate change context (Teplitsky and Millien 2014) and for a state-of-the art summary of theoretical approaches (Kopp and Matuszewskib 2014). Although these targeted papers should be consulted for details, we now provide a brief summary of their findings.
Even a cursory reading of the special issue reinforces the view that our understanding of genetic and plastic responses to climate change is still in its infancy. In particular, although a huge number of studies have found evidence of climate-associated phenotypic trends (e.g., Parmesan Table 2. A synopsis of studies testing for climate-driven genetic changes in an allochronic (temporal) context in nature. Most of these studies are drawn from the taxonomic reviews in the present special issueto which citations are provided. Owing to our specific focus (genetic change, allochronic studies, and natural populations), we here exclude space-for-time substitutions and experimental evolution in the laboratory. Also for this reason, no studies of amphibians or reptiles (Urban et al. 2014), marine phytoplankton (Collins et al. 2014), or other marine organisms (Reusch 2014) appear in the table. Also indicated is whether the adaptive nature of the change has been confirmed and whether climate change has been established as a causal factor. The numbers in parentheses refer to approaches listed in Table 1 Focusing specifically on studies using strong methods of inference for the first of these topics provides a soberingly short list (Table 2). That is, instances of confirmed genetic change do occur in birds and mammals, and especially in terrestrial plants and insects, but are quite few. By contrast, considerably more studies have found evidence for plastic contributionsas summarized by the various contributions to this special issue. These findings are highlighted by the results of Teplitsky and Millien (2014) who focused on temporal body size clines in vertebrates: None of studies listed in their Table 2 provide evidence for genetic basis of observed body size changes.
Overall, it seems safe to conclude that plasticity often makes a strong contribution to phenotypic trends associated with contemporary climate change. Genetic contributions, however, seem to be weaker and less commonalthough the reality is that only a few definitive tests have been performed. It may be that application of better inferential methods will uncover many more examples of genetic change (many such examples are certainly known in spatial contexts), or it may be that not enough time has passed for substantial evolution to take place. Although this constraint might well be important, plenty of examples certainly do exist of genetically based adaptation to local temperature differences on similar time frames, including in taxa for which few examples appear in Table 2. For instance, the contemporary evolution of temperaturedependent development has been shown for salmonid fish populations introduced to new thermal environments (e.g., Haugen and Vøllestad 2000) and for amphibian populations subject to pond warming as a result of beaver activity (Skelly and Freidenburg 2000). Evolutionary responses to climate change therefore demands considerably more study.
To sum up, this perspective and the accompanying eleven articles have focused on methods and quality of evidence for genetic and phenotypic changes to climate change in nature. While the current picture emerging from all of this work might not seem particularly encouraging, it should provide guidelines, avenues, and inspiration for research to come. Identification of the challenges and knowledge gaps can be viewed as first step toward progress in improving our understanding of the relative roles of genetic change and plasticity in mediating adaptive organismal responses to changing climatic conditions.

Acknowledgements
This paper represents an equal effort by both authors that originally stemmed from their disagreements and argu-ments about the evidentiary methods for inferring evolutionary and plastic responses to climate change. We thank all the people who contributed to this special issueboth for their contributions and for their comments on earlier version of this manuscript. We also specifically thank Anne Charmantier and Steven Franks for help developing the conventions in Table 2. During the preparation of this manuscript, we were supported by the Academy of Finland (grant numbers: 250435, 263722 and 265211) and the Natural Sciences and Engineering Research Council of Canada.