We explore two principal areas of uncertainty associated with paleoclimate reconstructions from speleothem δ18O (δ18Ospel): potential non-stationarity in relationships between local climate and larger-scale atmospheric circulation, and routing of water through the karst aquifer. Using a δ18Ospel record from Turkey, the CSIRO Mk3L climate system model and the KarstFOR karst hydrology model, we confirm the stationarity of relationships between cool season precipitation and regional circulation dynamics associated with the North Sea-Caspian pattern since 1 ka. Stalagmite δ18O is predicted for the last 500 years, using precipitation and temperature output from the CSIRO Mk3L model and synthetic δ18O of precipitation as inputs for the KarstFOR model. Interannual variability in the δ18Ospel record is captured by KarstFOR, but we cannot reproduce the isotopically lighter conditions of the sixteenth to seventeenth centuries. We argue that forward models of paleoclimate proxies (such as KarstFOR) embedded within isotope-enabled general circulation models are now required.
 Speleothem δ18O (δ18Ospel) is an excellent proxy for terrestrial paleoclimate reconstruction. The δ18O of cave drip water (δ18Odw) from which a speleothem forms records the isotopic composition of the proportion of water that infiltrates into the deep soil layers and karst aquifer. Statistical relationships between δ18Ospel and specifically the amount of observed precipitation have been used to generate quantitative hydroclimate reconstructions in regions where there is a reliable isotopically distinct seasonal recharge of ground water [Baker et al., 2007; Cai et al., 2010; Jex et al., 2010; Mattey et al., 2008; Yadava et al., 2004]. Such reconstructions depend upon two key assumptions. First, the application of statistically derived relationships beyond calibration-validation time periods assumes their stationarity. Second, any climatic interpretations also assume that the dominant patterns of atmospheric circulation and the seasonal distribution of precipitation have remained stable in nature throughout the period of study [Sturm et al., 2010]. The first assumption may be addressed by forward modeling a δ18O pseudoproxy time series [Sturm et al., 2010], whilst the second assumption can be tested by examining the relationships between local climate parameters and atmospheric circulations during and beyond the instrumental period using a climate system model.
 Published models to predict δ18Odw include the oxygen isotope drip water and stalagmite model (ODSM) [Wackerbarth et al., 2010] and the KarstFOR karst hydrology model [Baker et al., 2012]. Both have been used to predict δ18Odw and δ18Ospel during the last century [Baker and Bradley, 2010; Baker et al., 2012; Lohmann et al., 2013], and beyond the instrumental period [Baker et al., 2012; Wackerbarth et al., 2012; Baker et al., manuscript in revision]. They consider infiltration of precipitation due to evapotranspiration and evaporative fractionation in soil water and predict δ18Odw. KarstFOR is a lumped parameter karst hydrology model, which, unlike ODSM, allows for seepage and preferential routing of water through five water stores. Input series are temperature (T) and precipitation (P) and δ18O of precipitation (δ18Opptn). The model generates time series for δ18Odw and δ18O of speleothem calcite (δ18Ocmod; following Kim and O'Neil ) that may be attributed to changes in the seasonality of recharge arising from a modified potential evapotranspiration (PET) regime imposed by changing T and P. This avoids the assumption of stationarity inherent in statistically derived relationships. Similarly, storage times and evaporative fractionation of δ18Opptn predicted in the soil zone may be modeled, as well as varying residence times, storage capacity, and the subsequent routing and mixing of old and young groundwater in the karst aquifer. So far, studies using the KarstFOR model have confirmed the importance of considering water balance in PET sensitive regions and identified non-stationarity in δ18Ospel due to changing water balance.
 Here we present the results of a multi-model proxy comparison using KarstFOR in conjunction with a climate system model, to demonstrate how interpretations based on δ18Ospel records may be refined and key uncertainties addressed. We use a published speleothem record used to reconstruct the amount of cool season precipitation of the last 500 years using linear regression methods [Jex et al., 2011] and the CSIRO Mk3L climate system model, a reduced-resolution coupled general circulation model, designed primarily for millennial-scale climate simulation and paleoclimate research [Phipps et al., 2011, 2012]. The speleothem sample (2p) is from Akçakale cave in NE Turkey (40°26′N; 39°32′E; 1530 m above sea level (asl)). The local climate, cave, and speleothem 2p are described by Jex et al. [2010, 2011]. Briefly, in this cave, spring snow melt initiates infiltration into the karst aquifer, whilst in the warm season (May to September), infiltration is limited by a consistent seasonal soil moisture deficit [Jex et al., 2010]. The relationship between the δ18Ospel and the amount of cool season precipitation (i.e., falling in October, November, December, and January) throughout the instrumental period was established and subsequently the amount of cool season precipitation was reconstructed for the previous 500 years. A period of anomalously lower reconstructed rainfall at 1540–1560 A.D. was identified followed by a switch to lighter δ18Ospel inferred as a period of increased rainfall in the late sixteenth century. The amount of cool season precipitation recorded at the local meteorological station was shown to correlate well with pressure fields in Western Russia (Figure 1a), which formed the basis of the paleoclimate interpretations.
 In this study, we address the two principal assumptions and sources of uncertainty in such paleoclimate reconstructions as follows:
 That the dominant patterns of atmospheric circulation and the seasonal distribution of P have remained stable in nature throughout the period of study. We assess the stability of previously observed relationships between cool season precipitation and regional circulation dynamics at this speleothem site over the last 1 ka using the CSIRO Mk3L climate system model (section 2) [Phipps et al., 2011, 2012].
 That the statistical relationship between δ18Ospel and the amount of cool season P have remained stable throughout the period of reconstruction. To avoid making this assumption, we use the T and P output of a three-member ensemble of transient climate model simulations and synthetic δ18Opptn to drive the KarstFOR karst hydrology model. We investigate the variability in δ18Ocmod that is explained by water storage and routing in the karst aquifer at this site over the last 1 ka (section 3) [Baker et al., 2012].
2 Climatic Drivers of the Interannual Variability of Cool Season Precipitation
2.1 Correlations of Observed Climate to Large-Scale Atmospheric Circulation
 In the Near East, the influence of the North Atlantic Oscillation (NAO) on interannual precipitation and temperature trends is reported to be insignificant [Kutiel et al., 2002; Türkeş and Erlat, 2003, 2005], and to our knowledge, no studies have found a significant correlation between precipitation in this region in any season and pressure centers that describe the NAO. Rather, the seasonal position of the Siberian high over Eastern Europe directly impacts upon temperature and precipitation patterns as well as dipole pressure centers over the North Sea and northern Caspian Sea. These demonstrate a robust correlation with the amount of cool season precipitation [Jex et al., 2011] and temperature [Kutiel, 2011; Kutiel and Benaroch, 2002; Tatli, 2007] in the Near East region (Figure 1a).
2.2 Correlations of Simulated Climate to Large-Scale Atmospheric Circulation
 Details of the CSIRO Mk3L model and the simulations used in this study are provided in supplementary information. The model reproduces the observed climatology at the study site (Figure S1) and the relationship between cool season precipitation and the ONDJ 500 mb geopotential height (Figure 1b). This relationship is robust using both raw and smoothed data (Figure S2). Using raw data, a peak correlation of −0.65 occurs at 39°E, 43°N, one grid point to the north of the study site on the model grid. Figure 2 plots the strength of this relationship over the full duration of the 10,000 year control simulation. Generally the correlation is strong (up to −0.7) but there are periods of ~30 years when this correlation weakens substantially and becomes statistically insignificant at the 95% significance level, with correlation coefficients weaker than −0.35. Thus, on periods longer than 30 years, the model suggests that the assumption of stationarity that underlies interpretations of δ18Ospel at this study site is robust.
3 Non-Climatic Drivers of δ18Odw and δ18Ospel
3.1 KarstFOR Model Description and Setup
 We use the modified version of the Bradley et al.  KarstFOR model, exactly as described by Baker et al. (manuscript in revision), to model stalagmite δ18Ο. A full description of the model, input and output data, and a model schematic is provided as auxiliary material. Briefly, the model envisages five water stores: (1) Soil, (2) Epikarst, (3) Karst Store 1, (4) Karst Store 2, and (5) an Overflow Store. Individual drip-water δ18Ο series are produced by assuming that drip-waters are (1) solely derived from a particular water store (e.g., Stal_4), (2) the product of mixing of waters draining from a selection of water stores (e.g., Stal_1, _5, and _6), and (3) a combination of store drainage and recent precipitation δ18Ο (arising through preferential flow through the soil and limestone) (e.g., Stal_2 and Stal_3). Stalagmite δ18Ο series (Stal_1 to Stal_6 in Figure S3) are derived from each drip-water δ18Ο series after allowing for calcite fractionation. As input data, we generated 1000 year synthetic climate series of monthly T, P, PET, and δ18Opttn, representative of the local climate (see Text S2 in the auxiliary material). The karst model was set up with a −11.75‰ initial store value (obtained from stalagmite fluid inclusions at 6 ka) [Rowe et al., 2012] and run for 1 ka, the last 500 years of which overlaps with the duration of growth of speleothem 2p.
3.2 Effect of Flow Routes and Storage Times in the Karst Aquifer
 A number of scenarios were completed for each input series, in which the storage capacity of each store was varied as well as their drainage functions (as summarized in Table S1). All KarstFOR output was smoothed by a 1 year running mean over the instrumental period (1938–2000 A.D.) and by a 6 year running mean for the entire record to reflect δ18Ospel sampling resolution.
 We first present two contrasting examples of the model input series for the instrumental period from the GCM data, their fluxes (F1 to F7) and storage (soil, epikarst, and karst stores), and all of their predicted δ18Ocmod series (Figure 3). Low P input and seasonally high evapotranspiration (ET) limit water stored in the soil (<30 mm month−1 stored) and the subsequent water flux into the epikarst via F1 (<8 mm month−1). The epikarst retains some seasonality in its total amount of storage, as does water flux F3, supplying karst store 1. The epikarst store maintains a supply of water to karst store 1 only, and to drip sites stal_2, _3, _4, and _5 throughout the period of speleothem deposition. In all but one of the scenarios, karst store 2 drains completely and yields no δ18Ocmod output series at stal_1. Stal_6 remains dry throughout in all model runs and drainage scenarios (not shown). Varying the drainage functions and store sizes only impacts on δ18Ocmod at stal_5 and stal_1 (the latter as summarized above). δ18Ocmod at stal_5 is more variable with increasing drainage speed and is essentially a flat line under the slowest draining scenario (not shown). In all drainage and store size scenarios, predicted δ18Ocmod for any one of the sites at stal_2, _3, and _4 is consistent. At these drip sites, only the initial input series and routing through the karst drives variability in their predicted δ18Ocmod series.
3.3 Observed δ18Ospel and Predicted δ18Ocmod Over the Instrumental Period
 Figure 3 shows δ18Ocmod plotted alongside the δ18Ospel for the period 1930 to 2000 A.D. (data for all model runs are plotted in Figure S5). Once smoothed by a 6 year running mean, δ18Ocmod for stal_2, _3, _4, and _5 in both of these model runs captures the low frequency (decadal scale) trends of the observed speleothem record during the twentieth century. However, in the 1 year smoothed data, only stal_2 and _3 (with a preferential flow component) capture the intra-annual variability as observed in the most recent portion of stalagmite 2p. Water from the epikarst store alone is too well mixed to predict observed speleothem variability on an intra-annual scale. Similarly, water from karst stores 1 (stal_5) and 2 (stal_1) (in the slowest draining scenario) is too well mixed to predict observed speleothem variability on an interannual scale.
 Plotting mean δ18O against a measure of variability (coefficient of variation, CV) demonstrates that once smoothed by a 6 year running mean, predicted series for stal_2 in particular have mean values and CVs most similar to the speleothem record for the instrumental period (Figure 4). This suggests that the intra-annual variability evident in the speleothem sampled at a sub-annual scale is likely to be a genuine reflection of routing through the karst aquifer in a manner most similar to that predicted at stal_2 and that integrated sampling effectively eliminates this intra-annual variability in the speleothem and the modeled drip water data of stal_2.
3.4 Observed δ18Ospel Versus Predicted δ18Ocmod Over the Last 500 Years
 The full range of δ18Ocmod series from all model inputs is shown in Figure 5 for the entire 500 years of speleothem deposition (1500–2000 A.D.). The CV values of the model simulations for the entire record, smoothed by 6 years, agree well with that observed in the speleothem, sampled at this resolution. Mean values are, however, offset by as much as 0.5‰ (Figure 4). Whilst the model simulations predict reasonable δ18Ospel values at the earliest and most recent parts of the speleothem record, no model run captures the transition from particularly heavy to light δ18Ospel during the sixteenth and seventeenth centuries, based on any of the input series used. In such an evapotranspiration (ET) sensitive system, any changes to the amount or seasonality of ET would be expected to impact stal_2 and _3 with a preferential flow component. Based on the GCM data for P and T, no such change in ET is present. Possible explanations are
 The speleothem is recording a localized climate signal that is not represented by the adjusted GCM input data or δ18Opptn.
 The GCM does not capture the true variability in cool season precipitation or temperature-related changes in ET at the study site, prior to the instrumental period.
 A process not modeled in KarstFOR with these climate inputs is responsible for the shift in isotopic composition of this particular speleothem: for example, recharge by isotopically lighter groundwater during the late sixteenth and early seventeenth centuries that is not accounted for by the model input series (either the δ18Opptn or the estimated ET based on GCM data).
 This paper tests the validity of a previously published speleothem-based climate reconstruction to provide a guide for how best to apply both climate system and proxy forward models to refine paleoclimate interpretations. The stability of observed relationships between the total amount of local cool season precipitation and regional circulation is assessed using the CSIRO Mk3L climate system model. Reductions in the amount of cool season precipitation and temperature are confirmed as resulting from increased pressure centered over the West Russia/Caspian Sea region, describing a scenario of negative NCP. This relationship is largely stable on millennial timescales, according to a 10,000 year preindustrial control simulation, giving credence to previous climatic interpretations of δ18Ospel. However, during this time, this relationship is observed to break down periodically on time scales less than 30 years.
 The output of multiple simulations of the CSIRO Mk3L climate system model for this location over the last 1 ka, combined with synthetic δ18Opptn, was then used as input data for the KarstFOR hydrology model. We predict the range of possible δ18Ocmod that may occur given these inputs for a range of drainage scenarios and flow routes in the karst aquifer. The model generates δ18Ocmod that matches δ18Ospel over the instrumental period (1930–2000 A.D.). The range of observed interannual variability can be accounted for by the karst model given the GCM input data. However, the lightest values of δ18Ospel during the late sixteenth and early seventeenth centuries and their subsequent transition to modern values cannot be accounted for by the GCM input to the karst model. This suggests that hydrological routing and storage of water cannot explain these particular trends. It is likely that isotopically lighter water recharged the aquifer at this time, which is not accounted for by the model input series of δ18Opptn or PET. If the former is true, then this does not necessarily mean that there was a substantial change to the amount of cool season precipitation, as originally inferred in the speleothem reconstruction of Jex et al. . One solution to the mismatch between proxy data and the karst model would be to use an isotope enabled regional climate model with the KarstFOR model embedded, as an alternative approach for generating a local δ18Opptn input series.
 We have identified and explored some of the sources of uncertainty that underlie the interpretation of speleothem δ18O records. Using the published δ18Ospel record from Turkey and the CSIRO Mk3L and KarstFOR models, we have been able to predict some key features of the observed speleothem record. This supports some of the original interpretations of the capability of this speleothem to reconstruct cool season precipitation and its relevance to understanding regional atmospheric circulation. Finally, we argue that for unambiguous interpretation of δ18Ospel such an approach is necessary and that future methodological improvements include the use of process models such as KarstFOR, embedded within isotope-enabled general or regional circulation models.
 This work was funded by the University of New South Wales, Australia, and undertaken with the assistance of resources provided at the Australian National University through the National Computational Merit Allocation Scheme. The authors thank the reviewers for their thoughtful reviews of the manuscript.
 The Editor thanks Matthew Jones and an anonymous reviewer for their assistance in evaluating this paper.