Geophysical mapping and coring of the central Arctic Ocean seafloor provide evidence for repeated occurrences of ice sheet/ice shelf complexes during previous glacial periods. Several ridges and bathymetric highs shallower than present water depths of ∼1000m show signs of erosion from deep-drafting (armadas of) icebergs, which originated from thick outlet glaciers and ice shelves. Mapped glacigenic landforms and dates of cored sediments suggest that the largest ice shelf complex was confined to the Amerasian sector of the Arctic Ocean during Marine Isotope Stage (MIS) 6. However, the spatial extent of ice shelves can not be well reconstructed from occasional groundings on bathymetric highs. Therefore, we apply a statistical approach to provide independent support for an extensive MIS 6 ice shelf complex, which previously was inferred only from interpretation of geophysical and geological data. Specifically, we assess whether this ice shelf complex comprises a likely source of the deep-draft icebergs responsible for the mapped scour marks. The statistical modeling is based on exploiting relations between contemporary Antarctic ice shelves and their local physical environments and the assumption that Arctic Ocean MIS6 ice shelves scale similarly. Analyzing ice thickness data along the calving front of contemporary ice shelves, a peak over threshold method is applied to determine sources of deep-drafting icebergs in the Arctic Ocean MIS6 ice shelf complex. This approach is novel to modeling Arctic paleoglacial configurations. Predicted extreme calving front drafts match observed deep-draft iceberg scours if the ice shelf complex is sufficiently large.
If you can't find a tool you're looking for, please click the link at the top of the page to "Go to old article view". Alternatively, view our Knowledge Base articles for additional help. Your feedback is important to us, so please let us know if you have comments or ideas for improvement.
 Hypotheses on glacial conditions in the Arctic Ocean range from the suggestion of total absence of sea ice [Ewing and Donn, 1956] to existence of a coherent, thick ice shelf covering the entire Arctic Ocean [Mercer, 1970; Broecker, 1975; Hughes et al., 1977]. These hypotheses were put forward long before icebreakers and nuclear submarines were able to reach the pack-ice covered central Arctic Ocean and map the seafloor. Over the last decade, geophysical mapping of the Arctic seafloor has revealed extensive erosion caused by ice and glacial landforms on ridge crests and plateaus where present water depths are shallower than ∼1000m [Jakobsson, 1999; Polyak et al., 2001].
 Yet not all bathymetric highs shallower than 1000m contain signs of ice impact. Instead, the accumulated geological and geophysical data suggest more limited ice shelves with the most extensive ones constrained to the Amerasian Basin during Marine Isotope Stage (MIS) 6 (Figure 1) [Jakobsson et al., 2010]. In this basin, smaller ice shelves likely also existed during later glacial periods, including the Last Glacial Maximum [England et al., 2009; Polyak and Jakobsson, 2011]. The extent of an ice shelf cannot be reconstructed by geophysical mapping alone, because ice shelves when floating leave no distinct marks behind on the seafloor, in contrast to grounded ice sheets and fast-flowing ice streams. Therefore, spatial reconstructions of the extents of these Amerasian Arctic Ocean ice shelves are necessarily based on indirect evidence (Figure 1) while spatial reconstructions of paleo ice sheets may be based on diagnostic landforms outlining their maximum extents [Kleman et al.2006].
 The widely-used numerical shallow ice approximation ice sheet models do not properly simulate ice streams and coupled ice shelves [van der Veen et al., 2007; Hindmarsh, 2009; Kirchner et al., 2011]. Ongoing modeling efforts focusing on advanced marginal ice dynamics, ice sheet-ice shelf coupling, and disintegration of ice shelves aim to provide improved prognostic centennial-scale simulations in time for the 5th Assessment Report of the IPCC (cf. the community projects Ice2sea and SeaRise, http://www.ice2sea.eu, http://websrv.cs.umt.edu/isis/index.php/SeaRISE-Assessment). Numerical simulations for ice shelf complexes during glacial cycles are rare. Early experiments with focus on the Arctic employed simplified representations of sheet-shelf dynamics [Siegert and Dowdeswell, 1999; Siegert et al., 2001], while more recent simulations account for complex dynamics but have not yet been applied exclusively to Arctic Ocean ice shelves [Pattyn, 2003; DeConto et al.2007; Peyaud et al., 2007; Pollard and DeConto, 2009; Alvarez-Solas et al., 2011; Fyke et al.2011].
 Here we propose a new modeling framework with the aim to amend the spatial reconstruction of the MIS 6 Arctic Ocean ice shelf complex presented by Jakobsson et al.  with an assessment of whether this proposed large Arctic Ocean ice shelf complex can also be supported on statistical grounds. Using appropriate statistical modeling, we specifically address the question of whether the suggested MIS 6 Amerasian ice shelf complex comprised a likely source for the deep-drafting icebergs that grounded in the central Arctic Ocean and of which there is mapped evidence.
 The statistical approach adopted here is based on establishing functional relations between key characteristics determining the configurations of current Antarctic ice shelves. Assuming that similar relations held for the Amerasian ice shelves during MIS6, we predict selected features of this ice shelf complex. Specifically, we derive quantitative estimates of ice shelf area, calving front length and the maximal draft (depth below sea level) along the calving front. We emphasize that we do not claim present-day Antarctic conditions to represent a perfect analog for former glacial conditions in the Arctic Ocean. Nonetheless, our modeling approach provides a range of possible Arctic Ocean paleo ice shelf configurations, based on considering present dimensions and conditions of Antarctic ice shelves as well as firmly based statistical techniques. By construction, our analysis is limited to modeling static ice shelf configurations at arbitrary model times only. Therefore, it is applicable to any other ice shelf system of the Quaternary glacial periods, of which the here investigated MIS6 Amerasian ice shelf complex in the Arctic Ocean is but one example. In particular, we do attempt neither to prove the ice shelf complex's existence nor to perform a realistic paleo simulation comparable to those performed with thermodynamically coupled, continuum-mechanics based coupled ice sheet models.
2 Statistical Modeling Framework: Data and Methodology
 We focus on the MIS6 ice shelf complex put forward as a hypothesis by Jakobsson et al.. Fed by ice streaming in major bathymetric troughs of the Canadian Arctic Archipelago, this complex is reconstructed as several merged ice shelves extending into the Arctic Ocean from the continental shelf between north of Alaska in the west and northern Greenland in the east. This reconstruction represents a generalized view of the maximum MIS6 marine ice sheet extension. It is based on sediment coring and geophysical mapping that revealed iceberg plowmarks and features resembling mega-scale glacial lineations on bathymetric highs (Morris Jesup Rise, Yermak Plateau, Lomonosov Ridge, and Chukchi Borderland) as deep as ∼1000m below present sea level (Figure 1).
 Using statistical modeling, we address the question whether the suggested MIS6 Amerasian ice shelf complex comprised a likely source for the deep-drafting icebergs that grounded in the central Arctic Ocean. This requires modeling of draft values along the calving front of the ice shelf complex. Because no direct evidence exists that outline its spatial extent, we also include modeling of its total area and calving front length. Moreover, since there are no analog ice shelves in the Arctic Ocean today, the statistical modeling is based on relations between characteristic variables for present Antarctic ice shelves and their local physical environments. Once these relations are established, the model is applied to the paleo-Arctic setting. The following three-step methodology is applied (and detailed in the following sections):
 Establish statistical relations between characteristic variables for contemporary Antarctic ice shelves through fitting of a multivariate linear model (MLM), cf. section 2.2.
 Model the largest drafts along the calving front of contemporary Antarctic ice shelves using extreme value theory (EVT), more specifically, apply a peak over threshold (POT) approach to model exceedances of draft above a high, preselected threshold, cf. section 2.3.
 Predict Arctic Ocean MIS6 calving front length, maximal draft along the calving front, and ice shelf area by application of the model derived in steps 1 and 2, cf. section 3.
 In step 1, Antarctic data is used to fit a MLM composed of two regressions with several predictors and correlated errors between both responses. Seventeen Antarctic ice shelves are included, each characterized by eight variables. The latter are classified as “predictors” (x) or “responses” (y), with corresponding subscripts (Tables 1, 2).
Table 1. Contemporary Antarctic Ice Shelves Used in the Statistical Analysis a
Contemporary Antarctic ice shelves used in the analysis, and eight data variables and their abbreviations, as well as the classification of the variables into responses (y), predictors (x), and draft (ydraft) are indicated as follows: Abbot∗(“open”), Amery (“embayed”), Brunt (open), Dotson∗(embayed), Drygalski (open), Ekström (embayed), Filchner (embayed), Fimbul (open), Getz∗(open), Mertz Glacier (open), Ninnis Glacier∗(embayed), Pine Island Glacier∗(embayed), Riiser–Larsen (open), Ronne (embayed), Ross (embayed), Shackleton (open), West (open). xgeom has binary values only (open/embayed) and is therefore listed in parentheses after each ice shelf name in the list of the 17 ice shelves considered. For the five starred ice shelves, the maximal draft values at their respective calving fronts are not considered in the EVT since they are considered to be not representative, see section 2.1 and Figure S6 of the supporting information. Water temperatures are derived from the WOCE database [Orsi and Whitworth, 2004], all other variables are based on Bohlander and Scambos'  MODIS data and DiMarzio et al.'s  ICESat data, the latter in combination with the algorithm of Zwally et al.  to compute ice shelf thickness. Details on the processing of the data are given in the supporting information.
The original data has been log-transformed (natural logarithm). For notational simplicity, we omit the log notation in the text as, from the context, it is clear whether we use the original or transformed scale.
 In step 2, draft along the calving front, ydraft, is extracted as a sequence of observations for each shelf from the Antarctic data set (Table 1). We are interested in extremely large or even the largest drafts and, hence, employ EVT to predict which draft value is exceeded on average at least once along the calving front. We refer to this value as the “return draft”; the concept is borrowed from models of hydrological events in which “return level” and “return period” are used to describe “100 year-floods,” i.e., the return level that has 1% chance of being exceeded in a given year [Coles, 2001; Katz et al., 2002]. In our case, the return period is associated with length of calving front instead of time. Thus, the m-kilometer return draft is expected to be exceeded on average once every m kilometers along the calving front. Note that, for five ice shelves out of the considered 17, the available ydraft data could not be used in the EVT, reducing thus the number of ice shelves considered in this step to 12, cf. also the supporting information, Appendix SA.
 In step 3, the statistical model derived in step 1 and 2 is applied to predict calving front length, ice shelf area, and a probabilistic description of the maximum draft along the calving front of the Arctic Ocean MIS6 ice shelf complex. This is expected to help identify likely source regions for icebergs large enough to scour the seafloor at ∼1000m below sea level. The results also indicate the range of possible ice shelf configurations, of which one is suggested in Jakobsson et al. . Paleo ice shelf variables corresponding to Antarctic predictors are required as input.
2.1 Ice Shelf Data
2.1.1 Contemporary Antarctic Data
 Contemporary Antarctic ice shelf data (ylen, yarea, ydraft, xground, xthick, and xrise, see Table 1) are derived from Bohlander and Scambos'  MODIS Mosaic of Antarctica, in combination with DiMarzio et al.'s  digital elevation model (DEM) obtained from the first seven observation campaigns of the Geoscience Laser Altimeter System instrument (2003–2005) aboard ICESat and Zwally et al.'s  algorithm to compute ice shelf thickness. Elevations in the DEM refer to the EGM96 geoid. Length of calving front, ylen and length of grounding line, xground are defined such that their sum equals the total length of the ice shelf boundary (Figure 2).
 Furthermore, we have introduced a binary variable, xgeom, to classify the shape of an ice shelf as either open or embayed. Specifically, we let the ratio c=ylen/xground determine ice shelves with c≥1 as open, and ice shelves with c<1 as embayed. This geometric criterion may coincide with the intuitive notion of an embayed ice shelf, but does not by necessity. For instance, Abbot ice shelf, Riiser-Larsen ice shelf and Shackleton ice shelf are classified as open, for reasons explained in the supporting information.
 Water temperatures xtemp are retrieved from the World Ocean Circulation Experiment Southern Ocean Data Base [Orsi and Whitworth, 2004], cf. Table 2. A more detailed description of the data is provided in the supporting information, Appendix SA.
 Note that the statistical modeling framework does not limit the number of ice shelves or variables considered but the types of variables. Specifically, we have neither included Antarctic Peninsula ice shelves in our analysis, nor have we accounted for any spatially variable data fields other than ydraft and xtemp, respectively; we provide justification for these restrictions in section 4.
2.1.2 Paleo-Arctic Data
 The paleo ice shelf predictors xground, xrise and xgeom are derived under the assumption that the Laurentide ice sheet reached the continental shelf break and from there extended as an ice shelf. Hence, the present location of the shelf break is taken as the grounding line position of the MIS6 ice shelf. Using the International Bathymetric Chart of the Arctic Ocean (IBCAO) gridded bathymetric model Version 2.0 [Jakobsson et al., 2008a], the shelf break/grounding line is digitized. This clearly implies a maximum scenario for the MIS6 ice sheet extension. The physiographic setting and locations of feeding ice streams, inferred from glacial troughs distinguished in the bathymetry, suggest a subdivision of the grounding line of the ice shelf complex into four segments: A–B, B–C, C–D, and D–E (Figure 1). The segment length is used as input data (xground) for the statistical modeling (Table 3). Note that a meaningful comparison of lengths of paleo and contemporary grounding lines has to compensate for possible differences in measuring resolution (cf. the supporting information, Appendix SB and FigureS2 for details). To prescribe the paleo-predictors xrise and xgeom from inspection of IBCAO, a sea level 92m below the present is assumed for MIS 6 [Rabineau et al., 2006]. The response of the ice shelf complex to different water temperatures, xtemp, along the calving front is investigated. Maximum ice thickness at grounding line, xthick, is not required during model application to the paleo-Arctic.
Table 3. Length of Individual Paleo-Grounding Line Segments in the Amerasian Basin of the Arctic Ocean, and Combinations Thereof a
Length of Grounding Line xground
Number of Ice Rises xrise
Ice Shelf Geometry xgeom
Four different uniform (that is, identical along all ice shelf fronts) water temperatures xtemp are considered (0, −1, −1.8 and −2° C). Segment A–B has not been classified individually as it is only considered in combination with other segments.
2.2 Multivariate Linear Model (MLM)
 Contemporary Antarctic ice shelf data forms the basis of our MLM. Using common statistical practice, we evaluate if responses or predictors need to be transformed to establish functional relations between Antarctic predictors and responses. The responses as well as some predictors have been log-transformed (natural logarithm; see Table 1), but for simplicity we omit the log notation. Note that we do not claim any causality between the predictors and responses as is usually done in classical regression analysis. The intuitive dependence between length of ice shelf calving front ylen and total ice shelf area yarea is confirmed (using Pearson's correlation coefficient as a criterion) and suggests a modeling approach that takes this dependence into account. Therefore, a MLM is applied as opposed to two individual models [Mardia et al., 1979].
 The multivariate linear model used is ylen=xTβlen+εlen and yarea=xTβarea+εarea, where x is a vector containing an intercept and the predictors, where βlen and βarea denote the coefficients, and where εlen and εarea are residual (jointly Gaussian) errors with mean zero, variances Var , Var and correlation Cor(εlen,εarea)=ρ.
 Model space comprises 961 configurations, and model fitting is done with a likelihood approach, i.e., maximizing a bivariate Gaussian density over βlen and βarea, , , and ρ, programmed with the software environment R [Ihaka and Gentleman, 1996; R Development Core Team, 2011]. Over-fitting is avoided through the use of a Bayesian Information Criterion (BIC). Figure 3 shows the 100 best models and the predictors chosen therein (cf. also TableS1, FigureS3 in the supporting information. We select the model with minimal BIC as the optimal one, which leads to the model
where a hat indicates an estimated quantity. Ice thickness at the grounding line, xthick, does not enter the optimal model. Further, the correlation in the errors (estimate of 0.33) is not significant (likelihood ratio test yields a p-value of 0.19). Table 4 lists the estimated coefficients of the chosen, i.e., optimal, model along with their standard error. The choice of the optimal model is not clear cut, but the parameter estimates of the next best models are surprisingly stable. There is no indication of outliers or strong leverage effects in the data. For a more detailed mathematical description of the MLM, as well as a documentation of its predictive skill (cf. the supporting information, Appendices SC1 and SC2, and FiguresS4 and S5).
Table 4. Parameter Estimates of the Regression Analysis Along With the Adjusted Coefficient of Determination (With Standard Errors in Parenthesis) For the Optimal Model a
Length of Calving Front, ylen
Total Ice Shelf Area, yarea
β0 is the intercept, βground the coefficient of xground (log scale), βrise the coefficient of xrise, etc. Hyphen indicates that the predictor is not in the optimal model.
2.3 Extreme Value Theory (EVT)
 In the following, we provide a description of how extreme drafts are statistically modeled. Sections 2.3.1 and 2.3.2 give the theoretical background and detailed statistical justification for, among others, the methodology, threshold selection, chosen densities, etc. In addition, the following paragraph offers also a more intuitive description, facilitated by Figure 4 with data from the Drygalski ice tongue, capturing the essentials of the methodology in a nutshell accessible to a general audience.
 Starting from the raw draft data, evident outliers are identified (Figure 4a) and cleaned data are retained. We are interested in modeling the large drafts: thus, fitting a distribution to all draft values will not adequately address the problem as clearly shown by the histogram in Figure 4b. To restrict the analysis to large drafts, we select a threshold determining these (Figure 4b). Due to the relatively smooth draft, we have to thin the correlated data by identifying clusters and retaining only the cluster maxima (Figure 4c). An appropriate density is then fitted to this subset of draft values (Figure 4d). The fit is validated by comparing theoretical and empirical quantiles (Figure 4e) and by possibly readjusting the threshold. From the parameter estimates, return drafts and their uncertainties can be calculated, as illustrated by Figure 4f.
2.3.1 Rationale and Mathematical Background
 When statistically modeling the maximum value of a series or the maximum values of several series, it is dangerous to fit distributions from first- and second-order quantities and then to extrapolate to (very) small or (very) large quantiles. Consider the following example. Consider independent, standardized Gaussian random variables . The probability that any Xi is larger than 4 is P(Xi>4)=3.167×10−5. However, the probability that out of the samples of sizes 10, 100, and 1000, the maximum that exceeds 4 is , , and and naturally strongly depends on the sample size.
 Further, extreme cases are often caused by different physical mechanisms or processes than those governing the bulk of the data, and one should, therefore, not draw inferences from a distribution fitted over the bulk of the data.
 EVT addresses this issue by modeling the largest observation(s) directly. Indeed, these extreme observations behave asymptotically differently than the central values (which are asymptotically—under suitable conditions—from a normal distribution): it can be shown that for a large class of distributions F, the distribution of the (normalized) maximum maxiXi, where are independent, identically distributed random variables according to F, converges to the so-called generalized extreme value distribution (GEVD) parameterized by a location μ, scale , and a shape parameter ξ [Fisher-Tippett Theorem; Coles, 2001, Thm 3.1.1; Embrechts et al., 1997, Theorem 3.2.3]. The GEVD can also be characterized by the cases ξ<0, ξ=0, and ξ>0.
 Note that the conditions on the distribution F that guarantee convergence to the GEVD are mathematically quite involved and are therefore not detailed here. For example, in the case of ξ>0, one needs to establish that 1−F(x)∼x−αL(x), α>0 for some slowly varying function L [Embrechts et al., 1997, Thm. 3.3.7]. However, virtually all of the classical distributions imply convergence, for example, Cauchy, Pareto, Loggamma (ξ>0); Uniform, Beta (ξ<0); Gamma, Normal, Lognormal (ξ=0).
 Conceptually, EVT comprises three different approaches which are all interlinked. The first approach models only the maximum, the second approach models data exceeding a threshold (peak over threshold, POT), and the third one uses a Poisson process model for POTs. All three approaches are interlinked, but, depending on the situation, some may be more suitable than others. For further details, we refer to the accessible text of Coles , as well as to the supporting information, Appendix SD.
2.3.2 Modeling Extreme Drafts Using EVT/POT
 Because iceberg plowmarks at the Arctic Ocean seafloor, mapped as deep as ∼1000m below present sea level, indisputably represent examples of extreme ice-draft events, EVT/POT is the best-suited statistical modeling framework. Modeling extreme drafts using the POT approach begins with extracting, from the ICESat data, ydraft as a series of observations along the calving fronts of the 17 contemporary Antarctic ice shelves considered in the MLM.
 In a nutshell, let D be a random variable with distribution F. To look at extreme events, we describe the probability of D exceeding a large threshold value u by an additional amount y>0. This conditional probability is
In practice, F is not known. However, under conditions not specified here [Coles, 2001], the distribution for D−u, given D>u, for large enough u, is
and μ, , ξ as in the GEVD (cf. section 2.3.1). The approximation is to be understood as a limiting argument as u increases. The right hand side of equation ((3)) is termed the generalized Pareto distribution (GPD), which, due to the presence of ξand σ(the shape and scale parameters, describing the distribution's tail and spread, respectively), represents a family of distributions. Note that the GPD does not explicitly depend on F.
 To quantify the occurrence of extreme events, we calculate quantiles of the exceedances (that is, values exceeding the threshold u), originally termed “return level” in the context of extreme floods. Mathematically, the calculation of these quantiles requires the derivation of P(D−u>y) (cf. the supporting information, see Appendix SD2 for more details).
 In our specific application, the observations described by the random variable D above are the drafts along the calving front, ydraft. However, the draft data is not independently distributed: nearby values are alike as they fluctuate slower than their resolution (cf. FigureS6). As equation ((3)) is valid for independently distributed variables only, we run a declustering algorithm over the draft data. Essentially, this means that exceedances (values of ydraft which are larger than some preassigned threshold u) are assigned to the same cluster if they are separated by fewer than a specified number (called “run length”) of values above the threshold u. Threshold selection is based on plots of parameter estimates against different thresholds, and we opted for a universal threshold of the 75th percentile of the draft using a run length of 2 for declustering (FiguresSA8 and SA9). Declustering reduces the draft data considerably (for example, to 17 values for Ekström and 71 for the Ross ice shelf (TableS2)). Further, within each cluster, only the largest value is retained and those largest values are referred to as cluster maxima. The cluster maxima are assumed to be independent, and their probability distribution is assumed to converge to a GPD.
 We emphasize that the model is based on (statistically) typical assumptions that are required for reasons of mathematical rigor (for further details on the POT approach and especially on the threshold selection, cf. the supporting information, Appendix SD). Under these assumptions, a GPD is inherently the limiting distribution for increasing length of calving front, and thus, we have eliminated shelves with short, interrupted draft sequences (namely, Abbot ice shelf, Dotson ice shelf, Getz ice shelf, Ninnis Glacier, Pine Island Glacier, cf. Table 1).
 Finally, using the observed cluster maxima, we estimate the parameters ξ and σ of the GPD in equation ((3)) using maximum likelihood (TableS2), cf. also Figure 4d, which illustrates the fit of the GPD to the cluster maxima based on data for Drygalski ice tongue. These estimates are (naturally) sensitive to quantile and run length, but summary statistics based on the estimates from all shelves are very stable. The fitted distributions are assessed with probability-probability and quantile-quantile plots and match the data well (FigureS10). For some shelves, empirical quantiles are slightly higher than theoretical ones, indicating a conservative estimation of the tail heaviness. In our case, a heavier tail implies predictions of larger extreme calving front values, also reflected by the different choices of parameter estimates in section 3. Note that the parameters of the GPD are not expressed in terms of predictors, and thus, per se, no prediction is performed.
3 Results From Model Application in the Paleo-Arctic Setting
 We consider a Paleo-Arctic Ocean scenario and apply the fitted models from Antarctic data described in sections 2.2 and 2.3 in a setting where individual ice shelves form along the grounding line segments B–C, C–D, D–E, A–C, A–E, and B–E (Figure 1). In other words, for these segments, calving front length ylen and ice shelf area yarea are predicted based on the MLM, while maximal draft values and the return draft are estimated using EVT (and, importantly depend on estimates of ylen and thus on xground, xrise, xgeom, and xtemp).
 The first result from model application in the paleo-Arctic setting is presented in Figure 5. There the return drafts of seven Arctic paleo ice shelves are shown, namely: for four ice shelf segments classified as open, and for three ice shelf segments classified as embayed. Segment A–E occurs twice to illustrate the impact of classification on the results. For each segment, return drafts are plotted with ylen set to the predicted mean from the MLM, and for a water temperature xtemp=−1.8°C along the calving front. Note that −1.8°C corresponds to the rounded 25th percentile of xtemp as used in our analysis, a value commonly employed for freezing of seawater (35ppm) and sea ice formation in Ocean Circulation Modeling (cf. FigureS11 for return drafts at 0°C, −1°C, −2°C). With xtemp and ylen set (cf. Table 3), return drafts can be plotted as a function of grounding line length, xground. For all segments except B–C, drafts exceeding 910m are within the 75% upper uncertainty bound if based on the third quartile estimates. If based on median parameter estimates, return drafts do not exceed 500m even within the 75% upper uncertainty bound. Highest return drafts are obtained for segment A–E irrespective of its classification.
 Technically, predictions of the draft along paleo-Arctic ice shelf calving fronts as shown in Figure 5 are obtained as follows. The threshold as well as the GPD estimates and their uncertainties hardly correlate with the predictors used in the MLM. Hence, for the Paleo-Arctic setting, we only assume that similar parameter values are applicable. Thus, we first assume that the average draft threshold over all open/embayed ice shelves is a generic threshold for open/embayed ice shelves (TableS2). Second, we choose the following: (a) the median and (b) the third quartile of the estimates and the uncertainties for open/embayed shelves as shape (ξ) and scale (σ) parameters when applying the model in the paleo-Arctic setting (Table 5). Choices for ξbased on a quartile argument are justified, given the uncertainty in the estimates. For open shelves, the selected value is covered by all confidence intervals except for Mertz Glacier. Thus, the parameters yield a “predictive” distribution along with its quantiles for the return levels. Recall that a typical illustration of the concept of return level is a 100 year flood, for which the return level has 1% chance of being exceeded in a given year; the return period is 1/0.01 = 100 years [Katz et al.2002]. Here the return period is associated with the length of the calving front instead of time. Thus, the m-kilometer return draft ym+uis expected to be exceeded on average once every m kilometers along the calving front.
Table 5. Parameter Estimates (Threshold Draft Values u in Meters, Shape and Scale Parameters as Well as their Uncertainties) Used in the EVT Model Portation for the Two Different Ice Shelf Classes “Open” and “Embayed” a
Var(·) and Cov(·,·) denote the variance and covariance, respectively.
 The second result from model application in the paleo-Arctic setting is presented in Figure 6, where the joint predictive distribution of the calving front length ylen and ice shelf area yarea is plotted for Arctic Ocean ice shelves forming along (combinations of) segments sketched in Figure 1. Results for open shelves (Figure 6a) and embayed shelves (Figure 6b) are displayed, with water temperature xtemp set to −1.8°C along all calving fronts (for 0°C, −1°C, −2°C, cf. FigureS12). Circles denote the mode (highest density, indicating the most likely configuration), crosses denote the predicted mean responses of ylen and yarea. The solid (dashed) curves encompass the 90% (75%) confidence regions of the joint prediction of ylen and yarea. Comparing Figures 6a and 6b, it is seen that in the paleo-Arctic setting, the predicted joint distribution of ylen and yarea has a wider range for open shelves than those for embayed ones. Further, the influence of the classification of ice shelf geometry is illustrated by performing predictions for segment A–E twice, treating it successively as open and embayed. In the particular case of segment A–E, the mean calving front length changes from ∼860km (embayed) to ∼3010km (open), while the associated ice shelf area changes from ∼1.4×106km2 (embayed) to ∼2.9×106km2 (open). It must be emphasized, however, that the predictive distributions do not incorporate any physical constraints, e.g., that for a particular length of the calving front the total area cannot exceed certain values (see section 4).
 Technically, the joint distributions are derived from equations (1) and (2) using the predictors given in Table 3, the estimated coefficients given in Table 4, and the prediction and estimation uncertainty again given in Table 4. The marginal predictive distribution of the log-response is a non-central Student's t-distribution. The joint predictive density of the responses (the product of the back-transformed marginal ones) is not spherically symmetric around its mode. The strong asymmetry in Figures 6 and S12 is induced by the back-transformation to the original scale: An increase of 1° in water temperature results in roughly one unit increase in the length of the calving front in the log scale (Table 2), implying the strong changes in the original scale. Yet predictions for 0°C have to be interpreted with caution as observed temperatures range between −1°C and −2°C.
 The idea of reconstructing Arctic Ocean paleo ice sheets based on analogies with Antarctica is far from new. Mercer  pointed out similarities between the Arctic Ocean and the ancient sea now taken over by the West Antarctic Ice Sheet's domes, ice streams, and ice shelves. Both areas are close to the geographic poles, and both are virtually landlocked. The West Antarctic Ice Sheet must be removed in order to envision that there once existed a partly landlocked sea there. From these observed analogies, Mercer  suggested that similarly to West Antarctica, the Arctic Ocean during glacial periods hosted thick ice shelves, fed by ice streams draining large marine ice domes. Further building on Arctic-Antarctic analogies, Hughes et al.  suggested that ice shelves must have filled the entire Arctic Ocean during the Last Glacial Maximum (LGM) in order to prevent the marine portions of the North American and Eurasian ice sheets from collapsing. Since the first mapping, data revealed traces of ice grounding in the central Arctic Ocean as deep as 1000m below the present sea level [Jakobsson, 1999; Polyak and Jakobsson, 2011], the hypothesis of huge thick paleo ice shelves covering the Arctic Ocean has been revisited and discussed in numerous articles, e.g., [Bradley and England, 2008; Engels et al., 2008; Grosswald and Hughes, 2008; Jakobsson et al., 2010]. Studies of sediment cores suggest that the largest and deepest drafting ice shelves existed during Marine Isotope Stage 6, about 140,000 years ago, [Jakobsson et al., 2010].
 Due to the situation that contemporary three-dimensional thermomechanical numerical ice models are still challenged by the marginal ice dynamics in coupled sheet/stream/shelf complexes, we explored a statistical approach to address plausible extents of a MIS6 Arctic Ocean ice shelf. We followed the line of thought suggested by Mercer  and assumed that MIS6 Arctic Ocean ice shelves behaved similarly to current Antarctic ice shelves, and that they also were scaled similarly. This assumption forms the basis for our statistical modeling approach. It is a strong assumption that naturally could be debated. However, we consider it a viable working hypothesis. It should be noted that not all Antarctic ice shelves were incorporated in our statistical “tuning” database. For instance, ice shelves with rather heterogeneous physiography have not been considered, as their classification is beyond the capabilities of the predictor xgeom. The simple geometric criterion on which the classification of an ice shelf as either open or embayed is based can not be applied to ice shelf configurations that are, e.g., pinned at their seaward edge, by a (chain of) islands/ice rises. Furthermore, it is of limited use in the classification of ice shelves that have joint boundaries with other ice shelves, such as, e.g., the Brunt ice shelf and the Riiser-Larsen ice shelf. For the 17 ice shelves considered here, xgeom proved a useful and stable predictor. Moreover, we consider it acceptable to disregard Antarctic Peninsula ice shelves from the analysis because it seems far-fetched to claim, in the spirit of Mercer , similarities between their general configuration and possible Arctic Ocean paleo ice shelves.
 A yet unresolved issue in the analogy approach is that the largest Antarctic ice shelves presently generally do not produce icebergs drafting deeper than ∼350m, while plowmarks in water depths deeper than 500m are attributed to icebergs originating from either outlet glaciers or ice shelves fed from major interior basins [Dowdeswell and Bamber, 2007]. Plowmarks comparable in depth to the ones mapped in the Arctic Ocean have not yet been detected in the Antarctic.
 However, basal accretion is observed for a number of Antarctic ice shelves [Zotikov et al., 1980; Engelhardt and Determann, 1987; Oerter et al., 1992; Khazendar et al., 2001] and has been suggested as a possible mechanism responsible for seaward thickening of ice shelves. In Jakobsson et al. , it is hypothesized that Atlantic waters might not have entered the Amerasian Basin during glacial periods so that the Canada Basin might have become a very cold environment with great potential for accretion and, hence, deep-draft ice shelves.
 We note that modeled predictions of return draft for embayed ice shelves appear counter-intuitive at first sight, as they can be smaller than return draft for open shelves. However, this must be viewed against the fact that the draft data of four of the eight Antarctic ice shelves classified as embayed could not be used in the EVT. Enlarging the database by more draft data for contemporary Antarctic ice shelves of type embayed will likely yield improved predictions, but, as yet, awaits implementation.
 Improved predictions are also expected once spatially variable fields such as water temperature along the calving front, xtemp, are no longer reduced to a single number prior to entering the MLM. The crude averaging techniques applied in the derivation of xtemp render xtemp a less stable predictor than, e.g., grounding line length, xground, and ice shelf geometry, xgeom (cf. Figure 3). Once the variation of xtemp with latitude, longitude, and depth is accounted for in the MLM, we expect xtemp to play an increasingly important role as predictor in our statistical framework reflecting, eventually, observationally confirmed evidence [Jenkins et al., 2010]. Until then, we refrain from considering additional spatially variable fields as predictors, although, e.g., surface air temperature data is available for all ice shelves.
 The three-step statistical approach proposed here results in reasonably simple and robust models. The results would remain essentially the same even if minor technical model refinements were made such as the use of a formal BIC criterion in the MLM in step1, or employment of an automatic threshold selection in the POT/EVT context in step2. Although the EVT modeling is performed for an ice shelf complex at one (arbitrary) point in time, we argue that considering “replicates” over time in order to produce a predictive distribution for the return draft will likely not lead to improved insights due to the uncertainties associated with the estimation of the return draft. However, it should be kept in mind that Antarctic ice shelves are currently at interglacial extents. Therefore, our predictions for Paleo-Arctic Ocean ice shelf configurations are likely to represent lower bounds for the glacial MIS6 ice shelf complex.
 The main goal for our statistical modeling was to address whether the mapped traces of ice grounding in the central Arctic Ocean could be caused by icebergs originating from an Amerasian ice shelf, if assuming an environment similar to that of Antarctic ice shelves. The deepest mapped ice grounding is located on Morris Jesup Rise, and it exceeds the present water depth of 1000m (Figure 1c). However, those iceberg plowmarks are from singular deep-drafting icebergs. It has been shown that icebergs occasionally capsize and, for a short duration of time, they reach depths greater than their original drafts. This cannot be excluded in the Morris Jesup Rise case. On the other hand, the scours on Chukchi Borderland, Yermak Plateau, and Lomonosov Ridge clearly suggest armadas of icebergs, likely composed of tabular icebergs originating from ice shelves [Dowdeswell et al., 2010; Jakobsson et al., 2010]. The features resembling mega-scale glacial lineations on Chukchi Borderland and Yermak Plateau are located in present water depths of ∼900−400m and 530 m, respectively. It should be noted that the deepest ice grounding on the Lomonosov Ridge may have been caused by icebergs of Eurasian sources [Jakobsson et al., 2008b; Kristoffersen et al., 2004; Polyak and Jakobsson, 2011], although the presently available data are not conclusive.
 For all ice shelf segments considered, return drafts exceeding 910m are not obtained when predictions are based on median estimates. However, based on third quartile estimates, return drafts exceeding 910m lie within the 75% upper uncertainty bounds for all segments except B–C. We see this as a first indicator that the MIS6 ice shelf complex must have comprised more than just an ice shelf along segment B–C, fed by ice streams in the Amundsen Gulf and McClure Strait only. Indeed, given a hypothetical extension of the ice shelf along B–C to the west (reaching the Chukchi Borderland, thus with grounding line A–C), predicted return drafts at the calving front larger than 910m are within the 75% uncertainty bound. Similar return drafts are obtained for individual ice shelves along C–D, and D–E, respectively. These results suggest that a small, confined ice shelf in the Amerasian sector of the Arctic along B–C only can be excluded as a possible source of icebergs large enough to cause the plowmarks on the Morris Jesup Rise and the Yermak Plateau. Rather, such bergs could have calved from the fronts of an ice shelf complex extending either westward from McClure Strait and toward the Chukchi Borderland (A–C), or from east of McClure Strait to Ellesmere Island (C–D), or from Ellesmere Island to the northernmost coast of Greenland (fed by an ice stream in Nares Strait, segment C–D). Combining these ice shelves into one extending from Point Barrow to the northernmost coast of Greenland (B–E), return drafts of 780m are modeled based on third quartile estimates. Within the 75% upper uncertainty bound, return drafts exceed 1500m for segment B–E. Similar results are obtained if the ice shelf complex extends from the Chukchi Borderland (A–E). Since three independent and individual ice shelf configurations (A–C, C–D, and D–E) are modeled to have calving fronts allowing for return drafts exceeding ∼910m within the 75% upper uncertainty bound, we argue that one should not, a priori, claim ice shelves to be absent from either one of those regions. Hence, our statistical analysis of the Arctic Ocean MIS6 ice shelf complex proposed by Jakobsson et al.  indicates that these large Amerasian Ocean ice shelves could indeed have been the sources of deep-draft icebergs, the traces of which have been mapped as deep as ∼1000m below the present day sea level in parts of the central Arctic Ocean.
 As the MLM approach involves a crude classification of ice shelf geometry only, exact topographies of an Arctic MIS6 ice shelf configuration can not be modeled. For example, consider the shelves with grounding line A–E and B–E. The former has a larger grounding line and the prediction of ylen, yarea are thus larger compared with the latter. However, the minimal calving length based on the geodetic distance of the segment end points on the sphere are 2231km (A–E) and 2521km (B–E), indicated by the dotted lines in Figure 7. The associated ice shelf areas are measured as 1.45×106km2 (A–E) and 1.06×106km2 (B–E), and are marked by squares in Figure 7. The magenta diamond indicates length of calving front (3080 km) and ice shelf area (675,801 km2) for the MIS6 ice shelf complex proposed by Jakobsson et al. . According to the statistical model proposed here, this ice shelf configuration is just outside the 90% confidence region (configuration B–E). This may indicate that either Jakobsson et al.  slightly underestimated the ice shelf extent or that modeled ice shelf areas are overestimated when applying the model in the paleo-Arctic setting.
 Spatial reconstructions of Quaternary glaciations in the Arctic Ocean region portray multiple episodes of glacial advances and retreats, associated with repetitive impact from floating ice shelves fringing the waxing and waning continental Amerasian and Eurasian ice sheets [Dyke et al., 2002; Svendsen et al., 2004; Jakobsson et al., 2010]. Marine glacigenic landforms used to reconstruct Quaternary glacial conditions in the Arctic Ocean include mega-scale glacial lineations, iceberg plowmarks, flutes, and redeposited sediment accumulations [Jakobsson et al., 2008b; O'Regan et al., 2010]. The remarkable depths—approximately 1000m below the present sea level—at which iceberg deep-draft scours are mapped in the Arctic Ocean, call for a modeling approach designed to complement the available observational evidence, and eventually, the spatial reconstructions based on geophysical mapping alone.
 We have proposed a statistical model in which the general dimensions of Arctic Ocean paleo ice shelves are predicted from relations between contemporary Antarctic ice shelf dimensions and their local physical environments. Critical to the identification of possible sources of deep-drafting icebergs is a rigorous analysis of the ice shelves calving fronts' thicknesses. We have employed extreme value theory to account for the fact that the mapped deep-draft plowmarks must rather be extreme events than common ones. Thereby, our model predictions of extreme drafts for an Arctic Ocean paleo ice shelf complex are derived within a firmly based statistical framework that is specifically designed to deal with extreme, rather than common events. Indeed, predicted extreme ice shelf drafts match observed deep-draft iceberg scours if the ice shelf complex is sufficiently large. Thus, additional modeling based support is provided in favor of the extensive MIS6 ice shelf complex discussed by Jakobsson et al. , and which hitherto was inferred from interpretation of geophysical and geological data only.
 The three-step statistical model proposed here is robust, however, further refinements are possible. Obviously, considering more ice shelves and more data in the MLM is one option. However, a better representation of variables that have until now entered the MLM in a very simplified manner (water temperature) will likely yield more substantial improvements than those achievable by simply increasing the amount of data. Only after spatially variable fields can be properly accounted for is it reasonable to include, e.g., surface air temperature and sub-ice shelf melt/accretion rates.
 Furthermore, the statistical approach itself can also be refined. Instead of the static approach chosen here, a hierarchical dynamical spatio-temporal statistical model could be employed. Then, the draft (or, more generally, ice shelf thickness over time) is stochastically modeled using input variables like ice thickness at and ice flux across the grounding line, ice flow velocity, water temperature, etc. Such a stochastic model would consist of simplified dynamic partial differential equations governing ice shelf dynamics complemented with (prior) distributions for all unknown parameters or unresolved processes, and would, as such, be able to shed light on especially the temporal evolution of Arctic Ocean paleo ice shelf complexes.
 N.K. and R.F. are joint first authors of this manuscript. N.K. thanks C. Stover Wiederwohl, Texas A&M University, for introduction to and guidance through the WOCE-SODB database during the “Oden Southern Ocean 0910” cruise to Pine Island Bay/West Antarctica. R.F. acknowledges funding from SNSF 129782, 143282 and URPP Systems Biology. This is a contribution from the Bolin Center for Climate Research at Stockholm University, Sweden. We thank the editor, Bryn Hubbard, the associate editor, Mike Bentley, as well as Jesse Johnson, and five anonymous reviewers for valuable comments on the manuscript.