Evolutionary spectral analysis has been used to study changes through time in the variability of ENSO-related time series. However, the significance of estimated evolutionary spectra has not been formally assessed. This paper describes a test for non-stationarity based on an estimate of the evolutionary spectrum and a time series bootstrap procedure. The test is applied to the seasonal time series of sea level pressure at Darwin. No significant non-stationarity is found.
 El Niño-Southern Oscillation (ENSO) is the dominant source of interannual variability in global climate. There is considerable interest in understanding the long-term historical stability of ENSO variability [e.g., Trenberth and Hoar, 1996, 1997; Harrison and Larkin, 1997; Rajagopalan et al., 1997]. This interest stems in part from a possible connection between ENSO variability and large-scale warming. A natural way to detect changes over time in the variability of a time series of a direct or indirect measure of ENSO is through evolutionary spectral analysis [Priestley, 1981]. As described in more detail below, this involves estimating the spectral density function locally in time. Although evolutionary spectra have been estimated for ENSO-related time series by Mann et al. , Urban et al. , and others, no formal assessment of the significance of these estimates appears to have been made. This paper describes a test for non-stationarity based on an estimate of the evolutionary spectrum. Significance is assessed through a time series bootstrap procedure. The test is applied to a seasonal time series of sea level pressure at Darwin and no evidence of non-stationarity is found.
 Consider a time series X = (X1, X2,…, Xn) of length n. Interest centers on testing the null hypothesis Ho that the spectral density function of X is stationary over time against the alternative hypothesis H1 that the spectral density function is non-stationary. Let Xt(m) = (Xt−m, Xt−m+1,…, Xt+m) be the subseries of X of length 2m + 1 (m < (n − 1)/2) centered at time t (t = m + 1, m + 2,…, n − m) and let:
be the periodogram of the subseries Xt(m), where ωj = 2π j/(2m + 1), j = 1, 2,…, m. We will refer to It(ωj; m) with t fixed as the local periodogram and we will refer to the ordered series of local periodograms Im+1(ωj; m), Im+2(ωj; m),…, In−m(ωj; m) as the evolutionary periodogram.
 Under Ho, It (ωj; m) has an approximately exponential distribution with mean f(ωj), where f(ω) is the true spectral density function of X [Priestley, 1981]. It follows that, under Ho, f(ωj) can estimated by the average over time of the local periodograms:
A test of the goodness of fit of this estimate to the individual local periodograms can be based on the exponential deviance:
[McCullagh and Nelder, 1989]. To use D to test for non-stationarity, it is necessary to know its distribution under Ho. When ∣t − t′∣ < 2m + 1, the subseries Xt(m) and Xt′(m) overlap, It(ωj; m) and It′(ωj; m) are not independent, and the standard distributional result for the deviance, which assumes independent observations, does not hold. Instead, the statistical significance of the observed value of D can be assessed using the so-called sieve bootstrap [Bühlmann, 2002].
 The sieve bootstrap, which was specifically developed for time series data, proceeds by fitting a stationary autoregressive model of order p to the complete time series X. This model has the form:
For fixed p, the parameters μ and ϕ1, ϕ2,…, ϕp can be estimated by the method of maximum likelihood (ML) and, by fitting models of different order, the estimated order can be found by minimizing Akaike's Information Criterion (AIC). Details are available in most texts on time series analysis [e.g., Brockwell and Davis, 1991]. Let:
be the residuals from the fitted model, where and 1, 2,…,p are the ML estimates of the corresponding parameters. Form a bootstrap time series X* according to:
where X*t = Xt for t = 1, 2,…, and R*t is sampled at random with replacement from the observed residuals. Treating the bootstrap series X* in exactly the same way as the original data, find the value D* of the deviance. Repeat the procedure a large number of times and estimate the significance level by the proportion of bootstrap series for which D* exceeds the observed deviance D.
 This section presents the results of applying the method described in the previous section to test for non-stationarity in the standardized seasonal time series of sea level pressure at Darwin for the period 1876–2002. This time series, which is shown in Figure 1, has a total length of n = 508 and is an updated version of the time series analyzed by Trenberth and Hoar . Figure 2 shows the evolutionary periodogram for this time series for m = 80. For this choice of m, each subseries covers approximately 40 years. In Figure 3, the spectral estimate (ωj) is shown. The deviance for this estimate is D = 25142.
 Turning to the sieve bootstrap, the order selected by the AIC is = 4. The estimates of the autoregressive parameters for this model are 1 = 0.51, 2 = 0.18, 3 = −0.07, and 4 = −0.08. The initial standardization of the time series ensures that the mean μ is 0. Of 100 bootstrap series generated from this fitted model, 60 had deviance values D* larger than 25142, for an estimated significance level of 0.60. We repeated the test using selected values of m between 80 and 200 and fitted AR models of order 3 and 5 with no qualitative change in the estimated significance level. This analysis provides no evidence to support non-stationarity in the time series in Figure 1. Similar results were found for a time series if sea surface temperature in the so-called Niño-3 region.
 The purpose of this paper has been to test for non-stationary variability in an ENSO-related time series. The test is based on a simple estimate of the evolutionary spectrum in conjunction with a time series bootstrap procedure. It is worth pointing out that, under the alternative hypothesis of non-stationarity, the evolutionary periodogram is not a good estimate of the evolutionary spectrum. However, interest here has centered on hypothesis-testing and not on estimation and the evolutionary periodogram is both convenient and appropriate for this purpose.
 The results of the previous section indicate that local variations in ENSO over the past century – at least as reflected in sea level pressure at Darwin and sea surface temperature in the Niño-3 region – are not inconsistent with overall stationarity. The test applied here considered variability at the full range of frequencies. The analysis could be sharpened by restricting attention to a range of frequencies over which non-stationarity is of particular interest. This range would have to be specified independently and, in particular, could not be chosen on the basis of the evolutionary periodogram (or other evolutionary spectral estimate).
 The authors thank two anonymous reviewers for their comments.