In spite of considerable efforts, long-range forecasting of Indian summer monsoon rainfall (ISMR) is still a challenge for both statistical and dynamical tools. We highlight the winter-to-spring Pacific North America (PNA) oscillation as a predictor for the ISMR. A PNA-related index is proposed that is highly correlated with the following summer precipitation over India and is also a precursor of El Niño-Southern Oscillation (ENSO) events over recent decades. The PNA index compares well with other predictors used in operational statistical models for ISMR prediction. A multiple linear regression scheme is tested with a cross-validation hindcast approach and confirms the added value of our predictor, at least over the period 1958–2005. Nevertheless, the predictor shows less skill over the first half of the 20th century. Possible physical mechanisms of this teleconnection are also briefly discussed and could involve both a tropical Pacific sea surface temperature and Eurasian snow cover pathway.
 Forecasting Indian summer monsoon rainfall (ISMR) is of great importance for the livelihoods of more than one billion of people and for India's economy. Year-to-year fluctuations of the amount of precipitation received during the monsoon season (June to September) can cause severe droughts or floods which dramatically affect the agricultural sector and have an adverse impact on economies and societies in South Asia. The influence of El Niño-Southern Oscillation (ENSO) on Asian monsoon variability has been extensively documented by both observational and numerical studies and is now widely recognized by the climate community [Soman and Slingo, 1997]. An inverse relationship has been established between the East Equatorial Pacific sea surface temperature (SST) and the ISMR, with a positive (negative) phase of ENSO associated with a deficient (heavy) monsoon rainfall over India. However, this relationship exhibits strong multi-decadal variability and has diminished since the 1980's [Kripalani and Kulkarni, 1997; Krishna Kumar et al., 1999]. Moreover, ENSO predictability shows a significant spring barrier so that the Pacific SST cannot be used as an efficient predictor of the Indian summer monsoon [Webster and Yang, 1992].
 Though many efforts have been conducted over recent decades for developing both statistical and dynamical tools, the long-range forecasting of ISMR remains a challenge for the climate community [Gadgil et al., 2005]. Statistical predictions are based on more-or-less established relationships between ISMR and various climate parameters considered as potential predictors, like regional anomalies in SST or atmospheric circulation observed a few months, seasons or years before the summer monsoon. Dynamical predictions rely on coupled or forced general circulation models (GCMs) and therefore have a more physical basis, but do not necessarily outperform statistical schemes given the difficulty of state-of-the-art GCMs in simulating the salient features of the monsoon climate [Wang et al., 2005]. In view of these results, the Indian Meteorological Department (IMD)'s operational seasonal forecasting system is still based on statistical models using a relatively large number of predictors with a two-stage strategy: a first forecast is issued in mid-April and a second one by the end of June. Recently, more robust multiple linear regressions were developed with a lower number of predictors after the failure of the operational forecasts in 2002 and 2004 [Rajeevan et al., 2006].
 The present study is aimed at suggesting a new predictor for improving the ISMR hindcasts (and hopefully forecasts) issued by the IMD. This predictor is related to a well-known atmospheric circulation pattern, the Pacific North America (PNA) oscillation [Wallace and Gutzler, 1981]. The loading pattern of the PNA is characterized by alternating centres of anomalous pressure that arc north-eastward through the North Pacific Ocean, through Canada, and then curve south-eastward into North America. Its link with ENSO has been extensively studied but the role of tropical pacific SST in triggering or amplifying internal modes of variability such as the PNA is still a matter of debate [Trenberth et al., 1998; Straus and Shukla, 2002]. The PNA influence on regional temperature and precipitation is well established, but few studies have focused on its possible remote impacts. Using both reanalyses and coupled ocean-atmosphere simulations, Yu et al.  suggest that ENSO controls only of the PNA variability, which in turn has apparently a limited influence on tropical diabatic heating in boreal winter. Here we demonstrate that a PNA index averaged over winter and spring shows a significant statistical relationship with the subsequent Indian summer monsoon and can be used as an efficient predictor of IMSR in a simple multiple linear regression scheme.
2. Data and Methods
 To represent the ISMR, we use the June to September (JJAS) seasonal mean All India Rainfall (AIR) index available over the 1871–2005 period, an area-weighted average from 29 Indian Rainfall subdivisions [Parthasarathy et al., 1995]. GPCC (Global Precipitation Climatology Centre) precipitation reanalyses for the 1901–2007 period were downloaded at http://gpcc.dwd.de. Sea level pressure (SLP) and 500hPa geopotential height (Z500) monthly fields are derived from the European Centre for Medium-Range Weather Forecasts (ECMWF) reanalyses (1958–2001) and operational analyses (2002–2007), a dataset which will be named ERA50 hereafter. Global SSTs are derived from the Hadley Centre climatology (HadSST) available over the 1870–2006 period. We also use the Hadley Centre SLP climatology (HadSLP) available over the 1850–2003 period (http://hadobs.metoffice.com/index.html). The equatorial Pacific upper ocean heat content (mean temperature between 0 and 300 meters depth) was derived from SODA reanalyses [Carton et al., 2000]. To remove global warming effect, all time series have been detrended using a simple linear fit before computing seasonal anomalies.
3.1. PNA-Monsoon Relationship
 A monthly PNA index is first constructed according to the Wallace and Gutzler definition [Wallace and Gutzler, 1981] based on the Z500 distribution and is averaged from December to May (DJFMAM). This index is strongly correlated with the ISMR index over the 1958–2005 period (correlation coefficient R = 0.47, p < 0.001). Figure 1a shows the regression of the summer precipitation and of 850 hPa wind over the DJFMAM PNA index. The positive relationship with the monsoon rainfall is clearly visible, a cyclonic anomaly over India and an increase of the moist south-westerly flow over the Arabian sea driving more precipitable water towards the Indian subcontinent.
 To have a wider perspective on the PNA-monsoon relationship, a pseudo-PNA index was also computed from the HadSLP sea level pressure climatology (1850–2003). Figure 1b shows the correlation pattern of SLP reanalyses with the original PNA index over the 1958–2007 period. The strongest correlation is found over the two centres located over the North Pacific ocean (see Figure 1b), so that it was decided to keep only this dipole pattern in the calculation of our pseudo-PNA index (hereafter called the “North Pacific Dipole index”):
where SLP are standardized mean sea level pressure values. As suggested in Figure 2a, the DJFMAM NPDI computed with ERA50 is highly correlated with ISMR for the period 1958–2005 (R = 0.56 and p = 0.0001). The NPDI is therefore a better predictor of the Indian monsoon than the original PNA index of Wallace and Gutzler . These correlations suggest that a positive DJFMAM anomaly of the PNA oscillation during winter and spring is followed by strong monsoon rainfall over India in the following summer, and vice versa. The individual correlations between ISMR and the two parts of the dipole are 0.43 and −0.41 with the subtropical and north Pacific centre of action respectively. Thus, they both play a role in the NPDI-ISMR correlation.
 To assess the contribution of the NPDI compared to a canonical ENSO index, Figure 2b shows the lead-lag monthly correlations between ISMR and the monthly timeseries of NPDI and Niño-3.4 index (i.e., the average JJAS SST anomalies in the 120°W-170°W/5°S-5°N domain). The monthly time series have been smoothed using a 5-month sliding window in order to increase the signal to noise ratio. The maximum NPDI correlation (black curve) with summer monsoon rainfall appears in March, when the Niño-3.4 correlation (red curve) shows a rapid decline. The NPDI is thus less sensitive to the spring predictability barrier than ENSO and therefore represents an interesting precursor of the monsoon. The Niño-3.4 index correlation with the DJFMAM NPDI (blue curve) shows that the ENSO variability leads the NPDI by several months, but also suggests that the NPDI is itself a potential precursor of ENSO. This feature is more than an artefact due to the quasi periodicity of ENSO, since the NPDI- Niño-3.4 index correlation is stronger than the Niño-3.4 SST autocorrelation.
Figure 2c shows the sliding correlations between the NPDI computed with the HadSLP2 data, the ISMR index and the Niño-3.4 index. The NPDI-ISMR relationship (solid black line) is strong and stable during recent decades, but was not significant in the first half of the 20th century and in the late 19th century. The ENSO-ISMR relationship (solid red line) is more stable, but shows an apparent weakening over recent decades [Torrence and Webster, 1999]. This weakening has been attributed to several reasons such as global warming [Krishna Kumar et al., 1999], stochastic noise [Gershunov et al., 2001], or more recently Atlantic SST variability [Kucharski et al., 2008]. The strong PNA-monsoon relationship is observed since the 1960s, a period characterized by a significant lead-lag PNA-ENSO relationship (solid blue line). It suggests that the strong NPDI-ISMR correlation during recent decades could result from the apparent influence of the PNA oscillation on the ENSO development. We speculate that a possible mechanism could be the persistence of the PNA in a particular phase during winter months involves circulation and SST anomalies in the North and equatorial Pacific. An anticyclonic (cyclonic) circulation, associated with a positive (negative) NPDI, is associated with a strengthening (weakening) of the easterly winds near the equator. This modulation of the easterly winds can affect the intraseasonal occurrence of westerly wind bursts which are known to play a role in the triggering of ENSO events [McPhaden et al., 1998; Belamari et al., 2003].
 These results suggest that the PNA-monsoon predictive relationship is embedded in a broader ENSO-PNA-monsoon system. However, the partial correlation between the DJFMAM NPDI and ISMR, with the JJAS Niño-3.4 SST influence removed by linear regression, remains significant (R = 0.44, p < 0.01). This residual correlation suggests that, besides tropical Pacific SST, there is another pathway between the winter/spring PNA and the following summer monsoon variability.
 A recent study of the Eurasian snow-Indian monsoon relationship has emphasized that the eastern Eurasian snow depth in spring was positively correlated with the subsequent ISMR between 1966 and 1995 [Peings and Douville, 2009]. The PNA oscillation is one of the extratropical circulation modes which modulates the snow depth variability over Eurasia [Popova, 2007]. Indeed, the NPDI exhibits a strong correlation with the eastern Eurasian snow depth over this period (close to 0.6). We hypothesize that the snow-monsoon relationship might therefore be a consequence of the PNA influence on both spring snow cover and subsequent Indian summer monsoon. Alternatively, it might also be evidence of another PNA-monsoon pathway through an active role of the Eurasian snow cover. This topic is beyond the scope of the present study and will need further investigation, including sensitivity experiments with atmospheric GCMs.
 To help understand the multi-decadal variability of the PNA-ISMR relationships, we have computed the standard deviation of the NPDI over a 31-year sliding window (dashed black line on Figure 2c). The strength of the NPDI correlation with ISMR (and ENSO) is in phase with the epochal variations of the NPDI variance during the 20th century. This result suggests that the NPDI influence vanishes when its year-to-year variability is not strong enough.
3.2. Implications for the Statistical Prediction of ISMR
 The next step is to assess if the NPDI can be useful in the context of statistical forecasting, using a multiple linear regression model as that of Rajeevan et al. . The list of their predictors used for the two-stage models (i.e., April and June forecasting) is given in Table 1, as well as their respective correlation coefficient (CC) with the ISMR index over the period 1958–2005. We have derived the same predictors from the datasets described in Section 2 and our results are therefore slightly different. Except for two first stage predictors, all correlations are significant at the 99% confidence level and the DJFMAM NPDI exhibits the strongest correlation with ISMR. Among the other ISMR predictors, correlations with NPDI are significant for Niño-3.4 SST anomaly tendency (J4, R = −0.47), equatorial south east Indian Ocean SST anomaly (A2/J2, R = 0.44), and Equatorial Pacific upper ocean heat content (A6, R = −0.40). It suggests that NPDI has some connection with Indo-Pacific SST variability, which does not mean that it is not as well the expression of internal extratropical atmospheric dynamics. To discuss the added value of our new ISMR predictor, we have compared different multiple linear regression schemes using a leave-one out cross validation over the period 1958–2005. This method consists in comparing observed ISMR values with those calculated from the regression schemes based successively on all years except the forecast year. For each stage, this comparison has been made between all possible combinations from the pool of one to seven predictors shown in Table 1.
Table 1. Details of Predictors Used for the First Stage Forecast and for the Second Stage Forecast of ISMR and Comparison With the NPDI Predictorsa
CC With ISMR (1958–2005)
First stage forecast, predictors A; second stage forecast, predictors J.
 To assess the model's skill, we have used the correlation coefficient R between the 38 predicted and observed ISMR values, the Root-Mean-Square-Error (RMSE) and a generalized cross-validation (GCV) function computed as given below: with Y′ the model forecast, Y the observed value, n the number of years and p the number of predictors. GCV is nearly equal to the square of the RMSE with a correction for the number of predictors used in the model. Table 2 summarizes the two best models for each stage, with and without the NPDI as a predictor (A7 and J7).
Table 2. Comparison of the Two Better Models of ISMR Prediction Over the Period 1958–2005a
With and without the NPDI (A7 and J7), for the two stage of forecasting. The results are obtained by multiple linear regression realized with a leave-one-out cross validation.
Stage 1 (Mid of April)
Model 1: A1-A2-A3-A4-A5
Model 2: A1-A3-A4-A5-A7
Stage 2 (End of June)
Model 3: J1-J2-J5-J6
Model 4: J3-J5-J6-J7
 For the second stage, the ISMR prediction is considerably improved with the correlation coefficient between observed and predicted values increased by 0.08, i.e., an increase of more than 10% of explained variance with the addition of the NPDI in the list of potential predictors. For the first stage the improvement is less pronounced (about 7% increase of explained variance). Figure 3 shows the ISMR time series for 1958–2005 and the predicted values for the two forecasting stages. An interesting feature of the models integrating the NPDI predictor is a better ability to capture extreme values of monsoon rainfall, and in particular the droughts of 2002 and 2004 which were not predicted by operational statistical models.
4. Discussion and Conclusions
 This study highlights the lead correlation between a winter-to-spring PNA-related index and the following Indian summer monsoon rainfall during recent decades. Our index is computed several months before the monsoon onset and may therefore be useful for operational seasonal forecasting.
 Understanding the physical mechanisms that control the interannual variability of the PNA-monsoon system remains a difficult challenge and will require further investigation. This relationship seems partly associated with the ENSO variability and with its influence on Indian monsoon. However, the NPDI-ISMR relationship remains significant after the removal of ENSO effects through simple linear regression, suggesting that the NPDI influence on the summer Indian monsoon has another pathway. In line with the results of Peings and Douville , a contribution of the eastern Eurasian snow cover has been proposed and will be tested in a modelling study with the CNRM atmospheric GCM.
 Finally, the implications of our new monsoon predictor for the statistical prediction of ISMR have been briefly explored. In keeping with the two stages of the IMD operational seasonal forecasting system, a multiple linear regression scheme has been tested, which is based on a pool of seven potential predictors (including the NPDI). A simple cross-validation approach has been applied over the period 1958–2005 for all possible models. The results show that including the NPDI in the pool of predictors yields better skill, with an increase of about 7% in the explained variance of ISMR for the first stage, and 10 % for the second forecast stage. This increase has been obtained over a relatively long period (1958–2005) and the new regression improves in particular the hindcast of the 2002 and 2004 deficient monsoon seasons, which were poorly predicted by the operational IMD forecasting system. Note however that a single model approach has been used to assess the relevance of our NPDI predictor and that its potential contribution to a multi-model [Rajeevan et al., 2006; Sahai et al., 2008] empirical forecasting system remains to be evaluated.
 Moreover, the PNA-monsoon relationship is not stable over the entire 20th century. The strengthening of the NPDI variability observed since the 1950's could be an explanation for the increasing influence of the PNA oscillation on both Indian monsoon and ENSO, but this needs further analysis.
 We are thankful to Alexander Gershunov, Aurélien Ribes and anonymous reviewers for their constructive comments and suggestions to improve the quality of the paper.