The simulations of the fifth Coupled Models Intercomparison Project (CMIP5) strengthen previous assessments of a substantial role of anthropogenic emissions in driving precipitation changes in the Sahel, the semiarid region at the southern edge of the Sahara. Historical simulations can capture the magnitude of the centennial Sahel drying over the span of the 20th century and confirm that anthropogenic forcings have contributed substantially to it. Yet, the models do not reproduce the amplitude of observed oscillations at multidecadal timescales, suggesting that either oscillations in the forcing or the strength of natural variability are underestimated. Projections for Sahel rainfall are less robust than the 20th century hindcast and outlier projections persist, but overall the CMIP5 models confirm the CMIP3 results in many details and reaffirm the prediction of a rainy season that is more feeble at its start, especially in West Africa, and more abundant at its core across the entire Sahel. Out of 20 models, four buck this consensus. Idealized simulations from a subset of the CMIP5 ensemble—simulations designed to separate the fast land-atmosphere response to increased greenhouse gases (GHGs) from the slow response mediated through changes in sea surface temperature (SST)—confirm that the direct effect of CO2 is to enhance the monsoon, while warmer SST induce drying over the Sahel. At the same time, these simulations suggest that the seasonal evolution of the rainfall trends in the scenario simulations, spring drying and fall wetting, is an inherently coupled response, not captured by the linear superposition of the fast and slow response to CO2.
 Rainfall in the Sahel has experienced substantial multidecadal swings and an overall reduction during the course of the 20th century (Figure 1). Paleoclimate records indicate that North Africa has suffered prolonged droughts before [Nicholson, 1978; Brooks, 1998; Shanahan et al., 2009], but whether the remarkable drying of the Sahel over the course of the 20th century was natural or manmade is an open question. Recent pluvials and drought in the Sahel have been mostly controlled by concomitant variations in sea surface temperature (SST, for example: Giannini et al. ; Hagos and Cook ), but the natural or anthropogenic origin of these is debated (e.g. Zhang et al. ; Ting et al. ; Mohino et al. ; Booth et al. ).
Biasutti and Giannini  noted that the CMIP3 models reproduced a centennial drying in the Sahel and interpreted the consensus across models to indicate that anthropogenic forcing had caused about a third of the long-term 20th century drying. Ackerley et al.  used a perturbed physics ensemble to come to a stronger conclusion: that most of the drying between 1940 and 1980 was due to anthropogenic reflective aerosols, via their influence on the SST gradient in the Atlantic. Yet, the strength of the indirect aerosol forcing is controversial [Stevens and Feingold, 2009], and dismissing a strong role for natural variability is premature [Ting et al., 2009; Biasutti, 2011; Evan, 2012], especially at timescales less than a century, at which the influence of the Atlantic Multidecadal Variability might be dominant [Hoerling et al., 2006; Zhang et al., 2006; Mohino et al., 2011]. Similarly, the most recent recovery of the rains [Nicholson et al., 2000] might be simply natural variability [Mohino et al., 2011] or a forced response to increased greenhouse gases (GHGs) [Haarsma et al., 2005] or reduced aerosols [Ackerley et al., 2011]. Attribution is confounded by the lack of consensus across the CMIP3 models on future trends in summertime rainfall [Held et al., 2005; Cook and Vizy, 2006; Biasutti et al., 2008]. The profound model biases in both climatology and modes of coupled rainfall/SST variability [for example, Cook and Vizy, 2006; Joly et al., 2007] also limit our confidence in a quantitative attribution of the past drought to any one forcing, and some studies have noted that SST alone is insufficient to force the full amplitude of Sahel variations in most uncoupled models (e.g., Scaife et al. ) and that Sahel rainfall is sensitive to vegetation and dust processes that are not routinely included in coupled models [Zeng et al., 1999; Yoshioka et al., 2007; Kucharski et al., 2012].
 Here we show that the newest generation of coupled models, those participating in CMIP5, reaffirms the results of CMIP3. In particular, we find that historical simulations accurately capture the centennial drying trend, even as they fail to fully represent the multidecadal oscillations. We also show that, while some uncertainty in future summertime projections persists in the CMIP5 ensemble, there is qualitative agreement with CMIP3 in both the pattern and the seasonal evolution of the rainfall anomalies. This agreement lends credence to the multimodel mean projection of a rainy season that starts out more feebly, but becomes more intense at its core and end. The CMIP5 suite of simulations comprises idealized runs that can distinguish the role of SST warming from the direct effect of CO2 on the land-atmosphere climate; these are analyzed to understand the origin of the Sahel rainfall anomalies in scenario simulations.
 We use four sets of simulation from the CMIP5 archives [Taylor et al., 2012]: preindustrial, historical, scenario, and idealized simulations. Preindustrial control simulations provide a baseline for the natural variability in the climate system, in the absence of any anthropogenic forcings. Historical simulations are forced by best estimates of historical natural forcings (e.g., volcanoes and solar irradiance) and anthropogenic emissions. Because the CMIP5 ensemble includes Earth System Models with interactive carbon cycle and atmospheric chemistry alongside simpler ocean-atmosphere-coupled models, the concentration and distribution of radiatively active gases and aerosols are not the same in all historical simulations [Friedlingstein et al., 2006]. Similarly, different treatments of the interaction of aerosols and clouds lead to different indirect aerosol forcing. We start our analysis of historical runs from 1900, when observations over Africa are available. The RCP8.5 scenario is used for the reminder of the 21st century, due to the greater number of available simulations. As described in Riahi et al. , “RCP8.5 depicts a relatively conservative business as usual case with low income, high population and high energy demand due to only modest improvements in energy intensity.” This is an extreme scenario, with a continuous rise in radiative forcing during the 21st century, leading to a value of about 8.5 Wm−2 in 2100; such growth in radiative forcing is achieved mostly by a tripling of carbon emission within this century, while sulfate emissions are cut to a quarter. Emission scenarios for a number of other GHGs and atmospheric pollutants, as well as estimates of future changes in land use, are also specified. Irrespective of how models treat chemically active species and the carbon cycle, the GHG forcing grows fast and dominates over the effect of aerosols by the end of the century although the response to changes in aerosols are far from being negligible for regional climate [see Levy et al., 2008; Levy et al., 2012]. Idealized simulations that only include forcing from increasing CO2 concentrations (see section 4) supports the assumption that GHGs affect much of the Sahel rainfall changes seen in the scenario simulations.
 Two versions of the University of East Anglia Climate Research Unit gridded monthly precipitation datasets are used to assess trends in observations. The Hulme dataset [Hulme, 1992] covers 1900–1998 and TS3p1 [Mitchell and Jones, 2005] covers 1901–2009. The latter dataset optimizes for comprehensiveness and not stability, but the two sources agree well among themselves and with a simple arithmetic average of available stations from the Global Historical Climatological Network [Peterson and Vose, 1997], see Figure 1. Of course, the observations going in any of these products are the same, so it is possible that all datasets are biased at the beginning of the century, when the station data are extremely scarce (Figure 1). One source of confidence in the observations comes, remarkably, from GCM simulations forced by the observed SST, which duplicate the observed record even in the early decades of the 20th century (Isaac Held, personal communication and figure posted on http://www.gfdl.noaa.gov/isaac-held-homepage).
3 Historical Trends and Projections
 Figure 2 shows summertime (July, August, September, JAS) mean rainfall trends over the historical and scenario runs in 20 models (when possible, a model is represented by the mean of its ensemble runs). Where three quarters or more of the models simulate changes of the same sign, agreement across the ensemble is indicated by stippling. As did CMIP3, the CMIP5 ensemble captures the large-scale drying of the entire Sahel during the 20th century both in the mean and in most individual models (Figure 2a), although the drying pattern is displaced to the south compared to observations and much weaker than observed, especially on the mountains of Senegal. On average, the simulated Sahel rainfall index suggests little change before 1940, a rather linear drying between the 1940s and the 1980s and a subsequent recovery (Figure 2c). This progression matches observations quite well in a qualitative sense, but not in the magnitude of the interdecadal variations; in particular, neither the pluvial of the 1950s nor the severity of the drought in the 1970s and 1980s are captured in the multimodel mean. If internal multidecadal variability of SST were the dominant cause for these oscillations [Ting et al., 2009], the multimodel mean would not and should not capture them. On the other hand, the mismatch could be due to an underestimation of oscillations in the forcing, for example in the effect of aerosols [Ackerley et al., 2011; Booth et al., 2012]. One can find within the ensemble an individual integration matching observations, but in general, most simulations underestimate the strength of the observed multidecadal trends (see the discussion of Figure 7 below).
 Both the multimodel mean and a majority of models project a wetter Sahel everywhere but along the Atlantic coast (Figure 2b). The intermodel agreement is significant enough to lend credence to the mean projection, but is not as good as in the historical simulation. This can be seen in the intermodel standard deviation for the Sahel index, which remains constant in the historical simulations but grows by two thirds by the end of the 21st century, indicating growing disagreement (Figure 2c).
 Disagreement in Sahel rainfall projections was present in CMIP3 and is yet to be fully explained; this was a true disagreement rather than an indication that anthropogenic trends in the region will be negligible [Power et al., 2012]. It should not be a surprise, then, that similar disagreements would surface in CMIP5 or even be amplified, given the additional uncertainty in the projected radiative forcings. When the individual model trends are shown against the typical range of comparable trends in the preindustrial controls (Figure 3), they confirm that positive and negative projections are both significant. Thus, as in CMIP3, we do not predict negligible trends in the Sahel. Instead, we see substantial, but opposite, responses in different models: half of the models show a clear wetting, a quarter show clear drying and the rest shows no significant trends. For comparison, only two out of 20 models project a significant wetting trend in the 20th century (see Figure 3 and Figure 7).
 The disagreement across models is reduced, but not eliminated, in the months leading in and out of the rainy season. Biasutti et al.  and Biasutti and Sobel  had noticed that the seasonality of rainfall trends in the 20th and 21st centuries differed: while the drying trend of the past was consistent throughout the rainy season (in both observations and the CMIP3 models), the future projections were better interpreted as a delay in the rainy season, with negative trends in the onset months (June–July, JJ) and positive trends in the demise months (September–October, SO). The same seasonality is duplicated in CMIP5, as seen in the rainfall trends in each calendar month in the historical and scenario simulations (Figure 4). There is enough noise in these simulations that both positive and negative trends are simulated for all calendar months in both epochs. Yet, it is clear that the 20th century is characterized by overall negative trends (mimicking the shape of the rainy season, the negative trends peak in August), while the 21st century sees qualitatively different trends in the first half of the year (negative anomalies peaking in June and July) than in the second half (positive anomalies peaking in September). The negative anomalies in JJ are comparable to those seen in the 20th century, while the positive anomalies in SO are much larger. The agreement in JJ and SO projections is still not as good as it was for summertime trends in the 20th century: the growing spread across models plotted in Figure 2 marks the onset and demise seasons as well (not shown), but this is ameliorated for subregions of the Sahel.
 The difference in the spatial extent of the anomalies coming in and out of the rainy season is evident in Figure 5. The negative JJ anomalies are concentrated in West Africa, while the positive SO anomalies span the Sahel. When we define the West and Central-Eastern Sahel region to coincide with the regions of best agreement in the JJ and SO trend maps, we find that the sign of the centennial trends is quite robust (Figure 6): only three models (from two modeling centers) show non-negligible trends bucking the dry consensus in West Africa in JJ, and only one bucks the wet consensus in Central-Eastern Africa in SO. Still, a JJ projection that ranges from 80% decline to 30% increase (and vice versa for SO) is far from a complete success.
 One measure of progress from the CMIP3 archive is in the models' ability to simulate the right amplitude of the observed centennial trend (Figure 3a), but can they also capture the magnitude of the decadal and multidecadal variations? Figure 7 compares the distributions of 25-year, 50-year and 95-year trends in observations and in three sets of model integrations: the preindustrial controls, the 20th century historical simulations, and the 21st century RCP8.5 projections (running windows over the available period in each integration were used to calculate the individual trends). In the top panels, we compare the observations to the historical and control runs; the histograms are not normalized and are plotted on a semi-log scale to emphasize the low counts. The blue and gray lines to the left of each plot indicate the one-in-a-hundred trend in the simulations and help put the observed trends in context. The observations show a predominance of drying trends at all timescales. The observed centennial trends (Figure 7c) fall well within the distribution for the historical runs, an indication that the forced integrations can capture the observed magnitude quite well; instead, trends of the observed magnitude are rarer than one in a hundred in the absence of external forcings. At shorter timescales, the models underestimate the strength of the observed multidecadal trends, so that strong drying trends are much more likely in the observations than in the model simulations for both the preindustrial and the 20th century periods.
 In the bottom panels of Figure 7, we compare the distributions for the 20th and 21st century. At the shortest timescale of 25 years, the two are very similar, with only a slight preference for drying trends in the 20th century and wetting trends in the 21st. At longer timescales, the forced trends become more prominent, and the two distributions diverge. The histogram for the 20th century simulations narrows, consistent with the expectation of more muted variability at the longest timescales and becomes centered at around −5%, about a quarter of the observed 20th century trend. Conversely, the histograms for the projected 50- and 95-year trends spread out, reflecting model disagreement and increased uncertainty.
4 Pathways of CO2 Influence on the Sahel
 The cause for disagreement in projections of total rainy season rainfall has been investigated at great depth in the context of the CMIP3 simulations, and it remains unclear. Attempts to discard or accept different projections based on the quality of the models' historical simulations [for example, Cook and Vizy, 2006; Lau et al., 2005] have not been successful, consistent with similar experience in attempts to constrain climate sensitivity, for example [Klocke et al., 2011], and with theoretical work [Weigel et al., 2010]. Quite surprisingly, given how SST changes drove the 20th century Sahel rainfall changes, different trends in SST over the tropical Atlantic and Indo-Pacific oceans (or, equivalently, free-troposphere temperature in the tropics) cannot account for different trends in rainfall, at least not if we require that the same rainfall/SST relationship holds at interannual and centennial timescales [Biasutti et al., 2008; Caminade and Terray, 2010]. While current work has proposed that the relative warming of the North Atlantic (with respect to the tropical mean) can explain more than a quarter of the intra-ensemble scatter (Alessandra Giannini, personal communication), several studies [Patricola and Cook, 2010; Giannini, 2010; Skinner et al., 2012] have shown that the (fast) response of the land-atmosphere system to the local enhanced GHG forcing is as important as the (slow) response mediated through the warming of SST, and different sensitivities to each process should be considered in explaining the intra-ensemble differences in the projections.
 Idealized simulations from the CMIP5 archive can perhaps help resolve this puzzle. In this section, we use three ensemble sets of idealized simulations (available only for a subset of CMIP5 models, see Figure 9). In the first set (1%to4X), carbon dioxide concentration grows by 1% per year, doubling the preindustrial value at year 70, quadrupling by year 140, and remaining constant thereafter; to the extent that short-lived species' influence on climate is weaker than that of CO2, we expect the idealized coupled simulations to produce a climate similar to the scenario simulations. The second set of simulations (SSTclim4X) is uncoupled and is designed to extract the fast response of the atmosphere to CO2 increase: we compare control simulations that are forced by preindustrial climatological SST and preindustrial CO2 concentrations to experiments in which the CO2 concentration is quadrupled, but SST is left unchanged. The third set (Abrupt4X) comprises coupled simulations in which the initial state is preindustrial, but CO2 concentrations are held fixed at four times the preindustrial value; trends in these simulation capture the slow climate response to CO2, as the SST equilibrates to radiative forcing from the heightened CO2 concentration.
 Figure 8a compares the changes in rainfall in the 1%to4X simulations with those in the RCP8.5 simulations over the extended rainy season (May through October); each dot represents the linear trend over 140 and 95 years (for 1%to4X and RCP8.5, respectively) for a calendar month and CMIP5 model. In the aggregate, there is a clear linear relationship between trends in these two sets of simulation, with small scatter, and with small departure from the 1:1 line. This is a strong indication that other forcings in the scenario simulation are less important than the well-mixed GHGs in setting the rainfall response in the Sahel.
 Having established that the idealized simulation is a good stand in for the scenario simulation, we can try to decompose the full response of the coupled system to increasing CO2 into two components: the fast land-atmosphere response to increasing GHG concentrations and the slow response to the warming SST. Such linear decomposition is not achieved, as the fast and slow responses do not add up to the full coupled response. This can be seen in two measures, shown in Figure 8b and Figure 9. Figure 8b compares the Sahel precipitation monthly trends in the 1%to4X simulations to the sum of the fast and slow response estimated from SSTclim4X and Abrupt4X simulations, respectively. The sum of the two components does not fall on the 1:1 line, instead, the full coupled response is much stronger than the sum of the fast and slow response and it does not vanish when the latter do (the linear fits for individual months or the full year do not go through the origin).
 One would not necessarily expect a perfect correspondence, given the way the individual components of the response are estimated; specifically, uncoupled integrations can provide biased estimates of the coupled response, for example Douville , and internal variability is likely to muddle our estimate of the response, especially in coupled integrations, for example Deser et al. . But even if we can accept both the departures from the identity line and the indifferent correlation, our impression that a linear decomposition is indeed problematic is supported by the annual cycle of the trends. These are shown in Figure 9 for the individual idealized simulations and for the sum of the fast and slow responses. When CO2 increases are taken in isolation, with no attendant SST changes, the response is a robust wetting of the Sahel throughout the year, with positive anomalies peaking in August. This positive response is consistent with the importance of enhanced land-sea temperature contrast [Haarsma et al., 2005] and enhanced energy input [Giannini, 2010] in boosting monsoon rainfall. When the response to SST warming is seen in isolation, the response is a robust drying of the Sahel that also persists throughout the year and peaks at the core of the rainy season, in August. This negative response is also consistent with some literature that has suggested remote drying outside of the main oceanic convective centers [Neelin et al., 2003] due to the increased tropical stability that accompanies a uniform warming of the tropical oceans [Held and Soden, 2006]. All else equal, warmer and moister tropics would support enhanced precipitation, so the negative anomalies in the Sahel must be the result of dominant changes in the regional circulation and relative humidity.
 Contrary to a presumption of linearity, the CMIP5 ensemble suggests that these mechanisms interact in a complex way in the fully coupled system. The slow and fast response to CO2 both peak in August, and their linear superposition shows moderate wetting throughout the rainy season. Thus, the seasonal signature of the trends in the 1%to4X and RCP scenario simulations—namely, the fact that early season drying is turned around into a late season wetting of the Sahel—cannot be easily explained by the two mechanisms being dominant at different times of year. One caveat is that we do not have all the necessary idealized simulations to perform this decomposition on the entire CMIP5 ensemble. It would be a welcome development if, when more integrations become available, the current interpretation will be challenged and a simple linear superposition were to suffice as an explanation. The nature of the non-linear interaction between the fast and slow responses to CO2 quadrupling is unclear, but we speculate that the mutual influences of land and ocean that are important in creating the observed annual cycle of surface temperature and precipitation [Li and Philander, 1997; Fu et al., 2001; Biasutti et al., 2003; Biasutti et al., 2005; Dwyer et al., 2012] are also important in its modulation by increased GHGs. This hypothesis is the focus of current research.
 The CMIP5 generation of coupled models lends strong confirmation to previous assessment of the anthropogenic contribution to past drought in the Sahel and to the expectation of future changes in the seasonality of rainfall. Yet, questions of great consequence remain open. First: why do most models underestimate multidecadal oscillations in Sahel rainfall? Second: what is the source of uncertainty in the projections?
 Individual coupled simulations do reproduce the decadal ups and downs of the observed Sahel rainfall: Held et al.  documented the case of one GFDL model, and we see the same for one MIROC model (not shown). But not withstanding these good matches, the weak skewness of the simulated distributions (Figure 7), compared to the prevalence of strong multidecadal drying trends in observations, suggests that something might still be missing in these models. Even setting aside the possibility that multidecadal forcing from reflective and absorbing aerosols is underplayed, there is no shortage of candidates: from dust [Mahowald et al., 2010] to vegetation [Kucharski et al., 2012], to mesoscale organization of rainfall [Taylor et al., 2011]. And some basic processes are obviously not properly represented in these models, as the CMIP5 models stubbornly retain the biases that have plagued previous-generation models in this region [Roehrig et al., 2013]: from the absence of a cold tongue in the Gulf of Guinea [Richter and Xie,2008], to the misplacement of the Sahel rain band [Cook and Vizy, 2006] and the African Easterly Jet [Ruti et al., 2011], to the misrepresentation of African Easterly Waves [Ruti and Dell'Aquila, 2010]. Until we can trust that climate models capture the main structure of the regional climate and the full extent of variability in Sahel rainfall, a more quantitative attribution of past trends to anthropogenic forcings will remain elusive.
 Even as the continuous presence of outlier models and the uncertainty regarding the magnitude of the projected anomalies remain troubling, the agreement across the CMIP5 models (four fifths of the models agree in sign) adds weight to the 21st century projections of a dryer onset season in West Africa and an intensification of the late rainy season throughout the Sahel. The 1%to4X idealized simulations of CMIP5 indicate that these projections are a response to increasing GHGs, with no discernible role for aerosols and short-lived species.
 The switch from dry to wet anomalies as the rainy season progresses has been described as part of a delay in the seasonal march of rainfall in the oceanic ITCZ and monsoon regions and has been linked to seasonal changes in SST [Biasutti and Sobel, 2009] and to the sensitivity of convective margins to changes in tropospheric stability [Lintner and Neelin, 2007; Seth et al., 2010] and surface energy budget [Seth et al., 2010; Giannini, 2010]. In light of our results, these arguments will have to be refined. Idealized simulations designed to distinguish the direct effect of enhanced GHGs concentration (the fast response) and the indirect effect mediated through changes in SST (the slow response) show that each effect changes Sahel rainfall in opposite ways throughout the rainy season, but that the two effects do not add up to a delay of the rains (in the models available for this analysis). We do not doubt that the above-mentioned mechanisms play a role in setting the Sahel anomalies, but we caution that local radiation and remote SST seem to interact in a complex way. We speculate that the mutual influences of land and ocean, and the inherent difference in the phase and amplitude of their annual cycle, are important in determining the projected delay of the Sahel rains in the coupled response.
 We acknowledge the World Climate Research Programme's Working Group on Coupled Modelling, which is responsible for CMIP, and we thank the climate modeling groups for producing and making available their model output. For CMIP the U.S. Department of Energy's Program for Climate Model Diagnosis and Intercomparison provides coordinating support and led development of software infrastructure in partnership with the Global Organization for Earth System Science Portals
 Many heartfelt thanks to Haibo Liu and Naomi Naik for acquiring the datasets and making them easy to work with. This work was supported in part by NOAA (RAPID-NA08OAR4320754), and by NSF (SES-1048946).