MPI-ESM is a new version of the global Earth system model developed at the Max Planck Institute for Meteorology. This paper describes the ocean state and circulation as well as basic aspects of variability in simulations contributing to the fifth phase of the Coupled Model Intercomparison Project (CMIP5). The performance of the ocean/sea-ice model MPIOM, coupled to a new version of the atmosphere model ECHAM6 and modules for land surface and ocean biogeochemistry, is assessed for two model versions with different grid resolution in the ocean. The low-resolution configuration has a nominal resolution of 1.5°, whereas the higher resolution version features a quasiuniform, eddy-permitting global resolution of 0.4°. The paper focuses on important oceanic features, such as surface temperature and salinity, water mass distribution, large-scale circulation, and heat and freshwater transports. In general, these integral quantities are simulated well in comparison with observational estimates, and improvements in comparison with the predecessor system are documented; for example, for tropical variability and sea ice representation. Introducing an eddy-permitting grid configuration in the ocean leads to improvements, in particular, in the representation of interior water mass properties in the Atlantic and in the representation of important ocean currents, such as the Agulhas and Equatorial current systems. In general, however, there are more similarities than differences between the two grid configurations, and several shortcomings, known from earlier versions of the coupled model, prevail.
 The development of models used as tools for a better understanding of climate and environmental changes has evolved into two directions. On the one hand, increased computing power allows for much higher resolution in the major subcomponents ocean and atmosphere. On the other hand, models now include biogeochemical and, in part, chemical processes to study the complex interplay between air, water, and soil. Models including biogeochemical modules and an interactive carbon cycle are referred to as Earth system models (ESM). At the Max Planck Institute for Meteorology (MPI-M), this line of development can be seen in the evolution from the CMIP3 model, which included the European Center-Hamburg (ECHAM5) atmosphere model and the Max Planck Institute Ocean Model (MPIOM) [Jungclaus et al., 2006] to the present MPI-ESM that is documented in this special issue of Journal of Advances in Modeling Earth Systems. First, the implementation of submodules for land surface processes and ocean biogeochemistry allowed for the simulation of an interactive carbon cycle, and the model participated in the Coupled Carbon Cycle Climate Model Intercomparison Project (C4MIP) [Friedlingstein et al., 2006]. Using a coarse-resolution version of this model system, Jungclaus et al.  demonstrated that it is possible to maintain a stable carbon cycle over thousands of years in a control experiment and carried out the first combined carbon cycle and climate simulations over the last millennium with a complex ESM. Toward CMIP5, further development work led to the development of the ECHAM6 atmosphere version, followed by the implementation of a dynamical vegetation model, technical improvements, and configurations with better representation of the stratosphere by enhanced vertical resolution.
 Along with several other manuscripts on various aspects of MPI-ESM in the special issue of Journal of Advances in Modeling Earth Systems, this paper documents elements of the ocean circulation and oceanic properties. The ocean and sea-ice model MPIOM has remained basically unchanged from the previous coupled atmosphere ocean model ECHAM5/MPIOM [Jungclaus et al., 2006]. Also one of the grid configurations, the (now) low-resolution GR1.5, remained the same. The higher-resolution version of MPI-ESM features one of the highest ocean resolutions (0.4°) among all available CMIP5 models.
 We concentrate in this contribution on surface ocean and water mass properties and on integrated variables such as mass and heat transports. These are essential oceanic elements for the general performance of the coupled system. For our model configurations, we assess the effect of improving the horizontal ocean resolution from a noneddy-resolving nominal 1.5° grid to an eddy permitting 0.4°. Additional more general papers deal with the other subcomponents of MPI-ESM: ECHAM6, the atmospheric component of MPI-ESM, is documented by Stevens et al. , the land-component Jena Scheme for Biosphere Atmosphere Coupling in Hamburg (JSBACH) is described by Reick et al. , and the characteristics of the coupled system and basic climate change simulations are reported in M. Giorgetta et al. (Climate change from 1850 to 2100 in MPI-ESM simulations for the Coupled Model Intercomparison Project 5, manuscript submitted to Journal of Advances in Modeling Earth Systems, special issue of The Max Planck Institute for Meteorology Earth System Model).
 Because we here give a broad overview of the performance of the ocean component in the MPI-M CMIP5 simulations, several aspects cannot be described in detail and have been taken up by more dedicated studies as part of the MPI-ESM special issue: Arctic sea-ice evolution under historical and scenario boundary conditions are discussed by Notz et al. , an evaluation of air-sea fluxes in A. Andersson et al. (Evaluation of MPI-ESM ocean surface fluxes, manuscript in preparation for Journal of Advances in Modeling Earth Systems, special issue of The Max Planck Institute for Meteorology Earth System Model, hereinafter referred to as Andersson, in preparation), and a study on land-surface fluxes and river runoff is provided by Hagemann et al. . The ocean biogeochemistry in the CMIP5 experiments as simulated by the Hamburg Ocean Carbon Cycle Model (HAMOCC5) is documented by Ilyina et al. .
 The remainder of this paper is organized as follows. In section 2, MPIOM is described as part of the MPI-ESM. Section 3 describes the experimental configurations for the CMIP5 experiments under consideration, and section 4 explores the capability of MPIOM to represent the observed mean state of the ocean. Section 5 documents aspects of variability in the coupled system, and section 6 discusses in detail the issues of model-dependent representation of key features in the ocean. Section 7 concludes the manuscript with a summary.
 The MPI-ESM consists of the general circulation models for the atmosphere ECHAM6 [Stevens et al., 2013] and for the ocean MPIOM [Maier-Reimer, 1997; Marsland et al., 2003; Jungclaus et al., 2006]. We describe here the model versions MPI-ESM-LR (LR, low resolution) and MPI-ESM-MR (MR, mixed resolution). ECHAM6 is run at T63 horizontal resolution (approximately 1.875° on a Gaussian grid) with 47 vertical levels in MPI-ESM-LR, and 95 levels in MPI-ESM-MR. Both setups resolve the troposphere and the stratosphere up to 0.01 hPa. Details of the ECHAM6 model physics and the development steps relative to the previous version, ECHAM5, can be found in Stevens et al. . The most important changes concern the shortwave radiative transfer, the representation of the land-surface including interactive vegetation, and the representation of the middle atmosphere as part of the default configuration (in CMIP3, ECHAM5 had 31 levels covering the atmosphere to 10 hPa). Changes in the shortwave physics have been combined with a revised surface albedo scheme and new representation of cloud optics. Over the ocean, ECHAM5 treated the surface albedo as constant, whereas ECHAM6 accounts for zenith angle [Stevens et al., 2013]. The albedo of sea ice and snow is calculated in ECHAM6, taking into account processes such as aging of snow and melt ponds on top of the sea ice [Roeckner et al., 2012].
 The carbon cycle comprises the ocean biogeochemistry module HAMOCC5 [Ilyina et al., 2013] and the land surface scheme JSBACH [Reick et al., 2013]. In MPI-ESM, the three-dimensional transport of carbon within the ocean and the atmosphere as well as the exchange between atmosphere and land biosphere are calculated at each time step while the exchange between ocean and atmosphere occurs at the coupling time step. In the particular CMIP5 simulations described here, atmospheric CO2 concentrations are, however, not calculated interactively but prescribed, and fluxes between the compartments are diagnosed. Aspects of the carbon cycle in the MPI-ESM CMIP5 simulations are covered by Schneck et al. (The land contribution to natural CO2 variability on time scales of centuries, manuscript submitted to Journal of Advances in Modeling Earth Systems, special issue of The Max Planck Institute for Meteorology Earth System Model) and Ilyina et al. . Another new feature in MPI-ESM is a module for dynamic vegetation [Brovkin, 2013].
 Ocean and atmosphere are coupled daily without flux adjustments using the Ocean Atmosphere Sea Ice Soil (OASIS3) coupler [Valcke et al., 2003]. Increasing the coupling frequency was not possible in this version of the model due to technical limitations in the hydrological discharge model. In the meantime, this issue has been solved and forthcoming MPI-ESM model version will be able to better represent the diurnal cycle.
 River runoff is treated interactively in the land hydrology module [Hagemann and Dümenil-Gates, 2003] that is embedded in the atmosphere model. The hydrology module applies a higher-resolution (0.5°) grid and contains a river-routing scheme. The resulting freshwater fluxes are interpolated back onto the atmosphere grid and are then passed to the ocean as part of the precipitation field. River discharge is therefore inserted just over the top grid cell of MPIOM. The mass balance of ice sheets is treated in a simplified way, converting precipitation minus evaporation over the glaciers immediately into surface runoff.
 The ocean component, MPIOM, has remained basically unchanged with respect to ECHAM5/MPIOM [Jungclaus et al., 2006], and we summarize here only its main features. Details on the model equations and physical parameterizations can be found in Marsland et al. . MPIOM is a free-surface ocean general circulation model formulated on an Arakawa-C grid in the horizontal and a z-grid in the vertical. It solves the primitive equations with the hydrostatic and Boussinesq approximations. Lateral diffusivity is parameterized by an isopycnal formulation based on Redi , where the diffusion coefficient varies with grid spacing (proportional to 103 m2 s−1 for a 400 km wide grid cell). In addition, tracer transport by unresolved eddies is parameterized following Gent et al. . They formulated the thickness diffusion in terms of an eddy-induced velocity that is added to the tracer advection. In MPIOM, the diffusion coefficient is chosen proportional to the local isopycnal diffusion (250 m2 s−1 for a 400 km wide grid cell). Vertical mixing and diffusion are based on the Richardson-number dependent formulation by Pacanowski and Philander . In addition, turbulent mixing in the ocean mixed layer is assumed to be proportional to the cube of the 10 m wind speed, decaying exponentially with depth and potential density difference to the surface [Marsland et al., 2003]. Advection of scalar tracers is computed with a second-order total variation diminishing scheme following Sweby , and an advective slope-convection scheme is included for a better representation of statically unstable flow over sills and shelves [Marsland et al., 2003]. MPIOM includes a sea-ice model [Wolff et al., 1997; Marsland et al., 2003; Notz et al., 2013] formulated using the viscous-plastic rheology of Hibler . Sea-ice thermodynamics relate changes in sea-ice thickness to a balance of radiant and turbulent atmospheric fluxes, and oceanic heat fluxes. It includes a subgrid-scale treatment of the ice-covered and ice-free part of the grid cell [see Notz et al., 2013]. The effect of snow accumulation on sea ice is included, along with snow-ice transformation. Ice formation and melting are accounted for assuming a constant sea-ice salinity of 5 psu.
 The model versions used here both apply 40 unevenly spaced levels in the vertical, ranging from 12 m near the surface to several hundred meters in the deep ocean where the bottom topography is represented by a partial-step formulation [Wolff et al., 1997]. The first 20 layers are distributed over the upper 700 m. MPIOM applies a conformal mapping grid in the horizontal based on the formulation of Maier-Reimer . Along with the different resolutions (LR and MR), the two versions described here also differ with respect to their conformal mapping grid. One version has two grid poles; this is the GR-grid family described in Marsland et al.  and used, for example, by Haak et al. , Jungclaus et al. [2006, 2008], or Mikolajewicz et al. . The second version uses a newly implemented tripolar (TP) quasi-isotropic grid that has been constructed following Murray . The latter is a combination of a regular Mercator grid south of the Equator and a quasi-isotropic grid to the north of it, in which the North Pole singularity is removed by implementing a line between points over Canada and Siberia. Similar grid configurations can be found in other state-of-the-art ocean and CMIP5 models [e.g., Griffies et al., 2011; Voldoire et al., 2012]. The TP family of grids is available at 1° (TP1), 0.4° (TP04, as described in this paper), and the eddy-resolving 6-min (TP6M) version described by von Storch et al. (Vertical eddy fluxes of heat, salt, and momentum simulated by the 1/10° STORM OGCM, manuscript submitted to Journal of Advances in Modeling Earth Systems, special issue of The Max Planck Institute for Meteorology Earth System Model, hereinafter referred to von Storm, submitted). The GR1.5 grid has two poles, one over Antarctica and one over southern Greenland, and features a nominal resolution of 1.5°, which translates into a grid spacing ranging from 15 km around Greenland to 185 km in the tropical Pacific. The convergence of the mesh size toward the poles thus provides relatively high resolution in the deepwater formation regions around Antarctica and in three regions that are important for northern hemisphere water mass formation: the Labrador Sea, the Nordic Seas, and the Greenland-Scotland Ridge (GSR) with its deepwater conduits [Jungclaus et al., 2008]. However, a further refinement toward eddy-permitting resolution would have led to extremely small grid sizes near Greenland, implying the need for very short time steps. While the GR1.5 grid is clearly non eddy-resolving over the world ocean, the TP04 can be classified as “eddy-permitting” for most regions.
 Finally, we note that another subset of CMIP5 simulations, the paleointegrations for the last glacial maximum, the mid Holocene, and the last millennium [Braconnot et al., 2012] were carried out with another dedicated version of the model, called MPI-ESM-P. This configuration is identical to MPI-ESM-LR with two exceptions: the dynamical vegetation module is not active and the orbital parameters are prescribed rather than calculated from the internal calendar. The representation of oceanic features in the MPI-ESM-P control simulations (also available from the CMIP5 database), such as those documented in this paper, are very similar to the ones in MPI-ESM-LR.
3. Experimental Setup and Spin-Up Procedure
 For each model configuration we have performed, a 1000-year control simulation under preindustrial (1850) boundary conditions. These “PICTRL” runs form the reference for the simulations of the instrumental period (1850–2005, a.k.a. “historical” simulations), future scenarios, and idealized climate change experiments [Taylor et al., 2012]. A three-member ensemble of “historical” simulations is available for both model configurations, and we use the last 25 years (1980–2005) for comparison with recent near-surface observations. For integrated quantities, interior ocean properties, and the assessment of low-frequency variability, we focus our analysis on the well-equilibrated 1000 year-long control simulations.
 Before a coupled control simulation can be started with a “frozen” model system, a spin-up procedure is necessary to bring the model into equilibrium in terms of radiation balance and to minimize the drift, in particular, in the deep ocean. Changes in the atmosphere physics [Stevens et al., 2013], the new land surface and dynamic vegetation schemes, and modified tuning parameters in the coupled system [Mauritsen et al., 2012] imply changes in the atmosphere-ocean fluxes compared to the earlier ECHAM5/MPIOM version of the model. In order to arrive at a coupled simulation with as little long-term drift as possible under the constraint of economic use of the available computer time, we start the spin-up simulation not from climatology or an ocean state estimate, but from existing temperature and salinity fields from the CMIP3 preindustrial control experiment. Ocean temperature and salinity fields were readily available for the LR setup, but had to be interpolated onto the TP04 grid for MPI-ESM-MR. During the early phase of the spin up, additional tuning steps turned out to be necessary and interactive vegetation was only switched on after a relatively stable state had been achieved after several hundred years of integration. The spin-up procedure lasted for more than 1900 years in MPI-ESM-LR and, owing to CPU time limitations, 1500 years in MPI-ESM-MR. In both configurations, at least 500 years were run without any further changes before the start of the control simulations. In contrast, the spin up for the CMIP3 model version was only a few hundred years. In addition to the MPI-ESM CMIP5 simulations we will also refer to MPIOM stand-alone experiments that were carried out in LR, MR, as well as in the eddy-resolving TP6M configurations using atmospheric reanalysis data from the National Center for Environmental Predictions (NCEP) [Kalnay et al., 1996] for the period 1948–2010. In contrast to coupled simulations, the NCEP simulations were started after a spin-up time of only a few decades and are therefore much closer to the initial climatology (see von Storch, submitted).
4. Evaluation of the Mean Ocean State
4.1. Ocean Temperature and Salinity
 After the spin-up procedure, the atmospheric global mean surface temperature exhibits only little drift around the preindustrial target value of 13.7°C, and the top-of-the-atmosphere imbalance is mainly related to small energy leaks in the atmosphere and the coupling interface [Mauritsen et al., 2012]. In the ocean, however, time scales are much longer, and some drift is expected in the deep ocean to prevail over thousands of years.
 We compare simulated temperatures and salinities with those from the Polar Science Center Hydrographic Climatology (PHC3). The PHC3 climatology is a blend of the Levitus et al.  data with an updated data set for the Arctic from Steele et al. . Deviations with respect to PHC3 in the globally averaged temperature and salinity profiles are similar in both model versions and have retained some of the biases inherited from the CMIP3 solution. Serious biases prevail in particular in the intermediate layers of the ocean. The large errors have evolved during the spin-up and reflect the inability of the model to maintain the correct water mass properties. This appears to be a persistent phenomenon in state-of-the-art climate models [e.g., Voldoire et al., 2012; Danabasoglu et al., 2012; Sterl et al., 2011]. In general, the ocean in MPI-ESM gets too warm and saline at intermediate levels and in the deep ocean whereas it is too cold and fresh in the upper layers. For temperature, biases are considerably smaller in MR, compared to LR and CMIP3 (Figure 1a). Salinity errors (Figure 1c) are relatively similar with some larger errors in the depth-range around 1000 m in MR, possibly caused by overly saline intermediate waters originating in the Red Sea (not shown).
 Intermediate-depth biases are often linked to shortcomings in the parameterization of unresolved eddies and/or spurious numerical mixing as an inherent feature of z-level models [e.g., Delworth et al., 2012; Griffies et al., 2000]. In the more detailed discussion of biases in the Atlantic basin following below, we demonstrate that at least part of the problem can be related to an improper representation of the circulation and the interbasin exchange between Indian Ocean and South Atlantic.
 An advantage of our spin-up procedure is that trends over the control runs have been minimized through the long spin-up runs. The linear trends of the order 0.02°C (LR) and 0.05°C (MR) and 0.003 psu per 1000 years (Figures 1b and 1d) are relatively small, compared to those reported from other CMIP5 models [e.g., Sterl et al., 2011; Griffies et al., 2011]. In fact, the trend in the ECHAM5/MPIOM CMIP3 control simulation was larger by almost 1 order of magnitude (Figures 1b and 1d).
 Sea surface temperature (SST) and sea-ice properties are the most important ocean-related quantities in a coupled simulation since they determine the atmosphere-ocean fluxes and their variability. Freshwater fluxes, river runoff, and the resulting sea surface salinities (SSSs) reflect the performance quality of the coupled system. Since the PHC3 climatology represents the modern state of the ocean, we compare the simulated SST from the last 25 years of the historical simulations with the PHC3 climatology (Figure 2). As expected, the present day (1980–2005 average) ensemble means from the historical simulations are warmer than their respective preindustrial control integrations by globally 0.52°C (LR), and 0.58°C (MR) and, therefore, closer to the observed state. Global mean biases for the 1980–2005 average in surface properties and at 1100 m depth relative to the PHC3 data are given in Table 1. The MR model shows generally somewhat smaller errors, both for means and root-mean-square errors (RMSE). The larger mean SST errors in LR can be explained by the slightly colder control state at the beginning of the historical simulation (not shown).
Table 1. Model Minus Observations (PHC3) Area-Weighted Mean and RMSE Differences for SST and SSS, and for the Potential Temperature and Salinity at 1100 m Deptha
Simulated data represent the ensemble mean of the historical simulations for the period 1980–2005. All data were mapped onto a 1° geographical grid prior to the analyses. Temperature and salinity are in °C and psu, respectively.
SST mean bias
SSS mean bias
T1100 mean bias
S1100 mean bias
 At the surface, large regions of the world's ocean show biases of less than 1°, and the MR ensemble displays generally slightly smaller errors than the LR ensemble (Figures 2a and 2b). However, there are systematic bias patterns that prevail in both model setups. The largest SST errors are found in the North Atlantic where both model versions produce a too zonally oriented North Atlantic Current (NAC). The resulting cold bias covering the region between the simulated and observed subtropical gyre margin appears to be a persistent feature in state-of-the-art climate models, where lack of resolution prevents a proper representation of the Gulf Stream separation [Dengg et al., 1996]. The problems in the NAC path may, however, also be related to the representation of the deep western boundary current [Gerdes and Köberle, 1995; Bryan et al., 2007] and the properties and strengths of the overflows across the GSR [Danabasoglu et al., 2010]. Even though the higher resolution in MR allows for a better representation of key features such as the flow through Florida Strait, there is no improvement in the Gulf Stream/NAC position: the cold bias reaches even further to the north where the “northwest corner” should appear, and the MR realization shows a more pronounced warm bias in the Labrador Sea. Both model versions show a similar surface warm bias in the Nordic Seas. It is likely that a lack of heat loss over the subpolar North Atlantic leads to relatively warmer Atlantic waters in the Nordic Seas [Swingedouw et al., 2012].
 The equatorial cold bias in the Pacific, another typical feature of many coupled ocean-atmosphere models, is improved in MR, compared to LR. The area where the bias exceeds 1.5°C is reduced, and the cold tongue does not spread out as far to the west into the Warm Pool. A longstanding problem of coupled models is the proper representation of the upwelling regions at the eastern boundaries of the oceans [e.g., Randall et al., 2007]. Large and Danabasoglu  have investigated these biases in detail and identified a number of interrelated problems typical for a climate model. At relatively coarse resolution, eastern boundary currents and upwelling may be too weak, while erroneous winds and an improper representation of stratocumulus clouds are diagnosed from the atmosphere model. A particular problem is related to the Andes and Sierras, which are not well represented in the T63 atmosphere (see also Griffies et al. ). In our simulations, the biases west of Africa and west of Baja California are very similar in LR and MR so that ocean resolution apparently does not improve the boundary currents and upwelling. On the other hand, biases are much smaller in stand-alone ECHAM6 simulations with prescribed SSTs [Stevens et al., 2013], so that the issue of eastern-boundary SST biases clearly remains a problem of the coupled system. Interestingly, the bias pattern is somewhat different in the southeastern tropical Pacific, off the coast of Peru, where the warm bias in simulated SST is smaller in LR than in MR. This feature is related to differences in the upwelling and is presently the subject of further investigations on resolution-dependent differences in the representation of the equatorial dynamics.
 The Southern Ocean warm bias seen in both realizations can be associated with an overestimation of downward shortwave radiation into the Polar Regions [Stevens et al., 2013]. This feature remains characteristic of most climate models [Griffies et al., 2011; Sterl et al., 2011]. In our simulation, it goes along with an underestimation with sea-ice coverage, which, in turn, contributes to the warm bias.
 There are also considerable differences between the LR and MR simulation in other regions of the Southern Hemisphere: the large cold bias around 40°S in the Indian Ocean sector has largely disappeared in MR due to a more realistic representation of the margin between the subpolar and subtropical gyres. In the western South Atlantic, the error dipole associated with the confluence zone and the pronounced warm maximum at the Brazilian coast near 40°S is reduced in MR, compared to LR. The latter differences, as we show later, are related to different representations of the ocean circulation in the boundary region between Southern Ocean and Atlantic in the different resolutions. The northern part of the Pacific Ocean also exhibits relatively strong biases associated with the Kuroshio position. This feature is less pronounced in MPI-ESM-LR. An additional difference between LR and MR is the typical “horse-shoe” pattern in the eastern North Pacific SST pattern. This bias is less pronounced in MR, possibly related to the different bias patterns in the tropical Pacific. A comparison between two NCEP-forced LR and MR stand-alone ocean simulations (not shown) does not produce such differences, indicating that the North Pacific SSTs in the respective coupled simulations may be influenced by the different vertical resolution in the atmosphere, a slightly different tuning of ECHAM6, differences in equatorial SST, or different representations of atmospheric teleconnections in the MR and LR configurations.
 Sea-surface salinity biases (Figures 2c and 2d) exhibit rather similar patterns in both LR and MR model configurations. Large deviations from the observation-based climatology are found in the Arctic, with generally too fresh conditions near the Siberian coast and with too salty surface waters in the Canadian Arctic and Beaufort Sea. Part of the fresh bias may be explained by an overestimation of runoff from the Eurasian rivers [Hagemann et al., 2013]. The origin of the positive salinity biases in the Canadian Arctic and Beaufort Sea is, however, not clear and may also be related to a summer bias in the observations. In this regard, we also note large (>1 psu) differences between different observation-based data sets, such as PHC3 and the World Ocean Atlas 2009 [Antonov et al., 2010] in high northern latitudes. The problems in simulating the correct path of the NAC lead to a negative salinity bias that resembles the cold SST bias in the North Atlantic. Both model versions show a band of too fresh conditions in the subtropical South Atlantic and too salty conditions in the southwestern equatorial Pacific, both related to the displacements of the respective Intertropical Convergence Zones (ITCZs). In the Pacific, these biases are caused by a combination in the shifts of the ITCZ and the South Pacific convergence zone. While the stand-alone version of ECHAM6 driven with observed SSTs produces the well-known “double” ITCZ with associated excess precipitation in the southwestern tropical Pacific, the coupled model simulates much less precipitation directly at the Equator with the largest biases over the warm pool (Andersson, in preparation). In the tropical Atlantic, the shift in the ITCZ leads to excess precipitation on the eastern side of the basin and too little precipitation over the Amazon. The precipitation biases manifest themselves in insufficient Amazon discharge [see also Hagemann, 2013; Grodsky et al., 2012]. A more comprehensive evaluation with particular focus on precipitation over the oceans is given in an accompanying paper (Andersson, in preparation). For salinity, substantial improvement in MR versus LR occur mainly at the southern exit of the Indian Ocean, also related to the exchange between Indian Ocean and South Atlantic and a more realistic simulation of the position of the northern edge of the Antarctic circumpolar current (ACC) in MR (see section 6.2).
 The magnitude and patterns of the surface temperature and salinity biases shown here for the LR configuration are very similar to those from the predecessor CMIP3 model ECHAM5/MPIOM [Jungclaus et al., 2006]. Therefore, differences between the CMIP3 and CMIP5 model using the same ocean grid appear less pronounced compared to those between LR and MR. This holds in particular for the errors described above for the South Atlantic and the southern part of the Indian Ocean, and for the salinity biases in the tropical South Atlantic.
 Biases in ocean interior water mass structures can be seen in basinwide zonal averages. Errors are typically still large in state-of-the-art ocean models [Griffies et al., 2009] and accumulate even more in coupled simulations that are integrated to quasiequilibrium over long spin-up integrations. The current simulations are no exception, and errors of several degrees centigrade reflect problems in parameterization and resolution. Zonal means for the Atlantic and Arctic Ocean, and for the combined Indo-Pacific basin (Figure 3) reveal that the upper layers of the ocean are generally too cold and fresh, whereas the intermediate layers are too warm and too salty. Deepwater masses produced in the Nordic Seas are too warm and saline, and these deviations propagate with the lower limb of the Atlantic overturning circulation into the intermediate and deep ocean. In the upper layers, SST and SSS biases seen in Figure 2 partly penetrate into the upper few hundred meters.
 In the Atlantic, the most striking error patterns occur at intermediate depth with maxima near 40°S extending into the Northern Hemisphere at the level of the Antarctic intermediate water (AAIW). Similar errors were found in earlier ocean-only simulations with MPIOM [Marsland et al., 2003] and other ocean models [Griffies et al., 2009]. Typically, such errors are thought to be related to incorrect interior circulation and tracer advection as well as unrealistic interior mixing, owing to a lack of proper representation of eddy-induced tracer transport [Griffies et al., 2009]. In the configurations presented here, the pronounced differences in the LR and MR simulations allow for an identification of the source of the bias. In a set of ocean-only simulations (not shown), the error near 40°S and 1000 m depth is established very quickly after the start of these experiments using the GR1.5 grid and can clearly be related to advection of warm and salty water from the Indian Ocean. The respective simulations with the TP04 (eddy permitting) and TP6M (eddy resolving) model configuration demonstrate that the resolution-dependent representation of the Agulhas current system, its retroflection, and the associated transfer of salt and heat (see section 6.2) is responsible for the larger mismatch at lower resolution. In fact, the bias is further reduced in the eddy-resolving TP6M ocean-only run.
 In both configurations displayed in Figure 3, the error patterns stretch out to the north at the depth of the AAIW but are much more pronounced in LR. Moreover, in LR, there is another local maximum in the salinity bias near 30°N that can be associated with the impact of warm and saline Mediterranean waters. While the mass transport of the exchange flow through the Strait of Gibraltar is of similar strength in both setups (Table 2), the Mediterranean water is distributed differently by the large-scale circulation in the Atlantic (not shown). In MR, part of the Mediterranean outflow is flowing northward along the European continental shelf, whereas the warm and saline water masses in LR almost entirely spread out in southwesterly direction into the open Atlantic. The too zonal path of the NAC leads to a cold and fresh bias in the upper ocean between 40 and 50°N. The larger warm bias in the MR simulation around 60°N and to the north of the GSR is not compensated for by salinity errors and leads to reduced densities in the Nordic Seas in this setup. As shown in section 4.6, shallower mixed-layer depths (MLDs) in the MR realization indicate that there is less heat release to the atmosphere in the sinking regions. Since the strength of the overflow across the GSR is determined in part by the density contrast between the basins, the temperature mismatch is probably also responsible for a weaker overturning circulation in the northern North Atlantic in MR (see section 4.3).
Table 2. Transports (Sv) Through Passages From Long-Term Means of the PICTRL Simulations of MPI-ESM-LR and MPI-ESM-MR Together With Observational Estimatesa
If not specified otherwise, the numbers represent the total (depth-integrated) barotropic transport.
 The error patterns in the Indo-Pacific basin (Figures 3 c, 3d, 3g, and 3h) show much less differences between the two model versions. Slight improvement is seen in the MR representation of temperatures in the North Pacific while errors in the tropics appear to be a bit more pronounced. Therefore, while better resolving the exchange flow between Indian Ocean and South Atlantic has improved the biases in the Atlantic, the general problem of middepth biases known from other models, remains. Delworth et al.  compared coupled simulations at roughly 1°, 0.25°, and 0.1° resolution and found a deterioration of the bias magnitude in the eddy-permitting solution. They related the subsurface drift in that model to insufficiently resolved mesoscale eddies (the Gent-McWilliams parameterization was switched off). In our model, retaining GM, though with a relatively low coefficient, has allowed us to arrive at a similar bias in both model versions.
4.2. Large-Scale Ocean Circulation
 The large-scale horizontal circulation is described in terms of the barotropic streamfunction (BSF) in Figure 4. As MPIOM is a free-surface model, BSF is diagnosed from the vertically integrated horizontal velocities at each time step. Both model configurations reproduce the well-known features of the oceanic gyres and large-scale current systems. In the North Atlantic, subpolar gyre maximum values of BSF to the south of Greenland are 37 Sv in MR and 28 Sv in LR (1 Sv = 106 m3 s−1), bracketing the observational estimate of 34 Sv given by Clarke . Both model configurations feature a maximum subtropical gyre BSF of about 50 Sv, comparable to the estimate of 48 Sv by Johns et al. . In the Pacific, the subtropical gyre associated with the Kuroshio has a maximum of about 80 Sv to the south of Japan in the LR and about 60 Sv in MR configurations, whereas estimates derived from altimeter products give about 42 Sv [Imawaki et al., 2001]. However, the peak value in the model is determined by a relatively small recirculation cell. Observations [e.g., Jayne et al., 2009] indicate peak values of the Kuroshio transport of up to 114 Sv. Important differences between the two configurations appear at the boundary between Southern Ocean and South Atlantic and the Agulhas region south of Africa. The Agulhas current is a boundary current that flows westward around the Cape of Good Hope before retroflecting to the south and then recirculating eastward. In the LR configuration, there is much too little retroflection, and the Indian Ocean subtropical gyre extends far into the South Atlantic, forming a huge “super gyre.” The zero line of the BSF to the south of Africa, which forms the boundary between the Agulhas system and the ACC, is found several degrees further to the south so that the Indian Ocean is more connected to the South Atlantic. As a result, the South Atlantic subtropical gyre is much stronger in LR (90 Sv) than in MR (62 Sv) and much higher than in estimates from hydrographic observations such as the ∼30 Sv reported by Peterson and Stramma . The connection between Indian Ocean and South Atlantic is an important component of the global thermohaline circulation [e.g., Gordon, 1986] and has received increased attention in recent years [e.g., Beal et al., 2011]. The resolution-dependent differences in the LR and MR configurations will therefore be further explored in section 6.2.
 In the western South Atlantic, the Brazil current separation and the Malvinas confluence is also affected by the different northward extensions of the ACC in LR and MR. The reduced SST bias here (Figure 2) indicates a better representation in MR compared to LR.
 As shown in Table 2, the total depth-integrated transports through important passages are largely simulated within the uncertainty of the observational estimates. The transport of relatively fresh water from the Pacific into the Arctic is one fundamental ingredient for the freshwater budget of the Arctic and the Atlantic Ocean. The two model solutions (0.7 Sv in LR and 1 Sv in MR) bracket the observational estimate of 0.8 Sv, even though the channel, as represented by one grid box in the Arakawa-C grid, is too wide in the LR configuration. The Nordic Seas exchange water masses with the Atlantic through the different openings of the GSR. The total flux through Denmark Strait appears to be slightly overestimated in comparison with observations. The combined depth-integrated flow between Iceland and Scotland is close to 5 Sv in both models, consistent with the transport scheme provided by Hansen et al. [2008, Figure 1.12]. However, the distribution between the openings to the east and west of the Faroe Island is realized somewhat differently in the two model setups. While the transport is almost equally divided between the two openings in LR, this is not the case in MR where we find much higher transports between Iceland and the Faroes (not shown). The latter indicates different pathways of the Atlantic Water branches that are likely caused by the slightly different Subpolar Gyre geometries. Going further south in the Atlantic, the GR1.5 grid does not contain a Florida Strait (Figure 4), whereas, in the TP04 grid, it is possible to obtain a transport between Florida and the Bahamas of 16 Sv. Data from the RAPID program monitored a transport of 31.7 Sv using a submarine cable between Florida and the Bahamas [Kanzow et al., 2010]. In the model, however, the distribution of the flow to the east and west of the Bahamas is not distinguishable.
 The Indonesian throughflow forms another important part of the global thermohaline circulation and the models simulate only slightly higher transports (16.9 Sv in LR and 15.7 Sv in MR) than the observations (11.6–15.7 Sv). Although there are large differences between the Indian Ocean subtropical gyre systems in the two setups, the simulated flows through the Mozambique Channel (25 Sv in LR and 19 Sv in MR) are in agreement with the range of observations (5–26 Sv) [DiMarco et al., 2002]. The Drake Passage transport is clearly too strong compared to observations; it reads 169 Sv in LR and 185 Sv in MR. Cunningham et al.  gave a value of 137 ± 8 Sv and Nowlin and Klinck  estimated 134 Sv with an uncertainty range of 14 Sv. Overestimating the Drake Passage transport may be caused by insufficient representation of the eddy dynamics in the ACC but could also be related to overly strong winds or errors in the representation of water masses. Ocean-only experiments driven by NCEP atmospheric forcing overestimate the Drake Passage transport even more: both LR and MR configuration simulate transports of almost 200 Sv. Probably therefore, the too strong ACC is caused by an overestimation of the density gradient across the ACC. Gent et al.  have identified several ingredients setting the ACC strength in a model, among them bottom topography and the choice of the eddy and background diffusivity. However, more focused sensitivity experiments would be necessary to explain the fact that the MR model produces a stronger ACC. We note that the ACC strength in ECHAM5/MPIOM (165 Sv) was very similar, indicating no improvement between the CMIP3 and CMIP5 realizations.
 We also include in Table 2 the flow of relatively dense waters that form the overflows across the GSR. The σΘ = 27.8 isopycnal is traditionally taken to define the interface of the dense overflow waters crossing the GSR mainly through Denmark Strait and the Faroe Bank Channel. We diagnose a considerably stronger overflow in Denmark Strait in the LR (3.7 Sv) compared to the MR (2.5 Sv) configuration. While both results bracket the often quoted estimate of 2.9 Sv by Ross , the higher transport in the LR model agrees better with the more recent measurements of Macrander et al.  who diagnosed a mean transport of 3–3.5 Sv from a 5 year long campaign. In the Faroe Bank Channel, both models simulate an overflow transport exceeding 3 Sv (LR 3.1 Sv; MR 3.5 Sv), a bit higher than the observational estimate of about 2 Sv [Hansen and Østerhus, 2007]. However, the simulated overflow to the east of Iceland occurs mainly in the Faroe Bank Channel, whereas observations suggest an additional 1 Sv contribution from various small channels between Iceland and the Faroes [Dickson and Brown, 1994]. Thus, the total strength of the outflow of dense water from the Nordic Seas between Iceland and Scotland is simulated relatively well. Comparing the overflow transports with those from ocean-only experiments (not shown) confirms that MPIOM produces considerably less overflow in Denmark Strait in the MR setup compared with the LR configuration even if the properties of the northern source waters are similar. This points to a better representation of the overflow conduit in the LR configuration. In terms of grid resolution, the LR (GR1.5) model, owing to the proximity of the northern grid pole over Greenland [see Jungclaus et al., 2008, Figure 22.3], resolves the Denmark Strait better than the more homogeneous MR (TP04) configuration, whereas the region to the east of the Faroe Islands is represented rather similarly.
4.3. Meridional Overturning Circulation
 The large-scale meridional overturning circulation (MOC) is an important component of the climate system as a carrier of heat and freshwater. In the Atlantic, the MOC is responsible for the northward heat transport throughout the basin. The MOC circulation is represented by the clockwise rotating North Atlantic Deep Water (NADW) cell and the counterclockwise rotating Antarctic Bottom Water (AABW) cell. Both LR and MR (Figure 5) configurations exhibit a similar structure with the zero line between the two cells at approximately 3000 m, a maximum of the NADW cell between 35°N and 40°N and an extension of the clockwise cell into the Nordic Seas (Figure 5). The simulated streamfunctions also resemble those from the CMIP3 model [Jungclaus et al., 2006, Figure 6] with slight improvement in the AABW strength. Generally, MPI-ESM-MR features a weaker NADW and a stronger AABW cell. The simulated maxima of the NADW cell at 26.5°N in the North Atlantic read 19 Sv for LR and 16.7 Sv for MR at 1000 m depth and bracket the observational estimate of 18.7 Sv obtained from the 4 year RAPID campaign [Kanzow et al., 2010]. The data from the RAPID array indicate also a similar depth of the NADW maximum, but the boundary between NADW and AABW cell is considerably deeper (between 3500 and 4000 m) in the observations, indicating that NADW is not dense enough in the simulations. Danabasoglu et al. [2010, 2012] found an improved depth profile for experiments including an overflow parameterization, which would indicate that the water mass properties originating in the Nordic Seas are essential for setting the depth of the Deep Western Boundary Current. The maximum strength of the AABW cell in the RAPID data at 26.5°N [Kanzow et al., 2010, Figure 5] is about −2 Sv, where our simulations show −2.8 Sv in LR and −3.4 Sv in MR. In the South Atlantic, the stronger inflow of AABW as simulated by MPI-ESM-MR appears to be more in accordance with the inverse-model result by Ganachaud and Wunsch  of about 6 Sv. Talley et al.  calculate about 8 Sv from absolute geostrophic velocities based on hydrographic data. Both estimates come, however, with relatively large uncertainty owing to sampling problems.
 The comparison of the maximum of the NADW cell as a function of latitude of the two model setups (not shown) confirms that the strength of the overturning is weaker in MR throughout the Atlantic up to ∼60°N. NADW is to a large part fed by the overflow across the GSR [Dickson and Brown, 1994]. As described in the previous section, the MR configuration produces less overflow transport, in particular, in Denmark Strait. Moreover, a more detailed inspection of overflow water mass properties reveals that the overflows are also slightly less dense in MR (see above). After leaving the overflow conduits, the dense waters descend on the continental slopes and entrain considerable amounts of ambient waters, roughly doubling their volume transport in idealized overflow models. According to Price and Baringer , the amount of entrainment depends on the flow velocity and the density contrast in a way that a denser overflow entrains more ambient fluid. Therefore, the weaker overturning in the MR configuration can in part be explained by differences in the water mass properties that form the dense water reservoir in the Nordic Seas. The other part seems to be related to a different representation of Denmark Strait in the respective grids. The globally lower-resolution model gives a better representation of the dense overflow in terms of transport in this particular region of grid-size convergence.
4.4. Heat and Freshwater Transports
 Another key quantity is the meridional ocean heat transport. In both models, the implied global heat transports (Figure 6a), obtained from the residual atmospheric heat fluxes, agree well with estimates that are based on satellite retrievals of adjusted radiances and atmospheric reanalyses [Trenberth and Fasullo, 2008; Fasullo and Trenberth, 2008]. In the Atlantic, the MPI-ESM-LR heat transport (max 1.1 PW, 1PW =1015W) exceeds this observation-based estimate with its maximum slightly below 1 PW. However, Ganachaud and Wunsch  report 1.27 ± 0.15 PW from inverse calculations and the recent estimate from the RAPID campaign reads 1.3 PW [Johns et al., 2011]. The maximal transport in MPI-ESM-MR is about 0.95 PW, somewhat smaller than in LR. Interestingly, this does not appear to be related only to the weaker overturning cell. Decomposing the advective heat transports into the zonally averaged (MOC) and deviation-from-the-zonal-average (gyre) components for the Atlantic (Figure 6c), shows nearly identical MOC components between 20°N and 35°N and a weaker gyre contribution in MR.
 The global freshwater transports, as implied from the atmosphere to ocean fluxes (Figure 6b), are in reasonable agreement with observational estimates [Wijffels et al., 2001]. As the latter are sparse and associated with large uncertainties, we can deduce from the observations mainly the latitudes of sign changes and approximate maxima and minima. Overall, we see a slightly better agreement compared to the ECHAM5/MPIOM model [Jungclaus et al., 2006, Figure 7]. Differences between the two CMIP5 configurations are small. For most of the Atlantic, this holds also for the MOC and gyre components of the advective transports (Figure 6d). Interestingly, there is a deviation again at the southern boundary of the Atlantic. Both model configurations agree on a total freshwater import into the South Atlantic of about 0.2 Sv, consistent with the notion of the Atlantic as an evaporative basin, but the relative contribution of the subcomponents is different. At 34°S, the overturning component is clearly negative in MR, whereas it is close to zero in LR and the gyre components differ accordingly (see section 6.2).
4.5. Sea Ice
 Sea ice modifies the heat, momentum, and freshwater exchanges between ocean and atmosphere in (temporarily) ice-covered regions. Local melting and freezing and advective transports determine the sea-ice budget. A more detailed account of the sea-ice evolution in the Arctic and a comparison between the respective representations in the CMIP3 and CMIP5 models is given in a separate paper by Notzetal. . In addition, the sensitivity of the simulations to changes in the model parameterization of sea-ice geometry is discussed by Mauritsenetal. . Here we document only the most general features reflecting on the ability of the model to reproduce a realistic ice cover and thickness distribution. Regarding Arctic sea-ice coverage over the years 1980–2005, MPI-ESM shows good agreement with the observed ice extent obtained from NOAA/NSIDIC [Meieretal., 2011] (Figure 7). We diagnose for the MPI-ESM historical simulations marked improvements compared to the CMIP3 predecessor model that was evaluated by Koldunovetal. . MPI-ESM produces, in general, less summer sea ice, in particular, in regions where ECHAM5/MPIOM overestimated the ice cover: in the Laptev Sea, the East Siberian Sea, and the Canadian Archipelago. Modeled ice extent from MPI-ESM matches observations now almost everywhere, with the exception of slightly underestimated summer ice cover in the Kara Sea. In winter, notable differences between model and observations are a too small modeled sea-ice extent in the Odden ice tongue and in the Sea of Okhotsk, and a too large sea-ice extent south of the Bering Strait. In contrast to ice extent, the sea-ice thickness distribution is still a challenging quantity to observe, and only recently has it been possible to compile observational data from satellites into a sea-ice reanalysis framework [LindsayandZhang, 2006]. The Pan Arctic Ice-Ocean Modeling and Assimilation System (PIOMAS) [Schweigeretal., 2011] allows for an assessment of ice thickness over longer time and space scales in the Northern Hemisphere. PIOMAS data suggest a mean annual ice thickness of 2.5–3.3 m for the Arctic and a typical winter ice distribution with the thickest ice (exceeding 3.5–4 m) to the north of Greenland and the Canadian Archipelago. The MPI-ESM modeled distribution of sea ice thickness with the highest accumulation to the north of Greenland and the northern exit of the Canadian Archipelago agree well with the PIOMAS reanalysis [Schweigeretal., 2011, Figure 6]. The simulated sea-ice distribution, extent, and thickness are remarkably similar in the LR and MR configurations. The improvements compared to the CMIP3 simulations must be sought in the atmosphere model. For the higher northern latitudes, the evolution from a low-top model to a high-top model including the stratosphere appears to be most important, while further enhancing the vertical resolution in the atmosphere model from 47 levels in LR to 95 levels in MR leads to a better representation of variability features (e.g., quasi-biannual oscillation [Stevensetal., 2013]).
 While the annual cycle of Arctic sea ice evolution has been improved, Southern Hemisphere sea ice is generally underestimated in MPI-ESM (Figure 8). The seasonal cycle of sea ice area is less well simulated than in the CMIP3 system [Jungclaus et al., 2006, Figure 4]. This seems to be related to issues in the atmospheric pressure distribution, with a—compared to observations—too strong modeled low-pressure system over the Amundsen Sea and a significantly different air-pressure distribution over the Weddell Sea.
4.6. Mixed-Layer Depth
 The surface mixed layer is deepened by wind-induced mixing, Ekman subduction, and mixing of gravitationally unstable water columns. Such convective water mass formation is important in the formation of NADW and AABW, but also for mode waters such as the Subantarctic Mode Water (SAMW). Convectively active regions can be identified by the maximum depth of mixing attained during an annual cycle. The MLDs are calculated here using a potential density criterion of 0.125 kg m−3. For each grid point, Figure 9 depicts the annual maximum monthly MLD obtained from the 1000 year control integrations. Overall, MPI-ESM-LR and -MR produce similar patterns of convectively active regions. One important region for NADW formation is the Labrador Sea. The LR configuration features deep convection more confined to the interior Labrador Sea, which is in better agreement with observations [Pickart et al., 2002]. As mentioned earlier in the discussion of the subpolar gyre circulation, the higher resolution in the Atlantic does not help to better reproduce the pathway of the NAC in MR. On the contrary, there is a broader subpolar gyre in MR with somewhat less realistic circulation also in the Irminger and Labrador Seas. Moreover, the grid resolution near the southern tip of Greenland is higher in LR than in MR owing to the grid convergence, so that the boundary current in the Labrador Sea is simulated more accurately in LR (not shown). The centers of convection in the Nordic Seas are simulated similarly in both model configurations. In general, the MR model produces considerably shallower MLDs in the northern sinking regions, indicating less-intensive convection and less heat release to the atmosphere, which could be responsible for a more pronounced warm bias in the northern North Atlantic (Figure 3). However, as has been discussed with respect to the strength of the overflows across the GSR, the depth of the mixed layer alone is not a good measure of the deep water production that eventually feeds the NADW via the overflows across the GSR.
 In the Southern Hemisphere, deep MLDs around 40°S indicate the pycnostads associated with SAMW formation [Talley et al., 2011]. Estimates based on observations [e.g., Kara et al., 2003] indicate a band of deep mixed layers extending along much of the northern flank of the ACC in the Indian and Pacific Ocean. The simulations reproduce this feature in general, but considerable differences can be seen in the Indian Ocean. In MPI-ESM-LR the deep MLDs in the Indian Ocean are almost absent or shifted to the southern flank of the ACC. Compared to Kara et al.  and the ocean reanalysis by Carton and Giese  MPI-ESM-MR reproduces shape and magnitude of SAMW-related MLDs in the southern Indian Ocean more realistically than MPI-ESM-LR. The regions of improved MLDs are also characterized by a better representation of SST and SSS (Figure 2). We notice also differences in the western South Atlantic and to the west of Cape Horn, but the information from observations and reanalysis does not allow for a judgment which model configuration does better. Adjacent to Antarctica, both configurations simulate extremely deep mixed layers and active convection down to the deep ocean in the Weddell Gyre, but also in the Ross Sea. Although deep convection has been observed in the Weddell Sea [Gordon, 1978], such observations are scarce and both model setups are very likely to overestimate open ocean convection, while they are underestimating dense water formation on the shelves surrounding Antarctica [e.g., Marsland et al., 2004]. This discrepancy might be responsible for the general overestimation of the ACC in both model versions.
5. Variability in the Coupled System
 The main focus of the present manuscript is a documentation of the mean state of the ocean in the MPI-ESM integrations. Nevertheless, we include here two aspects of variability that are important for the coupled system: The El Niño Southern Oscillation (ENSO), and the decadal to multidecadal variability of heat transports in the North Atlantic.
 Compared to the CMIP3 model, the representation of ENSO in our CMIP5 simulations has clearly improved in terms of the strength of the variability. The amplitude, the primary metric of ENSO measured here by the standard deviation of the Niño3 (5°S–5°N, 150°W–90°W), Niño3.4 (5°S–5°N, 170°W–120°W), and Niño4 (5°S–5°N, 160°E–150°W) SST anomalies (Table 3), has decreased considerably, for example, for Niño3.4 from an overly strong 1.26°C in ECHAM5/MPIOM to a level close to the 0.76°C from observations in LR (0.75°C), while being slightly weaker for the MR configuration (0.67°C). While the convergence of ENSO amplitude toward the observed range is encouraging, an examination of the physical feedbacks responsible for ENSO development reveals that the new models also suffer from an underestimation of the Bjerknes feedback that tends to be compensated by a too-low thermal damping effect [Guilyardi et al., 2012].
Table 3. Characteristics of the El Niño/Southern Oscillation in the HadISST Data, the CMIP3 Model System, and the Two MPI-ESM Versions Contributing to CMIP5
 The shape and amplitude of the ENSO power spectrum are relatively well simulated by the new model (Figure 10). As in the observations, the modeled ENSO exhibits a rather broad spectrum between 2.5 and 6 years. However, the more realistic representation of the ENSO power spectrum is found again in the LR configuration, which, in addition to the interannual variability, shows enhanced power at decadal time scale (Figure 10). This improvement compared to ECHAM5/MPIOM is most likely due to changes in the convection parameterization in ECHAM6 [Stevens et al., 2013; Guilyardi et al., 2012]. While the reduction in amplitude makes the MPI-ESM simulations superior to earlier model versions, this does not hold for other ENSO-related characteristics. The observed seasonal phase locking of ENSO to the boreal winter was better captured by the ECHAM5/MPIOM model, whereas the new model configurations show a weaker seasonal modulation of ENSO strength in addition to a secondary maximum of ENSO in late spring-early summer (not shown). The relative strength of El Niño versus La Niña events, or the “nonnormality of ENSO” [Burgers and Stephenson, 1999] is expressed in terms of a skewness parameter. For Niño3.4 and Niño3, the observational data (HadISST) [Rayner et al., 2003] indicate a predominance of El Niño over the last 140 years. Compared to the observations and the CMIP3 model version, skewness indices are slightly degraded in the new model versions (Table 3), especially in the MR model that shows skewness close to zero. Determining the reason for this behavior is beyond the scope of this paper, but there are indications that this might be related to the modeled nonlinear response of the central Pacific zonal wind stress to SST anomalies [Kang and Kug, 2002; Frauen and Dommenget, 2010]. The spatial structure of ENSO is realistically simulated in both model versions (Figure 11). However, the more equatorially confined SST variability patterns stretch out further to the west compared to the observations. This spatial bias appears more pronounced in the LR configuration due to a too strong and too far westward extended equatorial cold tongue. Both model configurations simulate much weaker teleconnections towards the central subtropical North and South Pacific and tropical western Pacific than observed. The teleconnections to the Indian Ocean and Atlantic are reproduced relatively well and the weaker amplitude of the regression patterns in the Atlantic is another improvement over the ECHAM5/MPIOM model [cf., Jungclaus et al., 2006, Figure 13].
5.2. Decadal to Multidecadal Variability in the Atlantic Circulation and Heat Transports
 Variations in the Atlantic MOC and, more importantly, the associated meridional heat transports are important drivers of low-frequency variations of North Atlantic SSTs [e.g., Latif et al., 2004]. Moreover, oceanic variations on these time scales provide the long-term memory leading to skill in decadal predictions [Matei et al., 2012]. The period and magnitude of MOC variations, as well as the underlying mechanisms, are highly model dependent [e.g., Menary et al., 2012]. Hurrell et al.  report pronounced differences in different versions of one model system, due to differences in resolution or in the parameterizations. The latter does not seem to be the case in MPI-ESM. Time-latitude plots of decadally smoothed total advective and overturning heat transport anomalies (Figure 12) display relatively similar characteristics. In both realizations, the magnitude of the anomalies is of the order of 0.1 PW, with somewhat higher amplitudes in LR. The decomposition into gyre and MOC components reveals that the latter is mainly responsible for the variations south of 40°N, and the relative importance of both components changes roughly at the boundary between subtropical gyre and subpolar gyre. The variability regimes to the north and south of that boundary are also different. Spectra taken at 26.5°N and 48°N (not shown) exhibit peaks in variance between 30 and 50 years at the subpolar latitude and enhanced power at lower frequencies in the subtropics. Another, apparently resolution-dependent, aspect is that the overturning-related anomalies tend to migrate further to the north in LR. This is another consequence of the representation of the subpolar gyre that is probably too wide and stationary in MR. Details of the multidecadal variability in MOC and heat transport as well as the role of the gyre system in regulating the meridional heat transports are the subject of ongoing investigations and will be reported in an upcoming paper.
6. Ocean Model Resolution and Particular Circulation Patterns
 In this final section, we focus on two regions where the ocean resolution in the LR and MR configurations show fundamental differences. One region is the tropical Pacific ocean, the other the margin between the respective subtropical gyres and the ACC.
6.1. Equatorial Pacific
 Simulated currents and temperatures in the Pacific Ocean along the equator and at 110°W are compared with observations from Johnson et al.  in Figure 13. The compilation of hydrographic observations and current measurements were obtained by an evaluation of several oceanographic sections from the 1990s. Along the Equator (Figure 13, left column), both model versions show a realistic representation of the position and slope of the 20°C isotherm, with a tighter thermocline in the MR simulation. In the observations, the 26°C isotherm outcrops near 140°W where isotherms in the simulations are closer to 24°C. The finer-resolution TP04 grid in MPI-ESM-MR delivers a clearly better representation of the core of the eastward flow along the thermocline, with more realistic speeds and location of the maximum. In comparison with the observations, the boundary between eastward and westward flow is, however, too shallow in MR whereas in LR, there is clearly too strong westward flow in the surface layers. At 110°W (Figure 13, right column), the TP04 grid allows for a good representation of the current system in terms of horizontal and vertical structure and amplitude. In particular, the position of the maximum and the tilt of the core of the equatorial undercurrent with depth are well captured. This holds also for the structure and the asymmetry of the south equatorial current system across the equator with highest speed near 2°N. Stratification in the upper layers appears to be stronger in the simulations, and both model versions are somewhat warmer than the observations below the thermocline. On the other hand, the observations only cover a limited period and may be subject to variability on various time scales.
6.2. Southern Ocean/South Atlantic transition
 Resolution-dependent differences between the LR and MR versions of MPI-ESM have shown up most clearly in the transition region between Southern Ocean and South Atlantic. Intermediate-depth temperature and salinity biases (Figure 3) indicate an overestimation of the input of warm and saline Indian Ocean waters, and the BSF (Figure 4) shows pronounced differences in the Agulhas region and the confluence zone along the Brazilian coast. This part of the world ocean has received increased attention in recent years because of its role in regulating the freshwater (salt) and heat input into the Atlantic and possible consequences for the stability of the Atlantic MOC [De Vries and Weber, 2005; Biastoch et al., 2009; Lee et al., 2011]. The greater Agulhas system feeds the upper limb of the overturning circulation through the “leakage” of warm and salty waters from the Indian Ocean to the Atlantic [Beal et al., 2011]. Hydrographic observations demonstrate that the Agulhas current, after having passed the southern tip of Africa, turns south and then forms the so-called Agulhas retroflection, where the Agulhas current loops back into the Indian Ocean [Lutjeharms and van Ballegooyen, 1988]. The leakage is thought to be mainly maintained by the shedding of Agulhas rings, i.e., eddies carrying warm and salty waters. The system is prone to strong variations related to the dynamics of the current system and the position of the zero line of wind stress curl [Beal et al., 2011].
 Comparing in more detail the MPI-ESM LR and MR model solutions, it is clear that only the higher-resolution model allows for the formation of a pronounced retroflection pattern (Figure 14). In MR, the Agulhas current (AC) is simulated as a narrow boundary current that is retroflected back into the Indian Ocean forming the Agulhas return current (ARC), clearly to be identified in Figure 14b. The formation of anticyclonic eddies, resembling Agulhas rings, can also be seen in the MR realization. In the LR setup, the AC largely continues into the Atlantic and feeds a southern hemisphere “supergyre” [e.g., Speich et al., 2007]. The subtropical front south of Africa is situated more to the south in LR; in fact, the position of the zero line of the BSF along 20°E differs by about 4° latitude (Figure 14a). This feature is clearly related to a lack of resolution. A similar difference occurs in the uncoupled ocean simulations. This is not surprising as eddies play an important role for the dynamics of the ACC and the retroflection. The intensity of mesoscale activity is reflected in the variance of sea surface height (SSH). The standard deviations obtained from monthly SSH data from the respective simulations are compared with data derived from satellite altimetry (AVISO, http://aviso.oceanobs.com/duacs/) in Figure 15. The observational data identify the AC and ARC as narrow and meandering flows. The northwestward orientated tongue of high eddy activity in the Cape Basin indicates the shedding of Agulhas rings and their pathway into the South Atlantic. None of these features is captured by the LR model, although there is some enhanced variance to the south of Africa related to variations of the BSF. In contrast, although underestimating the magnitude of the variance in comparison with the observations, the MR realization includes the position and width of the ARC as well as the shape and orientation of the Agulhas rings' pathway. At eddy-permitting resolution, the MR model is clearly not able to reproduce all the details of mesoscale activity in the real Agulhas system. Biastoch et al.  have demonstrated that for this purpose, eddy-resolving (0.1°) resolution is necessary and that eddy-permitting models tend to produce Agulhas rings too regularly. Nevertheless, the MR model includes, with its ability to simulate rings carrying Indian Ocean water masses into the Atlantic, a considerably improved representation of the exchange of mass, heat, and salt at this key location of the thermohaline circulation.
 In contrast to the MR simulation, the Agulhas current in the LR setup continues into the South Atlantic subtropical gyre, and the opening between the two oceans, as defined by the zero line of the BSF, is much broader. The differences in circulation seen at the surface carry on in the deeper layers (not shown), and the intermediate depth flow carries overly warm and salty water into the Atlantic, where it accumulates in the subtropical gyre. These water-mass differences also contribute to the much stronger South Atlantic subtropical gyre in LR.
 The differences in the hydrography and circulation also have consequences for the heat and freshwater transport into the South Atlantic. De Vries and Weber , by applying a freshwater flux anomaly, have demonstrated that modification of the salinity properties can lead to pronounced changes in the relative contribution of gyre and overturning component of the meridional freshwater transports, and this is what we diagnose in our analysis. Figure 6d demonstrates that both configurations simulate a similar (about 0.2 Sv) total freshwater transport at 34°S, but with different contributions from the subcomponents. While the overturning contribution is near zero in LR, it is clearly negative (−0.14 Sv) in MR. Given that the MR model simulates the South Atlantic circulation and hydrography more realistically than the LR model, it seems reasonable to also assume that the sign of the freshwater transport is better reproduced in the higher-resolution model. The notion of a freshwater exporting overturning circulation is also supported by recent evaluations of observations [Huisman et al., 2010; Bryden et al., 2011] and ocean reanalyses [Hawkins et al., 2011]. In contrast, most CMIP3 models show freshwater import by the overturning component [Drijfhout et al., 2010].
 Earlier studies have suggested that the sign of the overturning component can be used as an indicator for MOC stability such that the Atlantic MOC is bistable if it imports its own salt [Rahmstorf, 1996; Dijkstra, 2007; Hawkins et al., 2011]. At present, we do not have freshwater perturbation experiments at hand for the model configurations discussed here. A preliminary evaluation of the CMIP5 idealized global warming experiments in the two model configurations indicate a stronger decrease of the Atlantic heat transport in the MR simulation by the time the simulations reach four times preindustrial CO2. However, the experiments were not carried out long enough to assess any MOC recovery behavior, and the experimental setup does not allow for a clear discrimination between oceanic and atmospheric processes, or if the stronger decline in MR just follows from the different mean state in the unperturbed run. In the near future, water hosing experiments [Stouffer et al., 2006; Swingedouw et al., 2012] will be carried out to study, in detail, the stability of the MOC in the two model setups.
7. Summary and Conclusion
 This paper provides a description of MPIOM as the ocean component of the MPI-ESM. A basic evaluation based on long control simulations and ensembles of historical integrations is presented for two model configurations: MPI-ESM-LR and MPI-ESM-MR. The two setups differ in the vertical resolution in the atmosphere and the horizontal resolution in the ocean.
 Compared to the CMIP3 model system ECHAM5/MPIOM [Jungclaus et al., 2006], we have diagnosed improvements mainly regarding features that are related to changes in the atmosphere model. This comes as no surprise as the ocean model did not change apart from technical developments and the CMIP3 model also applied the same ocean grid as MPI-ESM-LR. Stevens et al.  have documented the development steps leading from ECHAM5 to ECHAM6 analyzing atmosphere stand-alone simulations. They conclude that higher vertical resolution in the middle atmosphere lead to improvements in the extratropical circulation, a better representation of the middle atmosphere, and a systematic reduction of temperature biases in the upper troposphere. For the coupled system and the ocean, improvements can be seen most notably in the ENSO amplitude and in the circulation over the Arctic. The latter is likely related to an improved representation of parameterized gravity waves (Mauritsen, personal communication). In the analysis provided here, we have focused on the mean state and main variability features and we have not found many differences that could be related to the further increased vertical resolution in the MR atmosphere (95 versus 47 levels). Further research will study details of stratosphere-troposphere interaction and their role for coupled ocean-atmosphere processes [e.g., Manzini et al., 2012] in more detail.
 Comparing the surface properties of the MR and LR simulations reveals that applying higher resolution leads to an improvement in the North Pacific, the eastern tropical Pacific, the southern South Atlantic and southern Indian Ocean, whereas the North Atlantic and the Labrador Sea are less well simulated in MPI-ESM-MR. The two ocean model grids (GR1.5 and TP0.4) discussed here differ not only in their nominal resolution but also in the placements of the poles of the curvilinear grids. While TP04 features quasi-homogeneous resolution over the ocean (because the northern grid poles are placed over large Northern Hemisphere land masses), the northern pole in GR1.5 is located over Greenland. The resulting resolution convergence can be seen as an advantage because it allows for a relatively good representation of the deepwater formation centers in the Labrador Seas and in the Nordic Seas and in the overflow channels across the GSR. Thus, while resolution-dependent improvements can be clearly seen in the TP04 model in the tropical circulation and in the South Atlantic, a less good performance in the North Atlantic and in terms of overflow water masses could be a consequence of a better representation of these regions in the GR1.5 grid. However, since a much higher resolution of the GR class grid does not appear appropriate we expect further convergence at eddy-resolving resolution in a coupled model with a TP6M grid.
 Many of the biases with respect to observational estimates occurred also in the earlier version ECHAM5/MPIOM indicating a need for further model improvement in the coupled system. This holds, in particular, for the warm biases in the upwelling regions along the west coasts of North and South America and southern Africa, and the cold bias along the equator in the Pacific. The cold bias in the North Atlantic, however, seems to be predominantly a problem of the ocean model. Recent progress has been documented by Bryan et al.  and Danabasoglu et al. , arguing for a better representation of the overflows and, eventually, for eddy-resolving resolution in global models.
 Subsurface quantities also show resolution-dependent differences. Most notably, the errors in intermediate-depth water mass properties in the South Atlantic, which appear most pronounced in the LR configuration, are related to the overestimation of heat and salt transfer from the Indian Ocean into the South Atlantic. At eddy-permitting resolution, the MR configuration provides a much better representation of the Agulhas current system, and such local mesoscale dynamics turn out to be an important ingredient for a proper representation of the mean state in the Atlantic.
 While the resolution-dependent differences between the LR and MR configurations give insight into important aspects of the underlying dynamics, it is also worth noting that the MPI-ESM setups behave rather similarly in many aspects. In particular, the control experiment variability characteristics, as described here in terms of ENSO and Atlantic heat transport variability, appear to be quite robust in terms of amplitude and spectrum.
 Ocean-only experiments indicate that the “eddy-permitting” model configuration may have to be just an intermediate step toward a globally eddy-resolving ocean component in an ESM. For example, Farneti and Delworth  show a resolution-dependent response of the MOC to southern hemisphere wind changes, and Biastoch et al.  have demonstrated that an eddy-resolving resolution of 0.1° is necessary to adequately simulate the shedding of Agulhas rings. This is also what we find in ocean-only simulations with the MPIOM TP6M setup at globally 0.1° resolution (von Storch, submitted), but the computational costs of such a system prevent us from running such a model coupled to a high-resolution atmosphere. On the other hand, regional grid refinement, for example in the overflow region or the Agulhas region, might be another strategy. At MPI-M, model development will explore directions as well as new dynamical cores and further improvements in the representation of oceanic processes.
 The authors thank the scientific programmers at MPI-M and the staff of the German Climate Computing Center (DKRZ) for their support in developing the model and conducting the model experiments. The CMIP5 simulations have been funded by the Federal Ministry for Education and Research in Germany (BMBF). J.H.J. acknowledges funding through the BMBF research program “MiKlip” (FKZ: 01LP1158A), and D.M. received financial support through the BMBF research program “Nordatlantik” (FKZ: 03F0605D/E). K.L. was supported through the Cluster of Excellence “CLISAP” at Hamburg University, funded by the German Science Foundation (DFG). We thank Achim Stössel, Stephen Griffies, and one anonymous reviewer for comments and suggestions that helped to improve the manuscript.