This study aims to compare and validate two soil-vegetation-atmosphere-transfer (SVAT) schemes: TERRA-ML and the Community Land Model (CLM). Both SVAT schemes are run in standalone mode (decoupled from an atmospheric model) and forced with meteorological in-situ measurements obtained at several tropical African sites. Model performance is quantified by comparing simulated sensible and latent heat fluxes with eddy-covariance measurements. Our analysis indicates that the Community Land Model corresponds more closely to the micrometeorological observations, reflecting the advantages of the higher model complexity and physical realism. Deficiencies in TERRA-ML are addressed and its performance is improved: (1) adjusting input data (root depth) to region-specific values (tropical evergreen forest) resolves dry-season underestimation of evapotranspiration; (2) adjusting the leaf area index and albedo (depending on hard-coded model constants) resolves overestimations of both latent and sensible heat fluxes; and (3) an unrealistic flux partitioning caused by overestimated superficial water contents is reduced by adjusting the hydraulic conductivity parameterization. CLM is by default more versatile in its global application on different vegetation types and climates. On the other hand, with its lower degree of complexity, TERRA-ML is much less computationally demanding, which leads to faster calculation times in a coupled climate simulation.
 The lower boundary to the atmosphere for fluxes of radiation, heat, water, momentum and chemical compounds is provided by the land surface [Stöckli et al., 2008]. The importance of these fluxes for the climate system has already been demonstrated in several studies (e.g., Pielke ; Seneviratne et al. ; Pitman ). Land-atmosphere interactions in tropical Africa are very dynamic, e.g., tropical forests have a large impact on surface temperature by physiological processes (through latent heat flux) rather than by radiative processes (through albedo), which is completely different from midlatitude or boreal regions [Snyder et al., 2004], making this region an essential study area.
 Soil-vegetation-atmosphere transfer (SVAT) schemes, also known as land-surface parameterizations (LSP) or land-surface schemes (LSS), simulate surface characteristics, e.g., properties of soil, vegetation and lakes, and calculate the matching turbulent fluxes as lower atmospheric boundary conditions in global or regional atmospheric models (for both numerical weather prediction and climate applications). During the last decades SVAT schemes have been developed from simple energy balance parameterizations to complex sub-models including a full integration of connected biogeochemical processes [Sellers et al., 1997]. Intercomparison studies were done in order to objectively quantify and compare the performance of different SVAT schemes, most notably by the PILPS (Project for the Intercomparison of Land-Surface Parameterization Schemes) comparing 23 schemes of different complexity [Henderson-Sellers et al., 1996]. The study revealed a considerable spread among simulated water and heat fluxes and a significant parameterization uncertainty [Pitman and Henderson-Sellers, 1998]. Similar results were found in subsequent intercomparison studies [Dirmeyer et al., 1999; Boone et al., 2004]. Although not yet entirely understood, for many SVAT parameterizations the uncertainty originates from the fact that simulated processes are based on few field observations and idealized experiments [Stöckli et al., 2008]. Therefore including regions with sparse observations is a fundamental objective.
 Model validation and improvement partly consists of tuning parameters in such a way that simulated values fit the measurements as good as possible. The majority of the flux measurements used for parameter tuning are located within the extra-tropical Northern Hemisphere [Williams et al., 2009], particulary forest ecosystems, hence model validation and improvement is biased toward these observations. Therefore, also including flux observations outside temperate and boreal climate zones should be an important priority.
 The focus of this study will be on the performance of two state-of-the-art schemes which differ in their degree of complexity: TERRA-ML, version 4.0, developed at the German Weather Service (DWD); and the Community Land Model (CLM), version 3.5, developed at the National Center of Atmospheric Research (NCAR). TERRA-ML (also called “TERRA” in figures and tables) and CLM3.5 are originally designed as components of the atmospheric models maintained respectively by DWD and NCAR. It is possible, however, to use the schemes apart from their native atmospheric host model and to couple them to another atmospheric model (e.g.,Davin et al.  describes the coupling of CLM3.5 to the atmospheric model of DWD), allowing a choice of SVAT scheme based on an objective intercomparison independently from the choice and performance of the atmospheric model.
 Following the methodology applied in the second phase of PILPS [Pitman and Henderson-Sellers, 1998], both SVAT schemes are run in standalone mode by decoupling them from the atmospheric model and forcing the atmospheric boundary conditions with meteorological in-situ measurements. Therefore the schemes can be regarded as individually functioning models and their performance can be evaluated independently from atmospheric models. The SVAT models are run with a 1D vertical column grid without a spatial dimension (single grid cell approach). This approach is useful to assess model biases which originate from the SVAT component, and to isolate them from general biases of the atmospheric host model.
2. Method and Data
2.1. Model Description
 Each soil layer in both models has a water budget depending on extraction by evapotranspiration, runoff, capillary and gravitational transports, and vertical boundaries (e.g., from adjacent soil layers). Forced by climatic conditions and based on a range of mechanistic equations for hydrological (soil) and biophysical (vegetation) processes, the models simulate fluxes of radiation, water and heat. Specific model properties and characteristics are discussed in Sections 2.1.1 and 2.1.2.
2.1.1. Community Land Model (CLM)
 CLM is the SVAT component of the global climate model CCSM (Community Climate System Model) Version 3 [Collins et al., 2006]. All vegetation characteristics are prescribed separately for seventeen so-called “Plant Functional Types” or PFT's [Bonan et al., 2002]. These are basic vegetation types which can be combined to represent more complex biomes, e.g., a savannah biome can be composed by combining grassland with deciduous tropical trees. One single grid cell can contain up to four different PFT's for which energy and moisture fluxes are simulated. The grid cell averaged flux is calculated according to the PFT's areal proportions (tile-approach). Vegetation parameters are mainly defined in an extensive set of PFT-dependent constants (e.g., optical and aerodynamic parameters, root distribution parameters, photosynthetic capacities..) listed inOleson et al. , complemented with spatially-varying data sets for leaf area index (LAI), stem area index (SAI) and canopy height. Soil albedo (αsoil) depends on spatially-varying soil color, and plant albedo (αplant) depends on PFT-dependent optical properties (e.g., leaf albedo, stem albedo). The total surface albedo is a combination of both, depending on the proportion of sunlit soil exposure through the canopy (which in turn depends on LAI and SAI), and is calculated for the near-IR and visible spectrum separately. The soil column has a total depth of 3.43 m and consists of ten layers, each of which can have a different soil texture (provided by a yet another spatially varying data set). Hydraulic conductivity and diffusivity are parameterized using formulations byClapp and Hornberger . A prognostic aquifer scheme allows a variable groundwater table to rise into the soil column or fall below it [Niu et al., 2007]. A detailed model description of CLM3.0 is given by Oleson et al. . For modifications in CLM3.5, see Oleson et al. . On the basis of recommendations from the model developers, an optimal model time step of 1200 seconds was chosen.
 TERRA-ML is the SVAT component of the regional atmospheric model COSMO (Consortium for Small-scale Modeling), mainly used as an operational forecast model by the German Weather Service (COSMO-EU and COSMO-DE; seeBaldauf et al. ) and more recently applied on climate scales (COSMO-CLM; seeRockel et al. ). It is also part of the operational global atmospherical model GME from the German Weather Service [Majewski et al., 2002]. The vegetation within each grid cell is characterized by a small number of parameters, being LAI, plant cover, root depth and roughness length. Based on the Global Land Cover 2000 data set (GLC2000), a number of vegetation types is defined for which all parameters have a specific value. No distinction is made between, e.g., tropical, temperate and boreal forests (note that TERRA-ML does not explicitly recognize vegetation types, but only uses the values of vegetation parameters). For LAI and plant cover, the look-up table [Doms et al., 2011] provides a minimum and maximum, from which the model derives an annual cycle: for the extra-tropical Northern Hemisphere, a growing and resting period is calculated according to latitude. This is shifted 180 days for the extra-tropical Southern Hemisphere. In the tropics (20°N–20°S) all months are considered growing period (constant and maximum values of LAI and plant cover). Finally a height correction factor is applied (only on LAI). The (total) surface albedo within a grid cell is a combination ofαplant and αsoil, weighted by the areal proportion of plant cover in the grid cell. αsoiland soil texture are depending on six spatially-varying soil types. Plant transpiration and soil evaporation are parameterized using the BATS scheme [Dickinson, 1984]. With only six hydrological layers, a vertically homogeneous soil texture and a total depth of 2.43 m, the soil column is less sophisticated than in CLM. Parameterization of hydraulic conductivity and diffusivity is based on Rijtema . The lower boundary condition (at lowest soil level) consists of free drainage from gravitational water, only allowing a downward water flux. The full model documentation can be found in Doms et al. . In a sensitivity test, a range of model time steps was applied. Time steps smaller than 360 seconds did not influence the results significantly, hence this was chosen as an optimal trade-off between performance and computing time.
 When TERRA-ML is used in a coupled climate model, its computational cost is lower than CLM. The wall time needed for a simulation of one year depends on machine type, number of processors, etc. and hence can not easily be presented quantitatively. To give a indicative comparison, the walltime at the K.U.Leuven supercomputer for one year on 32 Nehalem X5560 processors amounts 39.6 hours when coupled to TERRA-ML, and 55 hours when coupled to CLM, which is a difference of about 40%.
2.2. Model Setup
2.2.1. Initial Conditions
 The models are set up with the same initial conditions for soil moisture (full saturation) and soil temperature (300 K). The spin-up time is defined as the time needed to reach equilibrium of soil water levels, which is achieved after repeatedly re-initializing the model from the former model output. All analyses are done on spun-up model runs.
2.2.2. Vegetation Composition
 The last columns of Table 1give an overview of the general vegetation characteristics around the flux tower sites used in this study. The vegetation can generally be characterized as a specific GLC2000 vegetation class in TERRA-ML (Table 2) or a combination of PFT's in CLM (Table 3). The only flux site which has a heterogeneous PFT composition is Demokeya, for which the savannah vegetation is simulated by 80% grassland (GRA) and 20% tropical deciduous trees (BDT). In TERRA-ML, the flux site is categorized as “deciduous shrubs” as well as “deciduous open broadleaved forest”, however both classes have the same parameter values.
 For the main model comparison and validation (Section 3), input parameters (e.g., αsoilin TERRA-ML) are not tuned to site-specific observations since the ultimate objective of a SVAT model is the coupling to a spatially explicit atmospheric model, without modifying these parameters on point-based scale. An overview and comparison of the default input parameters can be found inTable 3(CLM) and 2 (TERRA-ML).
 In CLM, the default input data set for LAI and SAI is a spatially-varying annual cycle, based on MODIS satellite images [Lawrence and Chase, 2007]. For every grid cell a further distinction is made between different vegetation types. Hence the values depend on the spatial coordinates of the flux site, the time of the year and the vegetation type (“PFT” in CLM). The annual course of LAI at the different flux sites is given in Figure 2. The MODIS LAI at Demokeya is discussed in Ardö et al.  and for Mongu see Huemmrich et al. . The model allows a spatially-varying canopy height and bottom, but for the moment these are still set to PFT-dependent constants. The default input data set for soil color is also based on MODIS data [Lawrence and Chase, 2007]. Layer-dependent sand and clay percentages are prescribed from the global soil profile texture data sets ofReynolds et al. , and are currently the default soil texture inputs for CLM.
 In TERRA-ML, default LAI and plant cover in the tropics are constants, only affected by a height correction factor. For most of the flux tower sites, the calculated LAI approximates the maximum value from the look-up tables [Doms et al., 2011], only for the Mongu site (>1000m) the LAI decreased drastically due to the height correction (from 3.4 to 2.3). TERRA-ML has six soil classes (sand, sandy loam, loam, clay loam, clay and peat) based on the FAO/UNESCO Soil Map of the World. Based on the description of local soil characteristics by the flux tower managers, all sites are best represented by the “sand” class, containing 90% sand and 5% clay, (the “sandy loam” class would imply a texture of only 65% sand). The “sand” class has a soil albedo of 0.30 (Table 2). Total surface albedo is calculated as follows:
in which αplantequals 0.15 (non-spatially hard-coded constant) and PC is plant cover (0.8 to 0.9 at all flux sites).
2.2.4. Meteorological Forcing Data
 The models are forced with hourly meteorological input data at reference height (Table 1). They both require continuous time series of precipitation rate (P; mm s−2), air temperature (Ta; K), relative humidity (RH; %), wind speed (u; m s−2), downward short-wave radiation (SWd; W m−2) and downward long-wave radiation (LWd; W m−2). In addition CLM also needs surface pressure (Ps; Pa) which is a constant in TERRA-ML and calculated as the overall mean surface pressure adjusted for each site. Analogous toStöckli and Vidale , data gaps (e.g., due to measurement failure) or inconsistencies (e.g., negative solar irradiance) are bypassed using a linear interpolation, except for P which is substituted with zeros, and SWdwhich is estimated by interpolating only between similar local solar times. This is done in order to provide a required continuity in the input data. Only small gaps are allowed (<1 day) since long interruptions could affect the simulation. Model output corresponding with gap-filled or substituted meteorological data is not taken into account for the evaluation.
LWdis only observed at the Mongu flux site and has to be estimated for the other flux sites. This is typically done by a semi-empirical model based on temperature and vapour pressure. These models are calibrated with a collection of weather station data, and hence are limited in their geographical applicability. Because of the scarce data availability in the tropics,Aguiar et al.  applied and evaluated seven models in the Amazon region for a forest and pasture site: the Idso model [Idso, 1981] performed the best in humid conditions but overestimated LWd during dry conditions, while the Brutsaert model [Brutsaert, 1975] was better fit for dry conditions since it yields generally lower values. In this study, the two models are compared with observed LWd for the Mongu site. Given the pronounced seasonality at Mongu, it is a good reference for both humid and dry conditions. The results are shown in Figure 3 and confirm the findings of Aguiar et al. . Consequently the Idso model will be used for the two wet-equatorial flux sites (Kissoko and Tchizalamou) and the Brutsaert model for the semi-arid flux site (Demokeya). For Mongu the observed quantities are used.LWd is estimated as
where ϵo is the atmospheric emissivity and depends on the chosen model. The parameterization of Idso and Brutsaert are respectively
where ea (Pa) is the atmospheric vapour pressure. In case Ps is not measured, it is estimated by
where M is the molecular weight of air, g the gravitational acceleration, z the measurement height above sea level (Table 1), R the universal gas constant, and Pso the mean sea level pressure (1013 hPa).
2.3. Evaluation Data
Figure 1 and Table 1 give an overview of the flux tower sites used in this study with their respective coordinates and general characteristics. These sites are either associated with or part of the CarboAfrica network [Bombelli et al., 2009] which is a part of the global FLUXNET network [Baldocchi, 2001], providing a comprehensive data set of measured micrometeorology, i.e., surface energy, water and carbon fluxes, together with general meteorological observations. The Kissoko (forest) and Tchizalamou (grass) sites are representative for a near-equatorial (4°S) moist climate and are discussed in-depth byNouvellon  and Castaldi et al. . Two other sites have a more pronounced seasonal climate resulting in a deciduous woodland vegetation at the Mongu site (15°S) and a savannah vegetation at the Demokeya site (13°N), discussed in Merbold et al.  and Ardö et al. .
 Half-hourly surface sensible (H; W m−2) and latent (LE; W m−2) heat fluxes were measured with the eddy covariance technique, documented by Moncrieff et al. . None of the validation data were gap-filled and hence corresponding points in time are not taken into account for the model evaluation.
 Based on findings of Sellers et al.  and Crago and Brutsaert , random measurement uncertainty (ULE and UH; W m−2) is introduced on LE and H observations during each time step. The uncertainties consist of a relative error constrained by a minimum error for smaller flux magnitudes:
 Net radiation (Rn; W m−2) was measured with a single-output radiometer (not indicating separate energy balance components), independently from the measurements ofSWd. Because no useful ground heat flux observations are available for most of the flux sites, the ground heat flux is parameterized as a funcation of LAI and Rn, according to Choudhury et al.  and adapted by Kustas et al. :
 Results are presented as monthly averages which could not be calculated for months without flux measurements, explaining the occasional gaps in the graphs. Model-derived monthly averages are calculated by omitting data which corresponds to gaps and inconsistencies in meteorological forcing data (Section 2.2.4) and gaps in micrometeorological flux data (LE and H).
 Average energy fluxes are only calculated for daytime, defined as all data corresponding with a measured Rn > 50 W m−2 [Nussbaum et al., 1995; Leeuw and Zantvoort, 1997]. This is done in order to prevent a flattening-out of the signal caused by the much lower nighttime fluxes.
 Data gaps could occur on preferential times of the daily cycle (e.g., at noon) and hence bias the monthly flux averages. Therefore first the monthly means of all valid flux data are calculated per hourly time segment of the daily cycle (ignoring possible data gaps), from which a weighted average is calculated according the frequency of Rn > 50 W m−2 in each segment.
 Uncertainty ranges are indicated by error bars and calculated as monthly means of the half-hourly uncertainties, however the real errors are likely to be lower since propagated uncertainty decreases with accumulation time [Williams et al., 2009].
3. Model Comparison and Validation Using Default Input Parameters
3.1. Monthly Timescale
 The Kissoko site is characterized by a long wet season spanning from October to April (Figure 4). A minimum observed latent heat flux (100 W m−2) is maintained year-round due to the transpiration capacity of the vegetation (Figure 5a). Consequently the annual cycle has a relatively small amplitude. Both models overestimate LE during the wet season with around 50–70 W m−2 in 2005 and around 25 W m−2in 2006. In contrast to CLM, TERRA-ML generally fails to capture a sustained evapotranspiration during the dry season, and hence underestimates LE (by about 50–75 W m−2). Underestimations of latent heat flux correspond with overestimations of sensible heat flux, and vice versa (Figure 5b). The ground heat flux is generally well represented in both models (Figure 5c).
 Humidity and precipitation seasonality at the Tchizalamou site is similar to Kissoko's, and average monthly rainfall during wet seasons is much higher (Figure 4). Nevertheless, the grass vegetation limits observed LE (Figure 6a): the lower canopy cover captures less water to evaporate and allows more water to infiltrate the soil, and the photosynthetic active biomass is much lower, hence the transpiration rates too. The absence of deep roots explains the dry-season depressions, with an average value of 50 W m−2. LE is overestimated in the wet season with big differences between the models: TERRA-ML overestimates by up to 100 W m−2, while in CLM this is maximally 50 W m−2. The inter-model difference for LE is reflected by the other fluxes (Figures 6b and 6c), e.g., during the first wet season (2006–2007) the higher LE in TERRA-ML corresponds with a lower H and a lower G. In the dry seasons, both models simulate LE equally well, while H is mostly underestimated by CLM (30–50 W m−2). The ground heat flux is systematically overestimated by CLM and underestimated by TERRA-ML, both by up to 30 W m−2 (Figure 6c).
 The Demokeya flux site has a climatology of a short wet season (July to October) followed by eight dry months (Figure 4). Demokeya is located more to the north and its semi-arid savannah climate has lower precipitation amounts compared to the evergreen flux sites located closer to the equator. Water scarcity during the long dry season forces a higher Bowen ratio (H/LE) with an observed average monthly H of about 200 W m−2 (Figure 7b). LE is quasi-constant and very low (Figure 7a). In the short wet season, the precipitation causes LE to peak, while H has exactly the opposite annual cycle. The overall performance of CLM is good, with underestimations of LE during the wet season (about 30 W m−2) and corresponding overestimations of H. In TERRA-ML this bias is even higher (70 W m−2), and also visible in the dry season (30 W m−2). Furthermore a strong and consistent overestimation of H can be seen (about 70 W m−2), causing monthly averages of H up to 300 W m−2. Finally the ground heat flux has an overestimation during dry season (maximally 50 W m−2) which disappears in the wet season (Figure 7c).
 At the Mongu flux site, the rainy season consists of the months from November to March (Figure 4). Compared to the evergreen forest site (Kissoko) a stronger precipitation peak occurs in January but the entire period only lasts for five months, explaining the vegetation's deciduous nature. During the wet season, the flux partitioning of TERRA-ML is unrealistic with an overestimation of LE and underestimation of H, on average respectively 100 and 60 W m−2 (Figures 8a and 8b). This bias is probably related to overestimated soil moisture in the topsoil layers. The dry season on the other hand is characterized by a well-simulated H and an underestimation of LE of about 30–60 W m−2. CLM simulates LE and H well during the wet season. Similar to TERRA-ML but to lesser extent, CLM also underestimates LE during the dry season (20 W m−2), while H is overestimated drastically (50–70 W m−2).
3.2. Hourly Timescale
 The evaluation on monthly (daytime-only) timescale does not give all information about the model performance. It is also important that the SVAT schemes can simulate the daily cycle with a good amplitude and phase. This can be evaluated generally by correlating simulated and observed hourly time series. An overview of model performance statistics is given inTable 4, showing correlation coefficients (R) and root mean square errors (RMSE) between brackets. A visualization in Taylor diagrams [Taylor, 2001] is shown in Figure 9, summarizing a number of statistical parameters. Statistics are calculated for entire time series on hourly timescale and hence are not to be compared directly to the daytime-only monthly averages (Figures 5 to 8). Correlation between simulated and observed fluxes is high for all models, with coefficients mostly exceeding 0.85. Inter-model performance differences are mostly in favor of CLM, except for H at Tchizalamou. For Kissoko, the lower performance of dry-season LE in TERRA-ML can be seen in the RMSE as well as in correlation (0.87 versus 0.96 for CLM).
Table 4. Model Performance Statistics for LE and H on Hourly Timescale (Correlation: RMSE)a
Bold numbers indicate best performance with default input parameters. Italics indicate significant improvements of TERRA with modified input parameters (TERRAmod).
The hydraulic conductivity experiment for the Mongu site is not included since it is not applicable for all flux sites.
 In this study version 3.5 of the Community Land Model was used. However, the latest version (CLM4) includes a number of parameterization improvements, possibly alleviating some deficiencies noted in this study. A comparison of CLM3.5 and CLM4 is done by Lawrence et al.  and indeed reveal improvements for the two tropical flux sites (Amazon region) with higher correlations and lower RMSEs in CLM4, especially for sensible heat flux.
 The major deficiencies noticed in TERRA-ML simulations are discussed insection 4 and addressed by modifications to input parameters. No important parameterizations or input parameters were modified since the model version used in this study.
4. Modification of TERRA-ML
4.1. Modification of Input Parameters
 Important issues in the default TERRA-ML simulations are the overestimation of LE, especially during the wet season (Figures 5a, 6a and 8a), the consistent overestimation of H at the Demokeya site (Figure 7b), and the failure to capture a sustained evapotranspiration during the dry season (Figure 5a and 8a).
 The main objective of this section is to enhance the performance of TERRA-ML and reveal the true capacities of the SVAT model after improving its input parameters. The modifications will be applied on all flux sites and the effect will be shown for all simulated fluxes (Figures 10–13). The default TERRA-ML simulation is represented by the broken blue lines.
4.1.1. Leaf Area Index
 The first parameter to modify in TERRA-ML is the LAI. A decrease of LAI might decrease overestimations of LE. Instead of the default constant value, a MODIS-based annual cycle of LAI is prescribed (Figure 2), which is the same as used by CLM and discussed in Section 2.1.2. The new LAI is based on spatially-varying satellite information and will therefore be more similar to observations than the default look-up table values. The new values are generally lower than the TERRA-ML defaults (Table 2), especially for the Tchizalamou site.
 The simulation with modified LAI is shown by green solid lines in Figures 10–13. For most of the flux sites, the LAI does not have a significant influence on any of the heat fluxes. Given its significant decrease in LAI, the simulated flux partitioning at Tchizalamou does show improvements: the overestimation of LE decreased with about 50 W m−2 (Figure 11c) and the underestimation of H improved accordingly (Figure 11d).
4.1.2. Surface Albedo
 An increase of surface albedo decreases the available energy at the surface (Rn), and might alleviate overestimations of H and LE. The black solid lines in Figures 10a, 11a, 12a and 13ashow the surface albedo observed by the SEVIRI radiometers onboard the METEOSAT-9 satellites [Schmetz et al., 2002; Schulz et al., 2009]. The data is available at CM-SAF (http://www.cmsaf.eu/) for the period may 2007 to present. For Kissoko, data of 2007–2009 is shown instead of 2004–2006. The grid cell resolution equals 15x15km which means that the values are not entirely representative for the immediate surroundings of the flux tower. However it is an indicative proxy and suitable for identifying serious model biases.
 In the case of Kissoko, Tchizalamou and Demokeya, TERRA-ML underestimates surface albedo (blue crosses inFigures 10a, 11a and 12a). To obtain a better match with the observations, αsoil and/or αplant are increased for the three flux sites (Table 2). The new surface albedo is indicated by red crosses in the above mentioned figures. TERRA-ML allows the albedo to change according to plant cover (i.e., in an annual cycle), but since this is a constant value in the tropical region (Section 2.1.2), albedo is only slightly affected by superficial soil moisture variations.
 As a result of the albedo modification, Rn is simulated better at all three flux sites (10b, 11b and 12b). The graphs are showing the cumulative effect of consecutive modifications, hence contain the joint effect of modified LAI (Section 4.1.1) and modified albedo. Additional improvements might be obtained by taking into account a more detailed canopy structure (with, e.g., leaf inclination angle distribution) influencing light interception [Baldocchi et al., 2002], which is not included in the models used in this study. The propagation of the improvement to the heat fluxes is however different for every flux site. At Kissoko, the small decrease of overestimated Rn (about 20 W m−2) is translated into a decrease of H, which is a slight improvement compared to the default simulation (Figure 10d). The decrease of energy excess at the Tchizalamou site (about 50 W m−2) is distributed equally to LE and H, resulting in a smaller decrease of both. LE is now simulated better, but the LAI-induced improvements for H are partly neutralized (Figures 11c and 11d). At Demokeya, the improvements of Rn only propagate to the sensible heat flux, since LE is limited by water scarcity rather than available energy. As a result, the simulation of H is significantly improved (Figure 12d).
4.1.3. Root Depth
 Finally the underestimation of LE in the dry season is addressed. During this period, the main controlling factor is transpiration of root zone water by vegetation.
 One way to increase model performance is to modify the lower model boundary of TERRA-ML by connecting it to a groundwater aquifer. This replaces the default downward-only free drainage with a two-way interaction including an upward diffusion flux. As a consequence the overall soil moisture content increases year-round, increasing the soil water availability in the root zone and hence preventing LE underestimations in the critical periods. This approach is discussed byGrasselt et al.  and our experiments confirm it with satisfying results (Figures not shown). However, since we lack reliable and continuous soil moisture observations for the Kissoko site, the physical realism of both default and modified hydraulic model configurations are uncertain.
 Another approach to improve the model performance is to enlarge the area of water uptake. Rooting depth in TERRA-ML is parameterized in order to simulate surface fluxes of moisture and energy globally in a realistic way, but is barely meant as a physically correct parameter. In the TERRA look-up tables [Doms et al., 2011] rooting depth of forest/woodland is set to 1 meter (Table 2) which is a significant underestimation compared to observed depths [Canadell et al., 1996; Laclau et al., 2001; Schenk and Jackson, 2002; Christina et al., 2011] but common practice in most present-day land surface schemes [Kleidon and Heimann, 1998]. For tropical evergreen forests, roots are especially important to extract soil water during drier periods in order to maintain a near-potential transpiration rate [Nepstad et al., 1994; Da Rocha et al., 2004]. Since vegetation can only extract soil water from the root zone, root depth is a limiting factor for transpiration and hence for LE.
 Both approaches quantitatively succeed in reducing the model biases. However, taking into account both the uncertainty regarding soil hydraulics and the certain underestimation of root depth, the second approach is preferred since it is physically better supported by observations.
 Root depth of trees is increased to soil depth (2.40 m) and consequently the roots can extract water from the entire soil column, allowing an enhanced transpiration in dry seasons. Simulations with modified root depth are indicated by yellow lines in Figures 10 and 13. In the Kissoko case, performance improved for both heat fluxes: the dry season LE increased with about 50 W m−2, largely removing the underestimations (Figure 10c). The sensible heat flux decreased accordingly (Figure 10d). In the Mongu case, the effect of root depth is limited to the first months of the dry season, after which LE falls back to the values from the default simulation (Figure 13c). The dry period lasts longer compared to Kissoko, which makes the dependency on stored soil water higher.
4.2. Evaluation on Hourly Timescale
 The combined effect of the modified input parameters on model performance (hereafter called “TERRAmod”) is shown in Figure 9 and Table 4. Significant improvements are noticed for the LE in Kissoko (correlation increase from 0.87 to 0.95 and RMSE decrease from 47.5 to 35.26) and Tchizalamou (RMSE decrease from 50.57 to 32.83), and for the H in Demokeya (RMSE decrease from 73.84 to 50.55). In other cases, the modified input parameters did not significantly alter the performance. It can be concluded that the overall effect is neutral to positive.
4.3. Parameterization of Hydraulic Conductivity
 At the Mongu site, the wet-season overestimation of LE by TERRA-ML is not affected by any of the input parameter modifications (Figure 13). Simulated soil hydraulics are examined in order to trace the cause of this pronounced overestimation. The availability of reliable surface soil moisture observations at this flux site allows for a validation of simulated moisture content, hence modifications to model hydraulics can be justified on a physical basis. Figure 14 shows a comparison of simulated and observed soil water content at a depth of 5 cm, i.e., superficial moisture content. The simulation with the improved input parameters (solid yellow line) shows an obvious overestimation during the wet season.
 Compared to the sandy soils of the other flux sites, the Mongu soil is very well drained, deep, and hydrologically highly conductive. It is located in the “Barotse Sands” from the Kalahari soil group and described by Wolski . Its characteristics were also confirmed by the flux tower managers of the Mongu site. This difference explains why in reality the water is drained downward much faster compared with the simulations, causing the overestimation of superficial soil water content and hence LE.
 The model biases can be reduced by using a different formulation for vertical soil water movements. The hydraulic conductivity (K) in TERRA-ML is by default based on Rijtema's approach, i.e., depending exponentially on soil moisture content [Rijtema, 1969]:
where K0 and K1 are the hydraulic conductivity parameters which depend on soil texture, θ is soil moisture content, θsat is saturated soil moisture content, and θadp is the air dryness point. As an experiment, an alternative linear formulation from Brooks and Corey  is used:
where Ksat is the saturated hydraulic conductivity and b is the pore size distribution index which depends on soil texture. As already found by Grasselt et al. , a linear parameterization causes an enhanced downward water flux through the soil column, resulting in a lower superficial soil moisture content (broken purple lines in Figure 14) and a higher base flow. The soil moisture improvement leads to a significant reduction of the LE and consequently H biases (broken purple lines in Figure 13).
 It is however important to stress that this alternative parameterization is a specific solution for a site-specific model bias. Using the same formulation of hydraulic conductivity at the other three flux sites results in general and significant underestimations of LE (Figures not shown). It is not considered as a beneficial modification of TERRA-ML but only used to explain the wet-season overestimation of LE at Mongu. Neither is it a modification of input parameters easy to implement for all grid cells in a coupled atmospheric model, but a site-specific modification of a model parameterization.
 In this study, two SVAT schemes were applied as standalone models at four tropical sites in Africa. The same meteorological forcing data was used for both models. First, the models were tested with their default input parameters. Their performance was assessed and compared by looking to the annual cycle on monthly timescale, and to overall statistics on hourly timescale. Second, three input parameters of TERRA-ML were improved and the effects on the energy balance of all flux sites was quantified.
 In their default configuration CLM performs better than TERRA-ML on both timescales at most flux sites. This is represented by lower RMSE values and higher correlation coefficients. Major model deficiencies in TERRA-ML are identified, being an overestimation of LE during the wet season (Tchizalamou), a consistent overestimation of H (Demokeya), and the failure to capture a sustained evapotranspiration during the dry season (Kissoko, Mongu).
 To address these model biases, to test the sensitivity to input parameters and to investigate the true potential performance of TERRA-ML, a series of modifications is applied on the input parameters LAI, surface albedo and root depth.
 Firstly, leaf area index in TERRA-ML is generally overestimated compared to observations. This is caused by the relatively simple method of calculating LAI from look-up tables. The replacement of default values with a satellite-based annual cycle only yields an improvement when the LAI difference (i.e., overestimation) is large enough, decreasing biases of LE.
 Secondly, substantial biases of simulated albedo (and therefore Rn) are identified. In the surface energy budget, Rn is the amount of available energy which is distributed over LE, H and G, hence Rnbiases propagate to one or more of the other budget terms. In TERRA-ML the soil albedo (depending on very few soil classes) as well as the plant albedo are hard-coded constants. Plant and soil albedo in TERRA-ML have been modified to better match the satellite-retrieved surface albedo, resulting in a better simulation ofRn and hence of LE and H. The soil albedo in CLM depends on spatially varying soil color (to be provided as input data set) and the plant albedo is related to vegetation phenology (LAI and SAI). LAI and SAI are spatially varying input data sets and based on satellite observations.
 Thirdly, in tropical evergreen forests a sufficient amount of soil water has to be available for root uptake during the drier months, when LE depends more on vegetation transpiration. While CLM is successful in doing so, TERRA-ML largely underestimates the dry-season LE. In order to solve the problem of root zone water scarcity, different approaches can be used: either ‘bringing the water to the roots’ or ‘bringing the roots to the water’, from which the latter is chosen as it is physically better supported by observations. The results indicate that a revision of vegetation parameters is needed for TERRA-ML. This is of course not self-evident since the model is somehow “tuned” to the existing look-up table values (Table 2) which are mainly established by extra-tropical evaluation studies. Plant cover (PC) for the grassland biome in TERRA-ML (Table 2) does not take into account the short dry season; another example of parameters which are mainly tuned for temperate regions.
 An unrealistic partitioning of energy by TERRA-ML is identified in the wet season at the Mongu site. The hydraulic parameters of the soil class “sand” in TERRA-ML are not representative for the hydraulically highly conductive “Barotse” soils, limiting the simulated downward water flux (compared to reality). This causes too high superficial soil water contents, explaining the overestimation of LE. The bias is reduced substantially by replacing the default exponential parameterization of hydraulic conductivity with a linear formulation. It is however a very site-specific model bias, and the alternative parameterization is not a general solution since it is not applicable for the other flux sites, where the hydraulic properties of the soil are better represented by the model. The model deficiency was not found in CLM, which uses a linear parameterization [Clapp and Hornberger, 1978] in combination with layer-dependent soil textures.
 Based on our findings, some general recommendations for SVAT scheme users and developers are given below. First of all, default input data sets are not always adequate and should be checked for their applicability in the simulated region. Some data sets are composed and calibrated in detail for a certain region (e.g., Europe), whereas in other biomes across the world, parameters may have different values and even have a larger impact (e.g., for tropical rain forests). Without proper region-specific input parameters, simulation performance drops. This was clearly demonstrated by modifying default root depth values in the tropical African evergreen forest, thereby resolving the dry-season underestimation of (evapo) transpiration and hence LE.
 Moreover, models show large differences in parameterization complexity. For instance, parameters such as albedo and LAI can be calculated from spatially varying and user-adjustable input data sets (e.g., in CLM, from soil color, plant phenology, etc.) or from a few hard-coded model constants which are supposed to be representative for all vegetation types and biomes (e.g., in TERRA-ML, with fixed plant albedo). In this study, these model constants were modified to observed values, resolving heat flux biases in TERRA-ML on local scale. In more complex SVAT models, spatially explicit input data is used (needed to assure adequate model performance). These complex SVAT models (e.g., CLM) are by default more versatile in its global application on different vegetation types and climates.
 Apart from these findings, also computing power and time should be considered. SVAT models with a lower degree of complexity (e.g., TERRA-ML) are much less computationally demanding which leads to faster calculation times in a coupled climate simulation.
 Field measurements in Kissoko and Tchizalamou were funded by the European Integrated Project Ultra Low CO2Steelmaking (ULCOS, Contract 515960), by the CARBOAFRICA project, and by the Observatoire de Recherche en Environnement F-ORE-T. Technical assistance was also provided by the CRDPI. Measurements in Demokeya were supported by the Swedish Science Council (2004–3888) and by the EU through the CARBOAFRICA project. Measurements in Mongu were supported by CARBOAFRICA and by institutional funding of the MPI for Biogeochemistry, Jena, Germany. For the simulations we used the infrastructure of the VSC Flemish Supercomputer Center funded by the Hercules Foundation and the EWI Department of the Flemish government. We thank the anonymous reviewers for their valuable comments.