A nonlinear mixed‐effects modeling approach for ecological data: Using temporal dynamics of vegetation moisture as an example
Abstract
- Increasingly, often ecologist collects data with nonlinear trends, heterogeneous variances, temporal correlation, and hierarchical structure. Nonlinear mixed‐effects models offer a flexible approach to such data, but the estimation and interpretation of these models present challenges, partly associated with the lack of worked examples in the ecological literature.
- We illustrate the nonlinear mixed‐effects modeling approach using temporal dynamics of vegetation moisture with field data from northwestern Patagonia. This is a Mediterranean‐type climate region where modeling temporal changes in live fuel moisture content are conceptually relevant (ecological theory) and have practical implications (fire management). We used this approach to answer whether moisture dynamics varies among functional groups and aridity conditions, and compared it with other simpler statistical models. The modeling process is set out “step‐by‐step”: We start translating the ideas about the system dynamics to a statistical model, which is made increasingly complex in order to include different sources of variability and correlation structures. We provide guidelines and R scripts (including a new self‐starting function) that make data analyses reproducible. We also explain how to extract the parameter estimates from the R output.
- Our modeling approach suggests moisture dynamic to vary between grasses and shrubs, and between grasses facing different aridity conditions. Compared to more classical models, the nonlinear mixed‐effects model showed greater goodness of fit and met statistical assumptions. While the mixed‐effects approach accounts for spatial nesting, temporal dependence, and variance heterogeneity; the nonlinear function allowed to model the seasonal pattern.
- Parameters of the nonlinear mixed‐effects model reflected relevant ecological processes. From an applied perspective, the model could forecast the time when fuel moisture becomes critical to fire occurrence. Due to the lack of worked examples for nonlinear mixed‐effects models in the literature, our modeling approach could be useful to diverse ecologists dealing with complex data.
1 INTRODUCTION
Classic statistical approaches (e.g., linear regression or ANOVA) have assumptions that often are not met by ecological data, such as when variances change with predictors or responses are nonlinear (Bolker et al., 2013; Zuur, Ieno, Walker, Saveliev, & Smith, 2009). Mechanistic or semimechanistic descriptions often benefit from nonlinear functions (Bolker, 2008) because their parameters have an ecologically meaningful interpretation (Miguez, Archontoulis, & Dokoohaki, 2017), helping to clarify system processes. Furthermore, ecological processes operate at multiple spatio‐temporal scales (Peters et al., 2008) producing data sets with hierarchical structures better handled by the use of random effects (Nakagawa & Schielzeth, 2013). Therefore, nonlinear mixed‐effects models can expand capabilities by including nonlinear regression and fixed and random effects (Lindstrom & Bates, 1990).
While nonlinear mixed‐effects models are not novel (Davidian & Giltinan, 2003), they still present several challenges to ecologists without formal training in statistics (Bolker et al., 2013). Some of these challenges arise from (a) the need to choose a suitable response function; there are many candidate functions and the variety can be overwhelming (Miguez et al., 2017); (b) patterns of correlation and variance usually occurs when experimental units (e.g., individuals, plots) are measured more than once (Davidian & Giltinan, 2003); (c) parameter estimation has no analytical solution and iterative methods must be applied (Bates & Watts, 2007) often leading to additional hurdles (e.g., provide reasonable starting values and model convergence; Bolker et al., 2013). While the last one represents a technical challenge, on the firsts two lies part of the answer to when or why to apply this complex modeling approach (Figure 1).

Regarding the first challenge, ecologists seek to match patterns to ecological theory or simply models to data (Richards, 2005). A large number of ecological process have nonlinear responses, and many deterministic functions have been proposed (a list of typical examples are found in Table 1). For instance, asymptotic patterns are commonly observed in ecology and could be described using rational functions (Bolker, 2008). One well‐known example comes from predator–prey dynamics (Kalinkat et al., 2013). Ecological theory predicts that per capita consumption rate of predators (y) varies with prey density (x) according to predator's capture rate and handling time (Rall et al., 2012). When capture rate (a) and handling time (b) are not supposed to vary with density, the pattern is called “type‐II functional response” (Holling, 1959) and mathematically formalized as y = ax/(1 + abx). In addition, it is possible to reparameterize response functions according to ecological questions (Bolker, 2008). For example, in the type‐II functional response as presented, the parameters to estimate are a, a measure of hunting efficiency or successful search, and b, which indicates the time used to kill, ingest, and digest a prey (Jeschke, Kopp, & Tollrian, 2002). However, we can rewrite a as 1/a′ and b as 1/a′b′ and re‐express the function as y = a′x/(b′ + x). Now, a′ represents the maximum per capita consumption rate reached when density is large (the asymptote of the function) and b′ is the density (x‐value) at the half‐maximum consumption rate. In fact, several strategies to address nonlinear patterns are widely used in ecology (e.g., transformations, polynomials, “splines”) but the main argument against those is that nonlinear models have meaningful parameters (other arguments are lack of parsimony and nonvalidity beyond range of fit; Pinheiro & Bates, 2000).
| Pattern | Function | Ecological context | ||
|---|---|---|---|---|
| J‐shaped |
|
Exponential | y = aebx |
Population ecology (population growths without resource constrains). Eco‐physiology (temperature responses). Epidemiology (outbreaks). |
| Power | y = axb | |||
| Saturating |
|
Michaelis–Menten | y = ax/(b + x) |
Population ecology (type‐II functional response in predator–prey dynamics). Community ecology (resource competition). Eco‐physiology (photosynthetic curves). Forest ecology (light availability in canopy). Production ecology (fisheries, fruit quality). Epidemiology (infection rates). |
| Monomolecular | y = a[1 − e−bx] | |||
| S‐shaped (sigmoidal) |
|
Logistic |
|
Life history (individual biological growths). Population ecology (population growths with resource constrains, type‐III functional response in predator–prey dynamic). Forest ecology (stand dynamics). |
| Gompertz |
|
|||
| Hump‐shaped (unimodal) |
|
Ricker | y = axebx |
Population ecology (capture rates varying with prey size in predator–prey dynamic). Community ecology (richness species varying with productivity or disturbance gradients). Eco‐physiology (optimums). Fire ecology (fire activity along global productivity gradient). |
| Beta |
|
|||
Note
- Terms in equations: y = response variable; x = explanatory variable; e = constant (the base of the natural logarithm); a, b, c = parameters. Parameter values must be in a certain range to match the pattern (e.g., power functions result J‐shaped curves when b > 1, but inverted J‐shaped if b < 0 and decreasing increments with 0 < b < 1). Function names and parameterization vary according to the context in which they are used (see Bolker, 2008).
Regarding the second challenge, when nonlinear ecological patterns are, as commonly occurs in ecology, observed from grouped data (e.g., observations spatially clustered, subjects measured more than once, individuals from the same family, species with phylogenetic relationships; Barnett, Koper, Dobson, Schmiegelow, & Manseau, 2010), mixed‐effects approaches allow correlations within‐group observations to be considered and modeling of heteroscedasticity (Davidian & Giltinan, 2003). For example, we might be interested in studying regional fruit production and designing an experiment where size of individual fruits was recorded over time (growth is usually sigmoidal), allowing for fruits to be nested in trees, trees in orchards, and orchards in regions. In such an experiment, a nonlinear mixed‐effects model would allow us to fit temporal curves to each fruit and to evaluate whether (some parameter of) the curves depend on fruit location, tree species, orchard management, or climate, all of these being incorporated in the model as predictors at different clustering levels (West, Welch, & Galeki, 2007). Indeed, one of the most intuitive applications of nonlinear mixed‐effects models is to describe temporal within‐individual responses and to identify factors determining variability among individual responses (Davidian & Giltinan, 2003). In short, nonlinear mixed‐effects models are nonlinear response functions allowing among‐groups random variation in (one or more) parameters which can be modeled from group‐level predictors (Bolker, 2008); these are convenient to apply with grouped data to describe a nonlinear ecological response, but their use entails more complexity than classical statistics.
Live vegetation moisture, termed as live fuel moisture content (LFMC) in the fire ecology context, is a typical ecological variable in which nonlinearity and correlation structures make it difficult to apply classical statistical methods. To burn, an ecosystem requires precipitation for plant biomass (fuel) to be produced, and dry weather conditions to make that biomass available for burning (Bradstock, 2010). LFMC is an ecological variable determining fuel biomass availability, thus influencing outbreak and spread of wildfires (Rossa, 2017). In other words, because removing the water from fuel requires energy (Jolly et al., 2012), higher moisture content means longer heating time to ignition and slower fire spread (Finney, Cohen, McAllister, & Jolly, 2013). Consequently, LFMC modeling becomes relevant in fire‐prone ecosystems, such as those of Mediterranean regions, where it is important to predict the vegetation moisture threshold at which fires are highly probable (Dennison & Moritz, 2009). Temporal changes in LFMC are determined by physiological and phenological factors associated with weather seasonality. This seasonality influences plant growth rates, water loss by transpiration, and changes in soil water availability (Nelson, 2001). Hence, in Mediterranean‐type climate regions (cold and humid winters, temperate and dry summers), such as northwestern Patagonia (Kottek, Grieser, Beck, Rudolf, & Rubel, 2006), seasonality causes plants to have a relatively high moisture during spring (when sprouting takes place) and then lower values during the autumn senescence (Keeley, Bond, Bradstock, Pausas, & Rundel, 2012). Therefore, LFMC is expected to reach a maximum during the growing season and steadily decrease through the dry season (when fires occur), until it stabilizes at a minimum; this is, naturally, a nonlinear response. In northwestern Patagonia, for example, vegetation growth season starts in early spring and ends in late summer/early autumn (Jobbágy, Sala, & Paruelo, 2002) overlapping part of the fire season, which starts in late spring/early summer (Oddi & Ghermandi, 2016). Since plants develop varied strategies to access water and regulate their water content status, LFMC modeling should consider how moisture seasonal variation differs among plant functional types (Castro, Tudela, & Sebastià, 2003). For example, in extra‐Andean Patagonia, grass and shrubs have different water‐use strategies (Sala, Golluscio, Lauenroth, & Soriano, 1989); shrubs obtain water from deeper soil layers (Golluscio & Oesterheld, 2007). Furthermore, phenological water‐use strategies within the same functional group can vary among coexisting species, as appears to occur with the shrubs Mullinum spinosum (Cav.) Pers. and Senecio filaginoides DC (Fernández, Nuñez, & Soriano, 1992), the first one with deeper root system (Fernandez & Paruelo, 1988), or along aridity gradients, in response to changes in water availability dynamics in soil (Golluscio & Oesterheld, 2007). Therefore, in this region, it is expected that parameters of the LFMC temporal curve depend on type of vegetation and aridity conditions. Lastly, because field sampling must consider LFMC within‐sampling location variability (Yebra et al., 2013), observations are commonly clustered in space (Desbois, Deshayes, & Beudoin, 1997). Hence, LFMC data obtained through field monitoring appear suitable to be analyzed using nonlinear mixed‐effects models.
Although nonlinear mixed‐effects approaches are useful in many areas (Davidian & Giltinan, 2003), including ecology and environmental sciences (Crecente‐Campo, Tomé, Soares, & Diéguez‐Aranda, 2010; Miguez, Villamil, Long, & Bollero, 2008), very few worked examples exists (Bolker et al., 2013). In order for statistical methods to be gradually applied by users, these must be demonstrated and illustrated with examples (Qian, Cuffney, Alameddine, McMahon, & Reckhow, 2010). Here, we illustrate the nonlinear mixed‐effects modeling approach using an ecological example involving the temporal dynamics of live vegetation moisture in northwest extra‐Andean Patagonia. From an ecological perspective, we apply a nonlinear mixed‐effects approach to model temporal changes in LFMC. In particular, we test if moisture content and drying pattern over the fire season differ (a) between grasses and shrubs, (b) between coexisting species, and (c) between sites with different aridity conditions. From a methodological point of view, we aimed to (a) describe a “step‐by‐step” statistical modeling process and (b) show that, compared to other linear and more classical approaches, nonlinear mixed‐effects models improve the description of ecological processes with seasonality such as temporal dynamics in LFMC. This “improvement” is assessed in terms of goodness of fit, model assumptions, and ecological meaning. We start with a nonlinear function linked to a simple statistical model, which is made increasingly complex to include the different sources of variability and correlation structures. While the modeling process is illustrated on LFMC, the framework is useful for other ecological variables with nonlinear patterns and correlation structures. To make this procedure easily reproducible, we provide the R codes used to perform the statistical analyses.
2 METHODS
2.1 Study area
Field data were gathered from northwestern Patagonia (east of Nahuel Huapi Lake, Río Negro, Argentina; Figure 2). The area is characterized by a semiarid climate with a Mediterranean‐type precipitation regime, and annual precipitation decreases in a steep 50‐km west–east gradient from 580 to 260 mm (San Ramón and INTA Pilcaniyeu weather stations). Along this climatic gradient, we established two sampling sites separated by 60 km (Figure 2) with different aridity and plant physiognomy. The western (W) site, the wettest, is a grass steppe dominated by the perennial grass Festuca pallescens (St. Yves) Parodi, and shrub cover is less than 5%. The eastern (E) site, the driest, is a shrub–grass steppe with 60% shrub cover where communities are codominated by Papostipa speciosa and by the shrubs M. spinosum and S. filaginoides (Godagnone & Bran, 2009).

2.2 Experimental design
We carried out the field sampling during 80 days (seven sampling dates distributed as evenly as possible) from 13 November 2013 to 10 February 2014. Due to the differences in the plant physiognomies of the sites, we measured LFMC in grasses at the W site and both grasses and shrubs (M. spinosum and S. filaginoides) in the E site. In both sites, we established three 500 × 500 m plots and average distance between plots was ≈2 km (Figure 1). All plots were near the road (National Road No. 23) (Figure 2) and on flat‐level terrain to avoid changes in LFMC caused by differences in topography. On each sampling date and plot, we randomly selected three spatial points (i.e., spatial locations within plots) for each leaf type (grasses and the two shrub species); we collected 80–100 g of live biomass from the nearest four individuals to these points (i.e., each observation came from a composite sample, Figure 2). Therefore, our total observations resulted in 252 measurements of LFMC (3 leaf types × 3 points × 3 plots × 7 sampling dates in the E site, plus 1 leaf type × 3 points × 7 sampling dates in the W site) from which five were lost (n = 247). This experimental design allowed us to compare plant functional types within the same site (grasses vs. shrubs in E site), sites within the same functional type (grasses in the W site vs. grasses in the E site), and species within the same functional type and site (M. spinosum vs. S. filaginoides in the E site).
We collected all samples between 12:00 and 16:00 hr local time. Immediately after collection, we packed the samples in individual hermetic plastic bags and transported them to the laboratory in a portable fridge. Once in the laboratory, we weighed the samples in a precision balance (0.01 g) to obtain their fresh weight (WF). Samples were oven‐dried at 80°C for 48 hr and reweighed to obtain their dry weight (WD). Finally, we calculated the LFMC (%) as: (WF − WD)/WD × 100.
2.3 Nonlinear mixed‐effects model
(1)
(2)
(3)

(4)






Of the multiples ways in which mixed‐effects models can be written, we have chosen that termed as “combining separate local regressions” (Gelman & Hill, 2007). We follow the Gelman and Hill's (2007) notation, who use subscript i to represent the smallest unit of observation, within‐plot observation (points) in our experimental design (i = 1, 2, …, 247); and j to indicate groups, plots in this case (j = 1, 2, …, 6). GE, SM, and SS are binary variables taking values 1 or 0, used to code leaf type (see Table 2 for interpretation of the associated terms) recorded for point i. With this notation, we try to establish a clear connection between mathematical expression and software output (Appendix S1). While this model considers observations (i) within plot j to be correlated (ϕ is termed as intraclass correlation, and estimated as a function of among‐groups and within‐groups variability; Aarts et al., 2014), residuals are assumed to be independent and normally distributed with homogeneous variances. Normal random effects (A0j, w0j, m0j, s0j) and independence between the within‐plot observations and random effects are also assumed.
| Model | Number of parameters | log‐Likelihood | AIC | ∆AIC |
|---|---|---|---|---|
| Nonlinear mixed‐effects (M1) | 15 | −950.8 | 1,931.7 | – |
| Linear mixed‐effects (M3) | 15 | −984.2 | 19,998.5 | 66.8 |
| Nonlinear fixed‐effects (M2) | 27 | −1,015.6 | 2,085.2 | 153.6 |
| Classical regression (M4) | 25 | −1,030.5 | 2,111.0 | 179.3 |
| Null (M5) | 2 | −1,423.9 | 2,851.7 | 920.1 |
Note
- AIC is a goodness of fit measure (likelihood or log‐likelihood) that penalizes for complexity (number of parameters).
(5)
(6)






;
;
; 
) as a function of leaf type:
(7)







;
;
; 
(8)
(9)







;
;
; 
Equation 9 refers to the more complex or global model, which includes variance modeling, temporal correlation, and fixed‐effects (leaf type) and random effects (plot) on all parameters of the response function. Nevertheless, not all of these components necessarily need to be in the model. If any of them is not important but included (the predictive capacity is not increased), the model will be overparameterized (Aho, Dewayne, & Peterson, 2014), which could cause convergence problems (Grueber, Nakagawa, Laws, & Jamieson, 2011). When convergence problems occur, reducing the model complexity is a possible solution (Bolker et al., 2009).
Therefore, we followed the Zuur's et al. (2009) protocol for fitting mixed‐effects models, first assessing the random structure and then the fixed effects. We compared models of different complexity level by using the Akaike information criterion (AIC), a goodness‐of‐fit measure (likelihood) that penalizes for complexity (number of parameters; see next section for more detailed discussion about AIC and multimodel inference). Although the model with the lowest AIC value can be chosen as the best one (Burnham & Anderson, 2002), in order to consider model uncertainty, differences in AIC should be large enough (Richards, 2005). We used delta AIC > 2, but other cutoffs could be chosen as rule of thumb (Harrison et al., 2018). To fit the model (see the R code), we first explored the variance–covariance structure (Barnett et al., 2010; West et al., 2007). Then, we evaluated if any of the four parameters (A, w, m, s) varied across the plots through random effects. For instance, to determine whether the maximum LFMC varied with plot we compared the model where A is random (A0j in Equation 9) with the model where A is unique for all the plots (A0). Later, we decided about the inclusion of the temporal correlation and variance functions assessing whether the data fit obtained from the model introduced in Equation 4 were improved by that from Equations 6 and 9. Lastly, we modeled the fixed effects examining what parameters of the response function varied with leaf type. For instance, to assess if leaf type influences the maximum LFMC we compared the model in which A depends on leaf type (A = A0 + A1GE + A2SM + A3SS) with the model where A is unique (i.e., general to all leaf types). When the effect of leaf type on any parameter was found important (i.e., delta AIC > 2), we assessed differences among its levels (our ecological question).
2.4 Alternative models
We fitted four alternative models, which were compared to the nonlinear mixed‐effects model. The first one was a (logistic‐type) nonlinear fixed‐effects model. In this case, time, leaf type, and plot are treated as fixed effects, and normality and independence among data are assumed (M2). The second alternative (M3) was a linear mixed‐effects model with time and leaf type (and its interaction) as fixed effects and plot as random effect (spatial nesting). This model included an ARMA temporal structure and the same variance function as the one used in the nonlinear mixed‐effects model. The third alternative model (M4) was a classical regression (i.e., assuming that the relation with time is linear, residuals follow a normal distribution and data are independent in space and time and show homogeneous variances). We also fitted a null model (i.e., without predictors, M5) which was considered as the baseline in the comparisons. In short, we compared five models: nonlinear mixed‐effects (M1), nonlinear fixed‐effects (M2), linear mixed‐effects (M3), classical regression (M4), and null (M5). M1 and M2 share the deterministic response function (logistic‐type) but differ in the stochastic structure; similarly, M3 and M4 share a linear deterministic response function but differ in the stochastic structure. M1 and M3 share the random structure (mixed‐effects) and differ in the type of temporal relationship assumed (nonlinear vs. linear), similarly, M2 and M4 share the random structure and differ in temporal relationship assumptions. Before comparison, each alternative model was fitted according to parsimony, just as the nonlinear mixed‐effects model (M1).
Model comparison was carried out under a multimodel inference framework (Burnham & Anderson, 2002) using the AIC (Burnham, Anderson, & Huyvaert, 2011). This inference framework is especially suitable for selecting among non‐nested models (Burnham & Anderson, 2004); the fitting process of each model also involved model comparison but in this case, they were nested (see previous section). Our modeling approach was carried out using AIC as an index of parsimony but other related statistics exist (modifications of AIC, such as AICc, and others, such as BIC or DIC). All of these criterions share a similar goal, that is, to find a balance between goodness of fit and complexity. They have advantages and limitations and should be chosen according to the context and needs (statistical paradigm, assumptions, data structure; Barnett et al., 2010). For instance, Bayesian approaches could provide advantages when fitting complex models with few data points by incorporating prior distribution for parameters (Davidian & Giltinan, 1995; Gelman et al., 2013). Our approach, however, is useful beyond the information criteria chosen to select models (for further discussions, see Aho et al., 2014; Murtaugh, 2009; Richards, 2005; Spiegelhalter, Best, Carlin, & Linde, 2014; Yang, 2005).
The models were fitted using the nlme(), lme(), and gls() functions of the nlme package (Pinheiro, Bates, DebRoy, Sarkar, & Core Team, 2016) in R 3.3.5 (R Core Team, 2017). When the effects of leaf type were found important, we used the emmeans() function of the emmeans package (Lenth, 2018). Pearson residuals were visually assessed (residuals vs. fitted values plot, residuals vs. predictors plot, and normal Q‐Q plot) for checking the models' assumptions. The analysis is available in the Supplementary Material.
3 RESULTS AND DISCUSSION
3.1 Model comparison
We found stronger support (lower AIC) for the nonlinear mixed‐effects model with these datasets (M1, Table 2). Both the residual analysis and the visual analysis of the fitted curves confirm a temporal pattern of LFMC during the fire season to be well‐described by a declining logistic‐type function. Indeed, the (nonlinear) deterministic component of this model was enough to capture all the temporal changes suggesting it is not necessary to include the temporal correlation structure. According to the residual pattern (Figure 4), assumptions are reasonable under the linear mixed‐effects model (M3). In this model, however, the LFMC nonlinear temporal pattern was captured by the residuals' autocorrelation (ARMA). On the other hand, neither the nonlinear fixed‐effects model (M2) nor the classical regression (M4) was appropriate to model temporal changes in LFMC (the residual analyses show violation to the assumptions of these models, see Figure 4).

The fact that both linear and nonlinear mixed‐effects models were suitable highlights the importance for variables such as LFMC to be addressed from a mixed‐effects modeling approach. Nonetheless, the nonlinear strategy was better than the linear one. While the general mixed‐effects approach accounts for spatial nesting, temporal dependence, and heterogeneity among leaf types, the nonlinear response function adds the capability to model seasonal patterns. In this regard, the logistic‐type model answers questions such as “what is the minimum moisture content of a species and when is it reached?” and enables estimation of the instantaneous drying speed (in a linear model it is only possible to know the average drying speed; the first derivative is constant; Paine et al., 2012). Hence, the nonlinear temporal relationship is not only underpinned by data but is also conceptually more relevant. In fact, nonlinear approaches allow statistical models based on physical, biological or ecological ideas (Jonsson et al., 2014).
3.2 Interpreting the nonlinear mixed‐effects model
Nonlinear functions provide an interesting approach to understanding the temporal dynamics of ecological variables (Pascual & Ellner, 2000). The proposed logistic‐type curve described LFMC temporal changes (Figures 4 and 5) using four parameters (A, w, m, s) varying with leaf type as a fixed effect. LFMC of both shrubs and grasses decreased from mid‐spring to summer, tracking the temporal trend of the Mediterranean‐type precipitation regime. The same overall sigmoidal pattern is observed for all of the leaf types, although considerable variation among them exists (Figure 5). In particular, our modeling effort suggests that leaf types differed in their maximum (A) and minimum LFMC (w) (Table 3). In Patagonian steppes, soil moisture increases with depth (Sala et al., 1989) and water from deeper soil layers is available for longer periods than shallow water since it is less affected by evaporative demand (Ferrante, Oliva, & Fernández, 2014). Because shrubs can reach water from deeper soil layers (Golluscio & Oesterheld, 2007), higher LFMC in shrubs than grasses should be expected. Accordingly, shrubs moisture content at the beginning of the fire season (
) was, on average, four times higher than for grasses and the stabilization value (
) was three times higher (Figure 5, Table S4). This implies different drying speed (slopes) between the functional groups, as inferred from the derivatives of their temporal curves (Figure 5).

| Parameter | Estimate | Meaning |
|---|---|---|
|
54.3% [42.5:66.0] | Maximum LFMC ( ) of grasses in the W site. A varies with plot and therefore a hyperparameter is estimated.
|
| A 1 | 30.9% [13.5:48.3] | Difference between of grasses in the E site and of grasses in W site.
|
| A 2 | 223.4% [191.9:254.6] | Difference between of M. spinosum and of grasses in the W site.
|
| A 3 | 240.3% [207.1:273.5] | Difference between of S. filaginoides and of grasses in the W site.
|
| w 0 | 29.1% [25.7:32.4] | Minimum LFMC ( ) of grasses in the W site.
|
| w 1 | −20.7% [−25.8: −15.6] | Difference between of grasses in the E site and of grasses in the W site.
|
| w 2 | 31.7% [16.4:47.0] | Difference between of M. spinosum and of grasses in the W site.
|
| w 3 | 26.7% [11.1:42.9] | Difference between of S. filaginoides and of grasses in the W site.
|
| m | 30.9 days [26.9:35.0] | Day when the LFMC half‐maximum occurs ( ) in the study area (data suggest one general value for all the leaf types).
|
| s | −16.1 [−20.8: −11.5] | Parameter controlling speed of change ( ) of the LFMC (data suggest one general value for all the leaf types, the negative value indicates vegetation to be drying during the fire season).
|
|
7.1% [6.0:8.6] | LFMC variability within‐plots (residual standard error) of grass ( ) in Site W at the beginning the fire season.
|
| δ 1 | 0.9 [0.7:1.2] | Ratio between the within‐plots standard error of the LFMC of grasses in Site E ( ) and grasses in W site ( )
|
| δ 2 | 3.2 [2.4:4.0] | Ratio between the within‐plots standard error of the initial LFMC M. spinosum ( ) and grasses in W site ( )
|
| δ 3 | 3.0 [2.4:3.9] | Ratio between the within‐plots standard error of the initial LFMC S. filaginoides ( ) and grasses in W site ( )
|
|
9.4% [3.9:15.8] | Variability among plots (standard deviation) in the maximum LFMC (random effects on A). |
Grass LFMC dynamics differed between sites (Figure 5). Specifically, grasses showed different saturation moisture and minimum moisture content (A1 ≠ 0, and w1 ≠ 0, respectively; see Table 3 and Appendix S1 to interpret both parameters). These different responses could be caused by environmental differences between sites and/or functional differences between grass species. At the beginning of the study (mid‐spring, when water availability begins to decrease in Patagonia; Sala et al., 1989), moisture content was higher in grasses from the E site (
= 85%) than that from the W site (
= 54%). The 95% confidence interval for such difference (31%) in the maximum LFMC between the grasses of the both sites (A1) spanned from 13% to 48% (Table 3), reflecting the degree of uncertainty in the true value of the point estimate. While the W site is dominated by F. pallescens, the dominant grass in the E site is P. speciosa. Both species have xerophytic foliar traits associated to resistance to water stress (Latour, 1979), but P. speciosa has more convoluted blades and stomatal crypts with higher trichome density (L. Ghermandi, data not published) and thus would prevent water loss more efficiently. In addition, in arid and semiarid areas, shrubs act as thermal buffers, increasing water availability (Villagra et al., 2011). Hence, shrub presence could benefit superficial soil water availability in the E site (shrubs are not present in the W site), increasing grass LFMC. These factors could allow grasses at the E site to maintain relatively high moisture during the initial phase of water stress (i.e., higher A; Figure 5). However, as the dry period (i.e., the fire season) progresses, greater aridity at the E site could overcome the initial advantages. The model suggests higher drying speed in site E grasses, mainly in the middle of the fire season (Figure 5), causing late December LFMC to become lower than that of grass growing in the W site (i.e., lower w; Figure 5). The effect of shrubs on the grasses LFMC dynamics, and thus on fire probability, could be other of the ecological interactions commonly observed in Patagonia between these two functional groups (Cipriotti, Aguiar, Wiegand, & Paruelo, 2014; Gonzalez & Ghermandi, 2019). Although both shrubs species have different root systems (Fernandez & Paruelo, 1988), their LFMC dynamics appear similar (
= 295%,
= 56% in S. filaginoides, and
= 278%,
= 61% in M. spinosum) (Figure 5). Again, it is worthy to recognize uncertainty around estimates (A2 and A3 in this case) and, hence, given the confidence intervals (Table 3).
In contrast to that observed in A and w, the steepness of the curves (s) was similar for all the leaf types and the time when the drying was highest (m) occurred simultaneously (31 days since beginning of the experiment). In other words, it is reasonable to assume s1, s2, s3, m1, m2, and m3 (Equation 9) to be zero. These results suggest absolute levels of moisture content to be leaf‐type‐specific while the drying rate could be a functional trait operating at ecosystem or plant community level. In ecological terms, selecting a simpler model implies that we are treating the different leaf types as behaving as a group with similar drying responses.
Mixed‐effects models allow us to understand and predict ecological variables at different hierarchies (Qian et al., 2010). In our example, the proposed model considered LFMC temporal curves varying with plot as a random effect (Figure 5); the results indicate that the random effect of plot was only important for A (i.e.,
). The later, along with the fact that plots had similar minimum LFMC (i.e., data suggested w not to vary with plot), could have implications for the behavior of fires occurring at different times along the fire season. Fires occurring at the beginning of the season (higher LFMC variability) should be more heterogeneous and less severe than at the end, when LFMC is lower and its spatial pattern is less variable. Nonetheless, because vegetation water status responds to rainfall variability, particularly in arid and semiarid regions such as extra‐Andean Patagonia (Golluscio, Sigal Escalada, & Pérez, 2009), LFMC dynamics models should incorporate interannual variability in precipitation as an explanatory variable. While our sampling period covered only one fire season, the proposed model allows adding precipitation (or other climatic variable) as a fixed effect (A, w, m, s = f[precedent precipitation]) or via a random effect (incorporating year (k) as an additional hierarchy: A0k, w0k, m0k, s0k) (Bolker, 2015). Both strategies would represent different conceptual models. If precipitation is incorporated as a fixed effect (i.e., a covariate), the model would be rather mechanistic (we would model the effect of water availability). If precipitation gets into the model as a random effect of year, we would estimate among‐years variability due to climatic differences. In addition, it would be possible to incorporate a plot‐level predictor (e.g., productivity) to model spatial variability in parameters at this level (
= f[plot productivity]) or even to add other spatial hierarchies (e.g., site) and to quantify spatial variability at greater scales. The unexplained variance could be reduced because of adding a plot‐level predictor, allowing more precise estimation of fixed effects (Schielzeth & Nakagawa, 2013).
Variance heterogeneity is expected in many ecological variables (Benedetti‐Cecchi, 2003). Within‐plot LFMC variability was three times higher in shrubs than in grasses but was similar between grasses from the two sites (
= 0.97) and between the two shrub species (
= 2.79;
= 3.17). Certainly, the model could be further simplified by applying a two‐level variance function (grasses/shrubs, that is,
as a function of plant functional type or growth form). The observed heteroscedasticity between growth forms could respond to differences in LFMC (higher values in shrubs), as commonly the variances tend to increase with the mean of the response variable. However, it could be also related to environmental conditions such as more homogeneous soil water availability for grasses than for shrubs (Golluscio & Oesterheld, 2007). In fact, variation in water availability is minimal in superficial soil layers (where grasses obtain the water) and increases with depth (Bucci, Scholz, Goldstein, Meinzer, & Arce, 2009). In addition, variance components of a LFMC mixed‐effects model could contain relevant information for planning field sampling linked to vegetation moisture monitoring from remote sensing. For instance, plot‐level information should be prioritized if random‐effect variance was significantly larger than residual variance (Schielzeth & Nakagawa, 2013). Here, random‐effect variance (among‐plot variability) was quantified from the standard deviation of A (
= 9%), which was similar to intraplot standard deviation of grasses from the W site (
= 7%). This type of information is critical for developing sampling protocols that improve estimates of plant moisture content (Yebra et al., 2013) and other seasonal variables (Watson, Restrepo‐Coupe, & Huete, 2019) from satellite images.
4 CONCLUSIONS
Our work covers a poorly addressed topic in ecology: illustrate the statistical modeling process using a nonlinear mixed‐effects framework. Similar to many other ecological variables, time series of vegetation moisture do not fit into classical statistical methods. We applied a nonlinear approach to model vegetation moisture dynamics proposing a logistic‐type function based on ideas about the dynamics of the system. Our model had greater support than alternative (and less complex) models. Parameter interpretation can be linked to vegetation features and environmental conditions showing how nonlinear mixed‐effects models could be used to advance ecological theory and practice. For instance, we addressed ecological questions about LFMC dynamic of grasses and shrubs under different aridity conditions, which could have applications in fire management. In this respect, we encourage researchers to propose statistical models based on conceptual ideas rather than adjusting data to standard models that many times involve data transformation to meet model assumptions. Due to the lack of worked examples in the literature, our approach can be useful to researchers addressing different ecological problems.
ACKNOWLEDGMENTS
This work was supported by grants from CONICET (PIP 112 201101 01091) and Fundación Bunge y Born. We thank INTA Bariloche and San Ramón ranch for collaboration with this study. We are especially grateful to Charlotte Reemts for English revision. We also appreciate the valuable work of the anonymous reviewers who significantly improved the quality of our manuscript.
CONFLICT OF INTEREST
None declared.
AUTHORS' CONTRIBUTIONS
FJO and LG conceived the ideas and designed the sampling design; FJO collected the data; FJO, FEM, and LOB analyzed the data; FEM developed the selfStart function; FJO and FEM wrote the R scripts; FJO and LAG led writing of the manuscript; and all authors contributed critically to drafts.
Open Research
DATA AVAILABILITY STATEMENT
We provided R codes and data (“data_lfmc.txt”) in Supporting Information. The codes include a self‐starting function (“s1.R”) along with the fit of the nonlinear mixed‐effects model, the fit of the alternative models, and model comparison (“s2.R”).
REFERENCES
Citing Literature
Number of times cited according to CrossRef: 2
- Jeremy J. Pritt, Richard D. Zweifel, Stephen M. Tyszko, Joseph D. Conroy, Variation in Reproductive Investment among Ohio Reservoir Largemouth Bass Populations, Transactions of the American Fisheries Society, 10.1002/tafs.10254, 149, 5, (552-564), (2020).
- Sam Jones, Testing the Technology of Human Capital Production: A General‐to‐Restricted Framework, Oxford Bulletin of Economics and Statistics, 10.1111/obes.12374, 0, 0, (2020).



) of grasses in the W site. A varies with plot and therefore a hyperparameter is estimated.
of grasses in the E site and
of grasses in W site.
of M. spinosum and
of grasses in the W site.
of S. filaginoides and
of grasses in the W site.
) of grasses in the W site.
of grasses in the E site and
of grasses in the W site.
of M. spinosum and
of grasses in the W site.
of S. filaginoides and
of grasses in the W site.
) in the study area (data suggest one general value for all the leaf types).
) of the LFMC (data suggest one general value for all the leaf types, the negative value indicates vegetation to be drying during the fire season).
) in Site W at the beginning the fire season.
) and grasses in W site (
)
) and grasses in W site (
)
) and grasses in W site (
)

