The channel head represents an important transition point from hillslope to fluvial processes. There is a nonlinear threshold transition across the channel head with sediment transport much larger in channels than on hillslopes. Deterministic specific catchment area, a, thresholds for channel initiation, sometimes dependent on slope, S, in the form of aSα ≥ C, have been suggested. In this paper the channel initiation problem is viewed probabilistically with a spatially variable probability of channel initiation that depends on slope, specific catchment area, and the probability distributions of median grain size, surface roughness, and excess rainfall rate. The channel initiation threshold C is cast as a random variable to characterize the variability of aSα at channel heads. Using field measurements from the Idaho Batholith, we show that median grain size measurements at each channel head explain a significant part of the observed variability of aSα. We then characterize the variability of model inputs (median grain size, roughness, and excess rainfall) using probability distributions and show that the probability distribution of area-slope threshold derived from these inputs matches the probability distribution of area-slope thresholds measured at field channel head locations. A gamma probability distribution provides a reasonable match to the distributions of area-slope threshold measured and modeled at channel heads in our study area and in other published channel head data.
 Laboratory, field, and theoretical studies of sediment transport mechanics have concluded that fluvial transport of sediment grains does not occur until a critical shear stress is exceeded [Shields, 1936; Yalin and Karahan, 1979]. Horton  proposed that an erosion threshold controls the location of channel heads. He suggested that the critical distance below the topographic divide required for sheet flow erosion is the same as that required for channel incision. According to Horton's theory, channels as erosion features may expand rapidly upslope in response to changed climate and land use conditions and can even form during individual storm events. Field experiments under rainfall simulation demonstrated that sheet flow is inherently unstable and can easily separate into streams, incise channels, and integrate into a network [Dunne and Aubry, 1986]. However, such a tendency can be reversed by raindrop impact, soil creep and other diffusive processes [Dunne and Aubry, 1986]. This has also been shown theoretically by Smith and Bretherton .
 On the basis of field evidence and theory, Montgomery and Dietrich [1988, 1989, 1994], Dietrich et al. , and Montgomery [1994, 1999] discussed the existence of a slope dependent contributing area threshold required to support a channel head. This threshold is similar to Horton's critical distance description for channelization but accounts for channel initiation by saturation overland flow and pore-pressure induced landsliding. Montgomery and Dietrich  and Dietrich et al.  showed that data from channel heads observed in their Tennessee valley field site fit an inverse relationship between the specific catchment area (upslope contributing area per unit contour width) “a” and local slope “S” of the form,
where α is an exponent that varies between 1 and 2 and C is a constant. They found that with α = 2, all the channel heads observed are captured between two topographic threshold lines with C values of 25 m and 200 m. This reveals a factor of eight variation in the contributing area sizes required for channel initiation for a given slope in the field. Montgomery and Dietrich [1992, p. 828] explained this observation as follows: “This scatter probably arises from both spatial and temporal variation in the hydrologic and erosional processes governing channel initiation and should introduce considerable variation into channel and valley development, thus contributing a random aspect to the appearance of many landscapes.” The model was thus generally successful at delineating slope-area bounds for channel head locations but acknowledged scatter due to spatial variations in hydrologic and erosional processes [Dietrich et al., 1993; Prosser and Dietrich, 1995; Prosser and Abernethy, 1996].
 The lack of a distinct topographic threshold at channel heads in the form suggested in the literature was also reported in case of intense gullying due to land use pattern changes [Prosser and Soufi, 1998; Desmet and Govers, 1997]. Desmet et al.  experimented with different area exponents in equation (1) to see how they could best match grid cells observed to be gullied in the field to grid cells predicted to be gullied, while setting the threshold C to keep the fraction of area mapped as gullies roughly constant. Their best results classified 70 to 80% of grid cells correctly, although this success does depend on their threshold C and does not consider the number of grid cells in excess of the threshold not considered to be gullied.
 Erosion is highly nonlinear with threshold functionality or dependence upon the occurrence of channelization. A model that uses a single channel initiation threshold based on average or central tendency parameters will predict significant erosion only in locations where channelization is predicted on a long-term basis. The work cited above has indicated the presence of variability in the channel initiation threshold and the need for a model that explicitly recognizes this variability. Here we take the logical next step of formalizing the description of this variability by interpreting the threshold C as a random variable, with probability distribution derived physically from the random variability of quantities involved in the erosion process. This description of channel initiation incorporates higher frequency temporary excursions of channelization into terrain that is on the average not channeled over longer timescales. Estimating sediment transport rates from hillslopes is important during extreme rainfall events, when ephemeral rills and gullies are formed. In this study the question of how to parameterize the variability of relevant hydraulic and hydrologic hillslope properties used deterministically in the previous models has been studied and a probabilistic approach to channel initiation has been developed. Our probabilistic model, recognizing the uncertainty and spatial as well as temporal variability in its parameters, predicts a contribution to erosion (represented probabilistically) due to channelization even at locations that do not meet the central tendency channelization threshold. It is our premise in this paper that this new approach provides further insights into the role of uncertainty in the prediction of erosion for soil management purposes.
 In what follows we first develop a theory of channel initiation that expresses the probability of channel initiation in terms of the randomness of channel initiation threshold C derived from primary random inputs of sediment size, additional roughness (roughness due to vegetation and obstructions) and excess precipitation rate and show its contribution to probabilistic representation of erosion and sediment transport. This theory assumes the initiation of channels where shear stress exceeds a critical value [Dietrich and Dunne, 1993] and develops the probability distribution for this occurrence. The theory can be used to map the probability of channel initiation both for a single channel forming runoff event and integrating over a range of events in time as long as appropriate probability distributions are used to characterize the model inputs. The single event approach, which emphasizes spatial variability, is used in this paper to test the concept through comparison to observed gully initiation from a single storm. The integrated approach that incorporates both spatial and temporal variability is most appropriate for practical management applications, but is less easily tested directly, so we suggest justification of the integrated approach based upon the results from single event tests.
 We then describe our field area where the channel heads were mapped and median sediment size, d50, was measured at each channel head. We test the theoretical relationship between d50 and channel initiation threshold. Finally we select reasonable probability distributions for the other random inputs to C and compare the probability distributions of derived C to the probability distributions of observed aSα at channel heads. We show that the distribution of derived C is well approximated by a gamma distribution, and that a similar gamma distribution matches the distribution of area slope observations in other published channel head data.
2. Probabilistic Approach for Channel Initiation
 The total shear stress associated with overland flow is
where, ρw is the density of water, g gravitational acceleration, y is the flow depth, and S is the sine of the slope angle. Effective shear stress acting on the bare soil surface is a portion of τt due to shear stress partitioning between vegetation, soil and other roughness elements such as obstructions and surface irregularities [Foster et al., 1989]. The effective shear stress, τf acting on the bare soil surface can be approximated by [Tiscareno-Lopez et al., 1994]
where, nb is Manning's roughness coefficient for bare soil and na is additional roughness due to vegetation, obstructions and surface irregularities [Arcement and Schneider, 1984]. Bare soil roughness can be explicitly related to median sediment diameter, d50. Many empirical relationships presented for nb have a general form [Yen, 1992]
where k is a constant and p = 1/6. We used k = 0.0474, the value proposed by Strickler for sand [see Yen, 1992]. In this relationship d50 is expressed in m. This equation has been used for overland flow [Julien and Simons, 1985]. With the d50 values we observed in the field this relationship gives values for nb in the range 0.01 to 0.03 consistent with the range of nb values for sand and gravel suggested for overland flow routing by Woolhiser et al.  and Engman et al. .
 Using laboratory and field data and some data from the literature Rouhipour et al.  showed that Manning's equation gives more accurate estimates of overland flow velocity than the Darcy-Weisbach and Chezy equations. Assuming uniform overland flow, the depth y in equations (2) from Manning's equation is
where, qo is the overland flow per unit contour width. Substituting equation (5) into equations (2) and (3), the effective shear stress can be written as
where, m1 = 0.6, n1 = 0.7 and β1 = ρwgnb1.5(nb + na)−0.9. If we assume that overland flow incises a channel when τf exceeds a critical channel initiation shear stress threshold,τc, then a channelization threshold would be
 We consider two models for runoff generation to estimate qo in equations (6) and (7). The first model assumes that qo is proportional to the specific catchment area, a, as
where r is the net water input rate (rainfall or snowmelt minus evaporation and infiltration). This we term “infiltration excess” because r is the precipitation in excess of infiltration independent of topography. Substituting equation (8) into equation (7) and rearranging gives
With the positive value for α, equation (9) predicts a negative relationship between slope and drainage area for constant C at channel heads.
 The second model for runoff generation assumes a limited lateral transport capacity in the soil profile equal to slope times transmissivity, TS. Overland flow is assumed to occur when this capacity is exceeded and is equal to the surface water input in excess of the lateral transport capacity;
is the specific catchment area required for saturation under steady state conditions and T is the transmissivity of soil profile. This is a saturation excess model for runoff generation similar to the topography-based model of catchment hydrology, TOPMODEL [Beven and Kirkby, 1979]. Runoff occurs only for locations where the wetness index a/S exceeds a threshold, here equal to T/r.
 Using the relationships in equations (7) and (11), the threshold for channel initiation for saturation excess overland flow can be written as
 Our model assumes channel initiation when the shear stress required to initiate sediment transport (incipient motion or detachment) from the bare soil surface is exceeded. This, through equation (10) gives the channelization threshold, C. The critical shear stress, τc, that goes into equation (10) is estimated from Shields'  equation as
where, τ* is the dimensionless critical shear stress for incipient motion or detachment of particles with the median grain size, d50, and ρs is the sediment density. For rough turbulent flows an average value of τ* ≅ 0.045–0.046 [Gessler, 1971; Miller et al., 1977; Yalin and Karahan, 1979]; τ* may also be obtained from the Shields diagram using the shear Reynolds number,
where ν is the kinematic viscosity and equation (2) has been used to obtain the right-hand expression. When τ* is evaluated using this approach it depends on r and topographic variables a and S. However, this dependence is weak. We simulated a range of a, S, and r values representative of the terrain and effective rainfall in our field site and found that τ* ranged narrowly between 0.04 and 0.046, so selected τ* = 0.043 for the remainder of this work. This has the advantage of keeping the channel initiation threshold C independent of topographic variables making for more meaningful comparisons with aSα at channel heads. There may in fact be a wider range in τ* values for rough turbulent flows [Buffington and Montgomery, 1997] that will likely contribute to the variability of channel heads observed in the field. However, τ* = 0.043 is consistent with the τ* values reported by [Buffington and Montgomery, 1997] for coarse sand and fine gravel observed in our field areas.
 The left-hand sides of equations (9) and (12) include topographic variables which can be derived from digital elevation models (DEMs) using algorithms to evaluate local slopes and specific catchment areas such as from Tarboton . The C parameter on the right hand side of (9) and (12) absorbs all the other hydrologic and geomorphic parameters. Obtaining proper values for C may require field studies and experiments as well as remote sensing analysis. However, even under a fixed land use condition or geological setting the parameters absorbed by C may show significant variability in space and time, particularly in the case of r. To account for this we assume spatially homogenous probability distributions for d50, na, and r, which are used to calculate C in equation (10). To simplify matters we ignore variability of r within the area contributing to each point and thus assume that equations (8) and (11) hold for each specific realization of random r. This assumption lets us characterize both spatial and temporal variations of the area-slope threshold C. The scheme to derive a probability distribution for C and to calculate probability of channel initiation over the terrain is shown in Figure 1. Figure 1 depicts the terrain inputs S and a, which are combined to aSα and random inputs d50, r, and na. We selected d50 as a random variable, because it can be used to estimate both nb and τc from equations (4) and (13), respectively, and it can be measured in the field. Once nb is estimated from d50, then possible ranges of additional Manning's roughness, na can be obtained from tables of total Manning's roughness (nb + na) coefficients for overland flow based on land use conditions [e.g., Engman, 1986; Woolhiser et al., 1990]. The quantities τc, nb, na, and r are combined through equation (10) to obtain C. This figure amounts to a Bayes net for estimating the probability distribution of C given uncertainty in the inputs d50, r, and na, characterized by probability distributions. For the infiltration excess overland flow model, the spatial probability of channel initiation (PCI) of a given location can be described by the probability of channel initiation threshold C being less than or equal to aSα calculated from the terrain, formalized as
where fC(C) is the probability density function of C. Evaluating the probability of channel initiation for saturation overland flow is more complex than that of infiltration excess because it involves the parameter as T/r, which should also vary randomly because of the randomness in r, and perhaps also T. First, we write equation (12) in the form of equation (9) as
where B = Sα+1T/r. Here we have the addition of two related random numbers. Let C′ = B + C, then the probability density function for C′, fC′(C′), can be evaluated at each location and integrated to give the probability of channel initiation due to saturation excess overland flow
Barling et al.  indicated that the steady state assumption originating in TOPMODEL and expressed here in equations (8) and (11) has some limitations. They calculated specific catchment area as a function of rainfall duration (partial contributing area). This idea may be implemented by using a partial contributing area in equations (15) and (17). In general we expect that the partial contributing area should be a function of water input duration, D, and so would use a(D)Sα as the upper limit in the integral of equations (15) and (17).
 We have not derived functional forms for the probability distributions of PCI given by equations (15) and (17). Instead, we use Monte Carlo simulation to generate samples of C for the numerical evaluation of PCI. We show numerically that the results from this Monte Carlo simulation can be approximated using a gamma distribution.
Equations (15) and (17) express the probability of channel initiation at each point conditional on probability distributions of the inputs, namely, sediment size, additional roughness, and runoff rate. The sediment size and additional roughness are spatial probability distributions. Only one realization of these will actually exist in any given landscape. However, the specific spatial pattern of sediment size and additional roughness is practically unobservable so the probability distributions quantify the uncertainty in knowing this information. Additional roughness, in part due to vegetation may also be affected by land management activities. Effective runoff on the other hand is both spatially and temporally variable and for practical applications should be quantified using a probability distribution for representative extreme events over a design period. The PCI concept can then, when coupled with local detachment (erosion) and the sediment transport models, and integrated spatially express the aggregate expected value of erosion. This integrated estimate may be non zero as long as the PCI is greater than zero somewhere in the area, even though a threshold erosion model may have predicted no erosion due to channel incision because the single average or central tendency erosion threshold was not exceeded.
3. Field Study
 The above-described methodology has been applied in the Idaho Batholith and its ability to characterize the probability distributions of observed C at channel head locations has been tested. This area consists of an extensive mass of granitic rock (16,000 miles2) that covers a large portion of Idaho and some parts of Montana. It is almost entirely mountainous and forested. Extremely erodible coarse textured soils are found on steep gradients that often exceed 70% [Megahan, 1974]. Average annual precipitation is approximately 1000 mm. Localized high-intensity rainstorms of short duration are common during summer. At other times of the year low-intensity storms with longer durations occur, often in conjunction with snowmelt. Following soil disturbance, the combination of steep topography, high soil erodibility, and high climate stress often results in accelerated surface erosion [Megahan and Kidd, 1972]. The specific study areas selected are Trapper and Robert E. Lee Creeks within the North Fork of the Boise River in southwestern Idaho (see Figure 2). Trapper Creek was intensely burned by a wildfire in 1994, and extreme gullying was initiated by a convective summer storm in 1995, possibly due to water repellent conditions of the surface soil. The gullies generated by this storm are probably ephemeral channels but, nevertheless, resulted in considerable erosion. Robert E. Lee (REL) Creek was partially burned to a light to moderate degree. Although REL Creek is adjacent to Trapper Creek, intense gullying did not occur in REL Creek, presumably because of either higher infiltration capacity from less water repellency or lower localized rainfall intensities or a combination of both these effects.
3.2. Field Observations
 A limited data set of channel head locations was collected using the definition of the channel head as the upstream limit of observable erosion and concentration of flow within definable banks [Montgomery and Dietrich, 1989]. Channel heads in Trapper Creek have head cuts of 0.3 m to 1m in width and an average of 0.5 m depth. Some head cuts were even observed at the ridge crest. Local slopes (over a length of 10 to 20 m) at the channel heads were measured in the field using a Suunto inclinometer and a GPS was used to record locations. Sediment size samples were collected just above the head cuts to approximate the sediment size in transport when the incisions occurred. These samples were sieved and the median size of each sample was determined. Contributing areas were derived from the 30 m DEM of the study site using the D∞ algorithm [Tarboton, 1997]. Contributing area was also checked in a few channel head locations by hiking perpendicular to what was judged to be the elevation contours, from both sides of the channel heads. This data for Trapper Creek is presented in Table 1.
Table 1. Observed Specific Catchment Area, Slope, and Median Grain Size at Channel Head Locations in Trapper Creek and Calculated Parameters Required to Estimate Channel Initiation Threshold, C, and Calculated C Values at Each Location
NA indicates sediment size distribution data are not available.
 Channels in Trapper Creek were usually discontinuous and faded out downstream from the most upstream limit of erosion within definable banks, with sequential head cuts occurring downslope. In these cases we also recorded the locations of the most downslope head cut, which we term the head of the continuous channel. Starting from the heads of the continuous channels, slopes were measured in the field at intervals from 20 to 40 m for distances ranging between 150 m and 500 m downslope. GPS was used to record the slope measurement locations and contributing areas were again derived from the 30 m DEM of the study site using the D∞ algorithm [Tarboton, 1997].
 We do not know the rainfall rate of the storm that incised the channels in Trapper Creek. However, some forest service personnel were exposed to that storm and they estimated that more than 2.54 cm (1 inch) of rain fell in less than half an hour. It is conceivable that this kind of a convective storm event could result in local intensities of between 50 mm/h and 100 mm/h.
 Two distinct types of channel heads were observed in REL Creek. We observed channel incisions because of saturation excess overland flow (SEOF) where fire effects were insignificant. The vegetation mat was broken mostly by seepage forces and sediment entrained by saturation overland flow. These channel heads support gravel bed streams with definable banks stabilized by vegetation. Infiltration excess overland flow (IEOF) channel incisions were observed where there is disturbance by light to moderate fire or forest harvest or there is bedrock exposure and coarse materials such as cobbles and rock fragments on the surface. There was not significant vegetation cover around IEOF channel heads in REL Creek and channels were ephemeral discontinuous small gullies. The channel heads located on hillslopes where there is coarse surface material, rock fragments or exposed bedrock were separated in the model analysis because the percentage of rock was not quantified in our field measurements limiting our ability to use methods that account for rock percentage [Abrahams and Parsons, 1991; Nearing et al., 1999].
4. Pointwise Comparison of the Observed aSα and Calculated C at Channel Heads
 The theory developed above suggested that variations in d50, na, and r are responsible for the variation in aSα at channel heads. Here the measured values of d50 in Trapper Creek are used to test the contribution of d50 to this variability. We set α = 1.167 following equation (9). We also set r = 35 mm/h in the infiltration excess runoff model assuming about 50% infiltration due to the water repellency remaining a year after the wildfire and rainfall around 70 mm/h in the range 50 to 100 mm/h reported for the channel forming storm. We adjusted na to optimize the fit of C values to aSα and obtained na = 0.052. This is within the range from 0.04 to 0.12 reported by Woolhiser et al.  and American Society of Civil Engineers  for sparse vegetation and surface litter.
indicate that about 40% of the variability in observed aSα may be attributed to d50. A similar fit can be obtained for different r values by adjusting na with both r and na still being within the range of their uncertainty.
5. Comparison of Channel Initiation Probability Distributions
 The pointwise comparison shown in Figure 3 revealed uncertainties associated not only with d50 but also with r and na. Here we employed the scheme given in Figure 1 to derive a probability distribution for C by Monte Carlo Simulation and compare the derived distribution with the aSα distribution observed at channel heads in the field. This is a single event test of the PCI concept with probability distributions of the inputs chosen as estimates of the single 1995 event responsible for initiation of the channels mapped.
 We found that the d50 data from Trapper Creek presented in Table 1 was well described by a lognormal distribution with mean d50 = 2 mm and standard deviation of 0.48 mm (Figure 4). Uniform distributions were used for the other two input variables, with r between bounds of 15 and 55 mm/h and na between bounds of 0.015 and 0.1. These distributions quantify the uncertainty associated with this single event in a simple way. In management applications that integrate over time, other distributions may be more appropriate, specifically for r. Fifteen hundred random values for d50, r, and na were generated. Figure 5 compares the cumulative distribution of the simulated C to the cumulative distribution of the observed aSα for channel heads observed in Trapper Creek. The bounds in the probability distributions for r and na have been adjusted by trial and error to achieve this fit but are well within the range that is plausible. The field observed channel head aSα have an average, standard deviation and skewness of 26 m 18.4 m and 1.13. The same statistics obtained for C from the Monte Carlo simulation scheme are 26 m, 18.5 m, and 1.56, respectively. Both the simulation statistics and the cumulative distribution of the simulated C show good correspondence with the observed aSα variability in the field.
 Two distinct channel initiation processes, IEOF and SEOF, were observed in REL Creek. We found that the d50 data set collected from REL Creek at sites where the channel heads were observed can be fit by a lognormal distribution (Figure 6). The measured d50 data have a mean and standard deviation of 3.12 mm and 1.05 mm, respectively. These numbers are larger than for Trapper Creek. REL Creek soils were generally coarser than Trapper Creek. The lognormal distribution (Figure 6) is used for both IEOF and SEOF channel initiation processes in REL Creek to characterize the d50 variability. Uniform distributions were used for the other two inputs r and na. We used r between bounds of 20 and 30 mm/h and na between 0.043 and 0.1 to estimate the PCI for IEOF channel heads. Comparison of the cumulative distributions of the simulated C and the observed aSα for IEOF channel heads in REL Creek is given in Figure 7a. The average, standard deviation, and the skewness of the field observed aSα of the IEOF incisions in REL Creek were 70 m, 33, m and 1.5, respectively. The same statistics obtained from the model are 73 m, 39 m, and 1.7. Since the locations in the REL Creek where IEOF channel heads were observed were lightly burned compared to Trapper Creek, our calibrations of r and na are consistent with lower runoff rates (higher infiltration or less intense rainfall) and slightly higher additional roughness values (more vegetation) in those sites.
 The IEOF channel heads shown in Figure 7a (as circles) do not include those where coarse surface material, rock fragments and bedrock exposure was observed. Instead, the probability distribution of these is shown separately (as triangles) in Figure 7a. Apart from two points at the high end of the distribution all these channel heads with coarse material, rock fragments, and exposed bedrock lie to the right of the aSα distribution from the other more sandy locations, consistent with the idea that there is a higher critical shear stress threshold at these locations.
 At most of the locations of SEOF channel heads saturation and exfiltration was observed in the field without any significant sediment detachment around the channel heads. Equation (11b) gives the specific catchment area required for SEOF. In this equation a value of T/r = 900 m defines a zone of saturation that just includes all the SEOF channel head locations that we mapped. This provides an upper bound on T/r because a finite increment of flow is required to initiate sediment detachment around the channel heads. We modeled the saturation excess channel initiation process assuming a constant T/r of 600 m in equations (11) and (12). From DEM analysis we saw that this value saturates the axis of the major hollows in the watershed and leaves out most of the IEOF channels where no signs of recent sediment transport were observed. Various combinations of T and r can produce a T/r saturation threshold value of 600 m. For example, a T of 40 m2/d requires a steady state precipitation rate of approximately 0.066 m/d. We analyzed 69 years of available precipitation records in the region and found that a daily rainfall of 0.066 m (66 mm) has a return period of about 80 years. The presence of snow can enhance the surface water input due to snowmelt by the rain on snow effect being added to rainfall inputs. However, there is also leakage from the shallow soil into bedrock fractures common in the area. Megahan and Clayton  presented 58 bedrock-saturated hydraulic conductivity tests ranging from 0 to 1.7 m/d with a mean and a median value of 0.175 m/d and 0.02 m/d.
 We used a constant r of 0.066 m/d (2.75 mm/h) with T = 40 m2/d to characterize the rainfall plus possible snowmelt minus deep percolation losses and evapotranspiration for the SEOF channel initiation analysis. For comparison Montgomery  used T = 65 m2/d in the Oregon coast range and Dietrich et al.  used T = 17 m2 in their Tennessee Valley California study area. We used the same lognormal probability distribution for d50 from field observations in REL Creek (Figure 6) as was used for the IEOF channel initiation analysis. We selected 0.0135 and 0.006 for the lower and upper bound of the uniform distribution for na through calibration. Here the effect of additional roughness (na) is minor because the vegetation mat was broken by seepage forces in most sites. Figure 7b compares the cumulative probability distributions of the observed (a-as)Sα and the simulated C at SEOF channel heads. Since the calibrated additional roughness values are very low, and r and T were held constant, Figure 7b shows how the random variation of median sediment sizes can produce significant variation of channel head locations on the terrain.
 In the PCI theory, equations (15) and (17) require the probability density function of C. Several parametric probability distributions such as exponential, normal, lognormal and gamma were fitted using the mean and variance of the aSα observed. Among those we found that gamma distribution gave the best correspondence with the observed aSα and simulated C distributions (Figures 5 and 7). The gamma probability density function is given by
where C is the threshold for channel initiation and λ and k are the distribution parameters which can be estimated from the mean, μC, and variance, σC2, of the observations using μC = k/λ and σC2 = k/λ2. Γ( ) is the gamma function.
6. Sensitivity Analysis
Figure 8 shows the sensitivity of the cumulative distribution of C to the distributions used for d50, r, and na. In Figure 8a, cumulative distributions for C were generated holding d50 at its minimum observed, mean and maximum observed values but generating na and r from their random distributions. Figure 8b shows the cumulative distributions obtained holding r fixed at its lower bound, mean and upper bound values while generating na and d50 from their random distributions. Figure 8c shows the cumulative distributions obtained holding na fixed at its lower bound, mean and upper bound values while generating r and d50 from their random distributions.
Montgomery and Dietrich  tabulated source areas and slopes at 63 channel heads in their Tennessee Valley California study area. Figure 9 plots the cumulative distribution of ASα for these data and shows that it is also well approximated by a gamma distribution. Note that in this comparison contributing area A in m2 is used rather than specific catchment area because the colluvial fill width given in the Montgomery and Dietrich paper is not the same as contour width in the definition of specific catchment area. We have not attempted to fit parameters to the distributions for r, na, and d50 for this area because we have insufficient information to do so. Nevertheless, we present these data as third party data illustrative of variability of channel initiation threshold of the same form as predicted by our theory.
 Since the gamma distribution fits the data well, we used it in equation (15) to derive a map of PCI over the study watersheds (Figure 10). Here aSα is evaluated at each point from the DEM. PCI is then mapped at each location as the cumulative probability from the incomplete gamma function associated with the aSα value and fitted gamma distribution parameters. The maps show the topographic expression of PCI. Channel head locations towards the upper ends of high PCI zones are consistent with the theory. The PCI values mapped in Trapper Creek correspond to the gamma distribution fitted to observed channel heads (all IEOF) in Figure 5. The PCI values mapped in REL Creek correspond to the gamma distribution fitted to the observed IEOF channel heads in Figure 7a. The different gamma distributions lead to different PCI values for these watersheds, which reflect the effects of the more intense fire in Trapper and increased PCI following the fire.
7. Analyzing Hillslope to Channel Transition Using PCI in Slope-Area Diagrams
 Slope versus contributing area diagrams are often used to unravel geomorphic processes from DEMs and field observations [Montgomery and Dietrich, 1992; Tarboton et al., 1992; Ijjasz-Vasquez and Bras, 1995]. In order to show the use of PCI for evaluating the channelization process on slope-area diagrams, Figures 11 and 12 show the slope-area representation of the PCI theory using contours of equal PCI for our study watersheds. The probabilistic approach generates a specific PCI as a function of slope and specific catchment area. The probability of channel initiation increases with the increase of either slope or specific catchment area. In Trapper Creek where the heads of continuous channels were mapped separately, we see that heads of observed continuous channels tend to occur in zones with high PCI (the triangles on Figure 11 that occur where PCI > 0.7), while channel heads not associated with continuous channels occur where PCI is lower (circles on Figure 11).
 In section 5 we used a T/r = 600 m with equation (11b) to model saturation excess. The line separating saturated from unsaturated areas is shown on Figure 12 (the dashed saturation line) according to this relationship. This separates naturally in slope-area space the channel heads where saturation was observed (SEOF channel heads) from those where saturation was not observed (IEOF channel heads)
 Erosion threshold theories for channel initiation [e.g., Montgomery and Dietrich, 1988, 1989; Dietrich et al., 1993; Montgomery and Foufoula-Georgiou, 1993] have often attributed the observed inverse relationship between the drainage area to support a channel head and the local slope at channel heads to the analytical form of erosion threshold functions such as equation (9). A positive relationship between drainage area and local slope at channel heads can be explained (referring to equations (11) and (12)) in terms of the increase in drainage area required to support runoff with increasing slope in a saturation threshold runoff model [Montgomery and Dietrich, 1994]. Both of these effects are seen in Figure 12 where the IEOF constant PCI contours have a negative slope-area relationship and the saturation line and constant PCI contours for SEOF where slope is large have a positive slope-area relationship. We observed (Figure 11) a positive relationship between area and slope at channel head locations in Trapper Creek, but an inverse relationship similar to the previous studies in IEOF channel heads in REL Creek (Figure 12) and a somewhat constant slope with varying specific catchment area in SEOF channel heads in REL Creek. Prosser and Soufi  found no clear slope dependency on the drainage area required to support gully heads in intensely gullied hillslopes following land use change.
 In interpreting these results we feel that the observed slope-area relationships derived from the topography of landforms and the functional relationship between area and slope derived from the excess-shear stress model with constant threshold are two different concepts. The negative (or positive) slope-area relationships obtained from equality in equations (9) or (12) assume a fixed C at all channel heads, whereas the point of this paper has been to explore spatial and temporal variability in C, characterized using probability. Channel formation in equation (9) or (12) requires that at a point, a combination of area and slope should be greater than a certain threshold independent of the morphology of the landscape. Area-slope trends on the topography are established over the long-term evolution of the landscape due to the interactions between diffusive hillslope and fluvial channel processes. Therefore, depending on the C threshold at an instant in time (e.g., a thunderstorm event following a wildfire), channels may erode different hillslope positions where different processes dominate over the long term. If channelization occurs where diffusive processes dominate over the long term (convex hillslopes), then a positive relationship between area and slopes at observed channel heads is expected. The gully heads observed in Trapper Creek after significant disturbance are an example of this effect. Similarly, when channel heads are observed where fluvial sediment transport is dominant (concave valleys), then the area-slope relationship would be negative. One may also observe channel heads in the field where over the long-term diffusive and fluvial processes are in balance. In this case channel head data would reveal no distinct relationship between area and slope. The only case for which channel heads observed in the field may follow the functional form of the relationship between area and slope derived from the shear stress theory is the rather special case where all the model parameters of equation (10) are constant in space and time. In this case an erosional threshold line on area-slope plots would show the transition point between hillslopes and channels [e.g., Tucker and Bras, 1998, Figure 11].
 On the basis of the observations presented in this paper, we conclude that channels can incise in different topographic locations depending on the spatial and temporal variability of climate and land cover. The probabilistic channel initiation zone on the slope-area diagrams shifts back and forth along the specific catchment area axis (compare Figures 11 and 12) depending on the model input values, such as variation of runoff rates and additional roughness due to vegetation cover, that characterize climate and land cover variability.
 Flow concentration due to channelization is an important mechanism in erosion and sediment transport. In this paper we have presented a probabilistic theory for modeling channel initiation and tested the theory using field data as an initial step in quantifying this process considering the inherent uncertainties in nature. In this paper we first showed that measured variability of sediment grain size is related to and accounts for a significant portion of the variability in channel initiation threshold observed in the field. We then showed that the probabilistic model developed is capable of producing a probability distribution of channel initiation threshold that matches the observed slope-area dependent channel initiation threshold (Figures 5 and 7). We recognize that probability distributions comparisons are weaker than point-wise comparisons, but have no way to quantify the specific spatial patterns of inputs necessary for more detailed pointwise tests of the model. In the probability distribution comparisons presented, uniform distributions were assumed for some of the inputs. The specific form of these may be questioned. In particular, exponential or gamma distributions are more common distributions used for rainfall. The uniform distribution was used as a convenient noninformative a priori choice. Without specific information it is easy to understand and conceptualize equal likelihood between upper and lower bounds. On the basis of our experimentation, there is sufficient flexibility in the model that it is relatively insensitive to the specific form of the input distribution chosen. It is more important to quantify the variability and uncertainty using some probability distribution rather than worry about the precise form of distribution chosen.
 Estimating fluvial sediment transport from hillslopes first requires predicting the channeled portions of the terrain. A model that uses a single channel initiation threshold will predict significant erosion only in locations where channelization is predicted on a long-term basis. Our probabilistic model provides a way to account for the less frequent contribution to erosion due to channelization even at locations that do not meet the single channelization threshold.
 The effects of different land management practices on channel initiation and the erosion risk can be visualized by mapping the PCI using DEMs. In forested watersheds, for example, these maps may be useful for spatial planning of forest harvests and developing soil conservation techniques after wildfires. Mapping PCI can also be used in planning land use in agricultural watersheds where the effects of different crop patterns and tillage practices on PCI and the potential for gully erosion is of concern.
 The hydrologic response of a basin, such as hydrograph characteristics, is strongly related to geomorphology and climate [Rodriguez-Iturbe, 1993]. The closer the channel head is to the watershed divide the longer distance the flow will travel in channels rather than on hillslopes. Expansion of ephemeral gullies to unchanneled hillslopes within a storm event decreases flow travel times within the channel network [Beven and Wood, 1993]. Mapping the probability of channel initiation on the terrain (Figure 10) reveals the probabilities of flow transport in both overland and channeled states and contributes to understanding the effects of geomorphologic changes on the hydrologic response of the watersheds.
 We appreciate the thoughtful reviews by David Montgomery and an anonymous WRR reviewer. We are greatly appreciative of financial support from the U.S. Department of Agriculture under contract 9901085 awarded through the Water Resources Assessment Protection Program of the National Research Initiative Competitive Grants Program. We thank field technicians Tom Black and Bernie Riley.