The Gravity Recovery and Climate Experiment, GRACE, will deliver monthly averages of the spherical harmonic coefficients describing the Earth's gravity field, from which we expect to infer time-variable changes in mass, averaged over arbitrary regions having length scales of a few hundred kilometers and larger, to accuracies of better than 1 cm of equivalent water thickness. These data will be useful for examining changes in the distribution of water in the ocean, in snow and ice on polar ice sheets, and in continental water and snow storage. We describe methods of extracting regional mass anomalies from GRACE gravity coefficients. Spatial averaging kernels were created to isolate the gravity signal of individual regions while simultaneously minimizing the effects of GRACE observational errors and contamination from surrounding glacial, hydrological, and oceanic gravity signals. We then estimated the probable accuracy of averaging kernels for regions of arbitrary shape and size.
 Time-variable gravity changes are caused by a combination of postglacial rebound, fluctuations in atmospheric mass, and the redistribution of water, snow, and ice on land and in the ocean. The spatial resolution of gravity data obtained from satellite measurements has not yet been sufficient to separate the effects of these processes from one another. Data from GRACE, launched in 2002 should provide dramatically improved time-variable gravity measurements. GRACE will deliver monthly averages of the spherical harmonic coefficients describing the Earth's gravity field at scales of a few hundred kilometers and larger.
 From the gravity field estimates, we expect to infer time-variable changes in mass, averaged over arbitrary regions having length scales of a few hundred kilometers and larger, to accuracies of better than 1 cm of equivalent water thickness. These data will be useful for examining changes in the distribution of water in the ocean, in snow and ice on polar ice sheets, and in continental water and snow storage. These quantities can then be used to assess and improve climate models, to better understand large-scale hydrological processes, and to monitor the distribution of land-based water for agricultural and water resource applications. Combined with radar altimetry over the oceans, these data can improve estimates of the time-varying ocean heat storage, as well as deep ocean currents. In polar regions, GRACE data can be used to study postglacial rebound and, in conjunction with laser altimetry, to constrain the mass balance of ice sheets.
 Because the spatial resolution of GRACE is on the order of a few hundred kilometers, an estimate of a surface-mass anomaly will not be a point measurement, but rather a spatial average. Wahr et al.  introduce an averaging method based on a simple Gaussian filter. However, this method does not isolate a specific region. In order to use GRACE to estimate regional changes in surface mass, techniques must be developed which extract regional mass anomalies from GRACE gravity coefficients. In this paper, we describe methods of creating spatial averaging kernels which isolate the gravity signal of individual regions while simultaneously minimizing the effects of GRACE observational errors and the contamination from surrounding glacial, hydrological, and oceanic gravity signals. We then estimate the probable accuracy of averaging kernels for regions of arbitrary shape and size.
2. Inferring Surface Mass Changes From the Time-Variable Gravity Field
 It is usual to represent the Earth's gravity field in terms of the shape of the geoid, the equipotential surface that most closely coincides with mean sea level over the ocean. The geoid, N, can be expanded as a sum of normalized associated Legendre functions, lm [see, e.g., Chao and Gross, 1987]:
where θ is colatitude, ϕ is longitude, a is the mean radius of the Earth, and Clm and Slm are dimensionless Stokes coefficients. GRACE Project personnel will use the GRACE measurements to solve for the Stokes coefficients up to degree l ∼ 100 every 30 days, and these coefficients will be made available to users. The spatial scale, λ, associated with a particular Stokes coefficient is inversely proportional to its angular degree, l, and can be found approximately by the relation λ = 20,000 km/l, so that l ≤ 100 correspond to length scales of ∼200 km and larger.
 Using these coefficients, it will be possible to infer changes in the gravity field from one 30-day period to the next, and so to study processes involving the redistribution of mass within the Earth and on or above its surface. GRACE will be accurate enough to be sensitive to changes in the Earth's gravity field caused by fluctuations in continental water storage and the polar ice sheets, as well as by changes in atmospheric and oceanic mass distribution. The contributions from the atmosphere can be estimated from independent atmospheric data and largely removed [Velicogna et al., 2001; Swenson and Wahr, 2002]. Because most of the remaining sources of time-variable mass change are confined to a thin layer at the Earth's surface, one can approximate the vertically integrated water and ice mass as a surface mass density. With this approximation, Wahr et al.  show that a local change in surface mass density, Δσ(θ,ϕ), can be related to changes in the Stokes coefficients, ΔClm and ΔSlm, by
where ρE is the average density of the solid Earth and kl are the load love numbers representing the effects of the Earth's response to surface loads. The love numbers can be obtained from Wahr et al. .
 The Stokes coefficients produced by GRACE will contain measurement errors, Δδlmc and Δδlms, such that
Satellite measurement errors include system noise error in the intersatellite range rate, accelerometer error, error in the ultrastable oscillator, and error in the orbits.
 If the coefficients of the satellite errors and the surface mass anomaly at n times t1,t2,…tn are uncorrelated for every (l, m) and (l′, m′),
then two terms comprise the expression for the variance of the inferred surface mass anomaly at the point (θ, ϕ):
where the contribution to the variance due to satellite measurement error is
converts the geoid coefficients to surface mass coefficients, and Λll′ m m′ are the covariance matrices of the GRACE measurement errors. If, for example, the statistical properties of the measurement errors were the same for each of the n months ti, i = 1, n, then
While calculations utilizing the full covariance matrices are outlined in Appendix A, the calculations in this paper are made with certain assumptions which greatly simplify the covariance matrices. First, we assume Δδlmc and Δδl′m′s are uncorrelated for all values of l, l′, m, m′; equations (10) and (11) therefore vanish. Second, we assume Δδlmc and Δδl′m′c are uncorrelated, and Δδ lms and Δδl′m′s are uncorrelated, unless l = l′ and m = m′, so that equations (9) and (12) vanish in those cases. Last, errors are assumed to depend on spatial scale but not orientation; that is, Δδlmc = Δδlms and these coefficients depend on l but not m. With these assumptions, equation (7) becomes
Because of our assumptions concerning Δδlmc and Δδlms, the contribution of the satellite measurement errors to the variance of the surface mass anomaly estimate (equation (13)) is independent of location. Bl are the degree amplitudes of the variance of the satellite errors and describe the contribution of the error at a particular wavelength to the variance of the geoid. In this paper we use preliminary estimates of Bl as a function of l provided by B. Thomas and M. Watkins at the Jet Propulsion Laboratory (personal communication, 1996) that are consistent with those described in Jet Propulsion Laboratory .
Figure 1 shows the square root of var(σ)sat as a function of l. For values of l greater than about 5, the GRACE satellite error estimates increase rapidly with increasing l. In principle, the summations in equations (2) and (13) include contributions from all wavelengths, up to l = ∞, and the satellite errors will lead to extremely inaccurate results. Because GRACE will deliver ΔClm and ΔSlm only to l ∼ 100, the sums will in practice be truncated at ltrnc = 100. Although neglecting the contributions of coefficients with l > 100 will reduce the amount of satellite error present in the estimate of Δσ(θ, ϕ), the error from the larger remaining values of l will still seriously degrade the solutions. In addition, using a truncated sum in equation (2) is not equivalent to a point measurement, Δσ (θ,ϕ) because it lacks components having length scales less than ∼200 km.
 These issues can be avoided by averaging Δσ(θ, ϕ) over a region. Spatial averaging reduces the contributions from large l to the summation in equation (2), reducing the effects of both satellite error and misrepresentation of the gravity field due to the absence of short-wavelength components.
3. Spatial Averaging to Improve Accuracy
 The accuracy of estimates of surface mass anomalies can be improved by spatial averaging, at the expense of spatial resolution. An exact averaging kernel, ϑ(θ, ϕ), is a function which describes the shape of the basin (e.g., a river basin, a region of the ocean floor, an ice sheet, a political boundary) according to
The change in vertically integrated water storage averaged over an arbitrary region is
where dΩ = sinθ dθ dϕ is an element of solid angle. Integrating ϑ(θ, ϕ) over the sphere gives Ωregion, the angular area of the region of interest. Using equation (2), equation (16) can be reexpressed by a sum of Stokes coefficients as
where ϑlmc and ϑlms are the spherical harmonic coefficients describing ϑ(θ, ϕ):
One effect of the basin coefficients in equation (17) is to reduce the contribution to from the Stokes coefficients for large l. For example, Figure 2 shows the degree amplitude spectrum,
for disc-shaped basins having radii of 1000, 500, and 100 km. The spectrum has been normalized by dividing by ϑ0 for each case. As the basin size increases, its degree amplitude is concentrated at relatively smaller l, corresponding to longer wavelengths. Averages over larger regions therefore are influenced less by poorly known short-wavelength signals than are smaller regions.
 Basin averages calculated using Stokes coefficients provided by GRACE will differ from the true basin average due to the presence of satellite measurement errors and the absence of coefficients for l > 100. Failure to include all ϑlmc and ϑlms results in an inaccurate representation of the basin shape. Figure 3 shows cross sections of ϑ reconstructed from ϑlmc and ϑlms for different values of ltrnc, the value of l at which the summation in equation (18) is truncated. The ringing present in these reconstructions near the boundaries of the basin mask, called the Gibbs phenomenon, is due to the absence of basin coefficients with l > ltrnc. Ringing increases as ltrnc decreases, resulting in a basin average which samples more of the region outside the basin.
 With these errors in mind, we write the basin average computed using Stokes coefficients provided by GRACE as the sum of the true basin average, satellite error, and truncation error:
4. Reducing Satellite Measurement Error
 Because GRACE will provide Stokes coefficients only to degree and order ∼100, components of surface-mass variability corresponding to l > 100 will be absent from the calculation of the basin average. The contribution of high l Stokes coefficients to large basin averages may be unimportant, because, as shown in Figure 2, the basin coefficients become quite small for high l, but as basin size decreases their contribution becomes more significant. In addition to error due to truncation, GRACE data will contain satellite measurement errors. The simplest way to reduce these errors would be to truncate the sum in equation (23) at some ltrnc < 100. However, decreasing the value of ltrnc increases equation (24), the amount of truncation error. We would like to find a method of decreasing the satellite errors in the estimates of basin-averaged surface mass change without increasing the error due to truncation.
 The exact averaging kernel equation (15) changes discontinuously from a value of 1 to 0 at the basin boundaries, resulting in the presence of short-wavelength ϑlmc and ϑlms in the expansion of ϑ(θ, ϕ). Because the satellite errors are greatest at large l, the contribution from these short-wavelength coefficients dominates equation (23), the basin-averaged satellite measurement error. The expansion of an averaging kernel which varies smoothly across the basin boundary has less power in its short-wavelength coefficients than does the exact averaging kernel. However, the basin average computed using a smoothed averaging kernel will less accurately represent the true basin average. This approximate average will be influenced by mass signals outside the basin, referred to as leakage, as well as over- or under-estimating the contribution of the signal inside the basin. While the approximation error cannot be eliminated, we will show that one can create smoothed averaging kernels which produce a reduction in satellite measurement error while keeping the amount of leakage error in the basin-averaged estimates of surface-mass change to an acceptable level for basins having length scales of a few hundred kilometers and larger. We describe two types of averaging kernels. The first kind, described in section 4.1, incorporates a Gaussian filter, and is relatively simple to visualize and compute. The second kind, described in section 4.2, is more complicated but provides a means of minimizing either the leakage error or the satellite measurement error, or the sum of the two.
 An approximate basin average can be obtained by replacing the exact averaging kernel, ϑ(θ, ϕ), by an approximate averaging kernel, (θ, ϕ), in equation (16):
where denotes the approximate basin average. Expanding as
the approximate basin average can be expressed in terms of Stokes coefficients as
When computed using this approximate averaging kernel, the contribution of satellite measurement error to the variance of the average surface mass anomaly becomes
4.1. Gaussian Smoothing
 A smooth averaging kernel, (θ, ϕ), may be created in a straightforward way by convolving the basin function, ϑ(θ, ϕ), with a Gaussian filter:
where equation (29) is integrated over solid angle and, following Jekeli , the Gaussian filter, W(θ, ϕ, θ′, ϕ′), depends only on the angle γ between two points (θ, ϕ) and (θ′, ϕ′), i.e., cos γ = cos θ cos θ′ + sin θ sin θ′ cos(ϕ − ϕ′),
is the half width of the Gaussian smoothing function; when γ = , . The new averaging kernel, , changes smoothly from a value of 1 inside the boundary to a value of 0 outside the boundary over a horizontal distance of approximately . For this type of averaging kernel, the weighting coefficients in (26) are defined according to
Wl may be computed recursively by the following relation:
Figure 4 shows the result of using a Gaussian filter to smooth the exact averaging kernel, in this case that of the Missouri river basin. Changing the half width of the Gaussian filter allows one to control the relative amounts of satellite and leakage error. Small values of lead to an approximate averaging kernel which changes rapidly at its boundaries; the weighting coefficients therefore contain power at short-wavelengths and the error budget is dominated by satellite error. Large values of reduce the amplitude of the short-wavelength weighting coefficients, but increase the amount of error due to leakage from regions surrounding the basin of interest.
Figure 4a shows the exact averaging kernel. It has a value of 1 inside the basin and 0 outside the basin. As previously noted, the ringing at the boundaries is due to the finite upper limit (ltrnc = 100) in the summation in equation (26) used to calculate (θ, ϕ). Figures 4b and 4c show the effects of smoothing using 200 and 400 km Gaussian filters, respectively. At 200 km, the averaging kernel changes continuously across the basin edge, while at the same time nearly reproducing the basin shape. When a 400 km filter is applied, the averaging kernel becomes much smoother than that in Figure 4b and samples a larger portion of the surrounding regions.
4.2. Optimizing the Averaging Kernel
 While smoothing the exact averaging kernel with a Gaussian filter provides a simple and intuitive way of creating an averaging kernel with decreased short-wavelength components, it may not provide the most accurate estimate of the basin average. We outline two minimization techniques which incorporate measures of satellite and leakage errors to create an optimal averaging kernel. In the first case we minimize the sum of the satellite and leakage errors, while in the second case we fix one type of error to a specific value and minimize the other type.
 If ϑ is the exact averaging kernel, then the total approximation error, or leakage, in our estimate of the mass anomaly of the basin at time ti is the difference between equations (16) and (25):
The variance of the leakage is then
While the construction of an optimal averaging kernel can incorporate a signal covariance function having any angular dependence (described in Appendix A), the expression for var(εlkg) is greatly simplified if we assume that there exists a correlation which depends only on the angular distance, α, between (θ, ϕ) and (θ′, ϕ′), so that
where σ02, the variance of the surface mass signal at any point, is assumed to be uniform over the basin and its surroundings, and G(α) is an appropriate function for describing the correlation (in section 5, G(α) will be chosen to be a Gaussian function of α). Replacing and ϑ with their respective spherical harmonic expansions, the variance of the basin average leakage becomes
4.2.1. Minimization of total error
 To find averaging kernel coefficients which minimize the total error, we set the partial derivatives with respect to Wlmc and Wlms of the sum of the error variances to zero,
which produces the following set of equations:
Equation (41) makes it possible to find optimal averaging coefficients for a specific basin, from knowledge of the signal variance (σ02), the signal correlation function (Gl) and the degree variances of the GRACE measurement errors (Bl2).
4.2.2. Lagrange multiplier method
 While the previous method fulfills the goal of minimizing the total error, it requires a priori estimates of both the amplitude and angular dependence of the covariance function to do so. If these estimates are unavailable or unreliable, it may desirable to create an averaging kernel which does not depend on a foreknowledge of the signal characteristics. An alternative definition of leakage, independent of the signal, is the ratio of the spatial variance of the difference between the exact and approximate averaging kernels to that of the exact averaging kernel,
One may use the method of Lagrange multipliers to create an averaging kernel which minimizes this leakage error subject to a constraint on the value of satellite measurement error. Let δ2 be the desired variance of the average satellite measurement error equation (28) and let Δ2 = δ2 Ωregion2. Denoting the Lagrange multiplier by λ, we determine the values of Wlmc, Wlms, and λ that minimize the quantity
where we have absorbed the 4πΩregion in equation (42) into λ. Setting the partial derivatives of ξ with respect to Wclm and Wslm equal to zero gives the set of equations
Setting the partial derivative of ξ with respect to λ equal to zero returns the requirement that the effects of the satellite measurement error be equal to Δ2:
which can be combined with (44) to give an equation for λ
Once λ is determined from equation (46), it can be used in equation (44) to solve for Wlmc and Wlms. In general, there are approximately ltrnc2 values of λ which are solutions to equation (46). However, there exists only one solution which is positive, and it is this root which provides the true leakage minimum; the negative roots are only local extrema. If one inadvertently specifies a value for Δ2 which is greater than that obtained by using the exact averaging kernel coefficients, ϑlmc and ϑlms in equation (28), then no positive roots of equation (46) exist.
 To create an averaging kernel which fixes the leakage error (42) to a specific value while minimizing the satellite error, one still uses equation (44). λ, however, is determined from
 To assess the probable accuracies of the approximate averaging kernels, we first examine the one-dimensional case of a disc-shaped basin. To compute the leakage error, one requires an estimate of the length scale, d, of the spatial correlation of the surface-mass change signal. One source of surface-mass variability is terrestrial water storage. While surface water, snow water, and groundwater are all sources of water storage variability, Rodell and Famiglietti  determined that the largest component of variability in the American Midwest is soil moisture. We assume that this relationship characterizes all of the basins examined in this study. Studies such as Entin et al. , Vinnikov et al. , and Cayan and Georgakakos  have shown the spatial coherence of soil moisture to have a length scale ranging from 200 to 800 km. While Entin et al.  employ a covariance function which decays exponentially with distance, we have chosen to use a Gaussian distribution (equation (30)) to describe the correlation between surface-mass variability at different locations. This assumption is based on the ease of manipulation of the Gaussian function in the spherical harmonic domain. In this case, the coefficients Gl, of the spatial covariance function (equation (37)) are computed in the same manner as the smoothing coefficients Wl, using the recursion relation (34) where d, the spatial correlation length scale, replaces .
 To compare the Gaussian smoothing process to the minimization process, we first create Gaussian kernels for disc-shaped basins with filters of various half widths. For each kernel, we compute the satellite error using equation (28). Next, we use the Lagrange multiplier method to create an optimal averaging kernel for that value of satellite error, and compare the leakage ratios for the two types of averaging kernel. We define the leakage ratio, Rlkg, as the square root of the ratio of the variance of the error caused by the approximate averaging kernel to the variance of the true basin-averaged signal. Rlkg is thus independent of the amplitude of the covariance function. Because the leakage contains the effects of both the approximate averaging kernel and the finite upper limit in the summations in the expressions for the basin averages, we split the sum in equation (42) at ltrnc. Dividing by the expression for the correct surface-mass variance gives
The first term on the right-hand side of equation (48) describes the approximation error made by replacing the true basin function, ϑ(θ, ϕ), by the approximate averaging kernel, (θ, ϕ). In this paper we assume GRACE will only supply Stokes coefficients to degree ltrnc = 100. The second term describes the truncation error due to neglecting terms l > 100 when calculating the basin average. While the approximation error depends on the form of a particular averaging kernel, the truncation error has a constant value fixed by the value of ltrnc and, through the Gl, the value of d. The maximum value of satellite error for a given basin shape can be found by setting wlmc and wlms to ϑlmc and ϑlms in (28) when solving for . The exact averaging kernel, ϑ, has an additional property: because its approximation error is identically zero, its leakage ratio is entirely due to truncation error. Figure 5 shows the leakage ratio as a function of satellite error and spatial correlation scale of the signal, d, for a 500 km radius, disc-shaped basin. The level of truncation error can be seen in Figure 5 as the value taken by Rlkg as the satellite error approaches its maximum value (dashed line). From Figure 5a it can be seen that when surface-mass anomalies are correlated to about 10 km, truncation error is about 30% of the signal variance, and dominates Rlkg for satellite errors greater than about 3 mm. As the signal correlation length, d, increases, the amplitude of the short-wavelength components of the surface-mass signal decrease, and therefore the truncation error decreases. As the truncation error diminishes, the difference between the minimization method and the Gaussian smoothing method becomes more apparent. For example, the leakage due to the Gaussian method at 1 cm satellite error and 200 km correlation length is almost double that due to the minimization method. The effectiveness of the minimization method in reducing leakage error is even more pronounced in the case of a 200 km radius disc (Figure 6).
Figure 7 shows the reconstructed cross sections of the Gaussian and optimal averaging kernels designed to produce a satellite error of 1 cm for two disc-shaped basins. Each plot assumes a correlation length, d, of 200 km. Figure 7a shows a 500 km disc. In order to achieve a 1 cm satellite error, the Gaussian averaging kernel has a roll-off, , of 125 km. A visual comparison of the two averaging kernels confirms Figure 5c; the optimal kernel more closely resembles the truncated basin shape (dashed line) and therefore produces less leakage than the Gaussian averaging kernel. The truncated basin shape lacks the short-wavelength components needed to reproduce the exact basin shape (solid line), but because the correlation length is sufficiently large, the component of Rlkg due to truncation error is ≪1.
Figure 7b shows a 200 km disc. The Gaussian averaging kernel in this case employs a roll-off of 215 km to produce the required 1 cm satellite error. Again, the truncated basin shape does not well represent the exact basin shape, but Figure 6c shows the truncation error component to be <5%. However, the leakage is greater than for the 500 km disc because neither averaging kernel closely resembles the truncated basin shape. At 1 cm satellite error, the leakage ratio in Figure 6c is nearly 0.5. This large approximation error is due to the smoothness in the averaging kernels required by the condition that the satellite error must be 1 cm.
6. Comparison With Real Drainage Basins
 While the calculations in the previous section employ disc-shaped basins in order to better understand the effectiveness of each of the averaging kernels, a real basin, whether it describes an oceanic region, a polar ice stream, a river basin, or a political boundary, is rarely shaped so simply. For six North American river basins (Figure 8) of varying shapes and sizes, we calculate the average satellite and leakage errors for each basin using the Gaussian smoothing method and the Lagrange multiplier method. The river basin masks were extracted from the HYDRO 1K Elevation Derivative Database maintained by the U.S. Geological Survey EROS Data Center.
 For a basin average with 1 cm satellite error, leakage ratios obtained using a Gaussian averaging kernel range from approximately 0.4 for the smallest basin (220,000 km2 in area) to 0.1 for the largest basin (1,400,000 km2 in area) for a correlation length of 200 km (Figure 9). Leakage ratios for the minimization method are about half this amount, ranging from 0.2 to 0.05. Taking 4 cm as a typical value for the RMS variation in midlatitude soil moisture [Entin et al., 2000, Table 1], one would then expect leakage due to the use of the Gaussian averaging kernel to account for about 1.6 cm of error for basin (Figure 9a) and 0.4 cm of error for basin (Figure 9f). Use of the minimization method would reduce the leakage to approximately 0.8 cm for basin (Figure 9a) and 0.2 cm for basin (Figure 9f). If the correlation length of soil moisture variability is 500 km (Figure 10), the leakage ratio for a basin with 1 cm satellite error ranges from about 0.2 for the smallest basin to less than 0.05 for the largest basin when the Gaussian smoothing method is used. Applying the optimal kernel results in leakage errors which are negligible for even the smallest of these basins. In this case, one could set the satellite error to as low as 0.2 cm without producing a significant amount of leakage in the basin averaged estimates of surface mass variability.
 GRACE, scheduled for launch in 2002, will deliver monthly averages of the Earth's geoid, in the form of Stokes coefficients. The finite number of Stokes coefficients provided by GRACE prohibits the estimation of surface mass anomalies at a point, but regional averages can be made. Because the expected satellite measurement errors increase rapidly for short wavelengths, an average calculated using the exact representation of the basin shape, the spectrum of which has power at all wavelengths, is influenced by all the errors associated with these gravity data. A smooth averaging kernel contains less power at short wavelengths, and its use in computing basin averages will lead to a smaller satellite error. However, any approximate averaging kernel will no longer exactly sample the region of interest, and signals from surrounding areas will leak into the estimate of the basin average.
 We have outlined different methods for creating approximate averaging kernels which extract the average surface-mass anomaly of a desired region from the GRACE geoid coefficients. Each of these methods reduce the amount of satellite error present in the basin average. One method uses a Gaussian filter (30) to smooth the exact averaging kernel (equation (15)), providing a conceptually and computationally simple means of creating a new averaging kernel. In this case, Wlmc and Wlms used in equation (27), are given by equations (32)–(34). However, tuning the width of the Gaussian filter to change the amount of satellite error in the average requires a certain amount of trial-and-error. In addition, the leakage introduced by this smoothed averaging kernel may not be minimized.
 Two minimization techniques for finding Wlmc and Wlms were outlined, one of which created an averaging kernel which minimized the total error (equation (41)). The other technique used a Lagrange multiplier (equation (44)). The Lagrange multiplier, λ, either minimized the leakage for a given value of satellite error (equation (46)) or minimized the satellite error for a given leakage level (equation (47)). Both methods, described in Appendix A, can incorporate the full covariance matrices of satellite error and/or expected signal. Full satellite error covariance matrices will be provided by GRACE Project personnel when the Stokes coefficients are made available to users. A priori signal covariance matrices may be obtainable from hydrological models or other data sets. However, both minimization techniques simplify considerably under the assumption of azimuthal symmetry.
 The method for developing weighting coefficients derived by minimizing the equations describing the total error requires an a priori description of the expected signal covariance. Because of the additional complexities of analyzing these data, we will assess this method in further studies. Instead, we used a Lagrange multiplier method to create averaging kernels which were optimized without the use of an expected signal variability. The Lagrange multiplier minimization method is somewhat more complicated than applying a Gaussian filter, but it decreases the leakage errors considerably. Calculations for a few drainage basins in North America having areas ranging from 2.2 × 105 to 1.4 × 106 km2, indicate that surface-mass change signals, spatially correlated at length scales of about 200 km, can be retrieved using an averaging kernel which has been smoothed with a Gaussian filter to an accuracy of 10–40% of the signal amplitude plus <1 cm satellite measurement error. Using the Lagrange multiplier minimization method reduces the amount of leakage error by about half, so that mass anomalies can be computed to an accuracy of 5–20% of the signal amplitude plus <1 cm satellite measurement error. Surface-mass change signals spatially correlated at longer length scales can be retrieved with almost no leakage error and satellite errors of less than a few millimeters.
Appendix A:: Inversion With Arbitrary Covariance Functions
 While more complicated than the case of an azimuthally symmetric covariance function, it is possible to use the full signal and satellite error covariance matrices to construct an optimized averaging kernel. Beginning with (36), , ϑ, and σ are replaced with their respective spherical harmonic expansions. The variance of the leakage becomes
where Γll′mm′ are the covariance matrices of the signal:
The variance of the basin-averaged satellite measurement errors is
where Λll′mm′ are the covariance matrices of the GRACE measurement errors (see equations (9)–(12)).
A1. Minimization of Total Error
To find the averaging kernel coefficients which minimize the total error, we set the gradient with respect to of the sum of the error variances to zero,
Equation (A9) is a linear system, which can be solved numerically for .
A2. Lagrange Multiplier Method
 As an alternative to equation (A6) we may define the leakage as the ratio of the spatial variance of the difference between the exact and approximate averaging kernels to that of the exact averaging kernel,
This definition of leakage is free from the assumptions regarding the form of the expected signal required by equation (A6). However, those assumptions provided a means of directly comparing leakage error to satellite error. In the absence of such a connection, one cannot minimize the total error. Instead, one may use a Lagrange multiplier technique to minimize either leakage error or satellite error, subject to a constraint on the other type of error. Denoting the Lagrange multiplier by λ, we determine the values of and λ that minimize the quantity
where Δ2 is the desired variance of the satellite measurement errors, averaged over the basin. The 4πΩregion in equation (A10) has been absorbed into λ. Setting the gradient of ξ with respect to equal to zero gives
where is the identity matrix. Setting the partial derivative of ξ with respect to λ equal to zero returns the requirement that var(εsat) = Δ2, which can be combined with to give an equation for λ
 This method can also be applied to the converse of this problem: minimizing satellite error for a given value of leakage. In this case, equation (A12) still describes the averaging kernel coefficients. However, λ is determined now by solving
where R is the desired ratio of the spatial variances (equation (A10)).
 We wish to thank Steven Jayne and Isabella Velicogna for their helpful discussions. With the assistance of Lynda Lastowka, the river basin boundaries were extracted from the HYDRO 1K Elevation Derivative Database maintained by the U.S. Geological Survey EROS Data Center from their web site at http://edcdaac.usgs.gov/gtopo30/hydro/. This work was partially supported by NASA grants NAG5-9450 and NAG5-7703 to the University of Colorado, by a CIRES Innovative Research Grant, and a NASA Earth Systems Science Fellowship awarded to Sean Swenson.