Agricultural activities have dramatically altered our planet's land surface. To understand the extent and spatial distribution of these changes, we have developed a new global data set of croplands and pastures circa 2000 by combining agricultural inventory data and satellite-derived land cover data. The agricultural inventory data, with much greater spatial detail than previously available, is used to train a land cover classification data set obtained by merging two different satellite-derived products (Boston University's MODIS-derived land cover product and the GLC2000 data set). Our data are presented at 5 min (∼10 km) spatial resolution in longitude by longitude, have greater accuracy than previously available, and for the first time include statistical confidence intervals on the estimates. According to the data, there were 15.0 (90% confidence range of 12.2–17.1) million km2 of cropland (12% of the Earth's ice-free land surface) and 28.0 (90% confidence range of 23.6–30.0) million km2 of pasture (22%) in the year 2000.
 Human land use activities are a force of global significance [Foley et al., 2005]. Humans have extensively modified the Earth's land surface, altering ecosystem structure and functioning, and diminishing the ability of ecosystems to continue providing valuable resources such as food, freshwater and forest resources, and services such as regulation of climate, air quality, water quality, soil resources.
 As such, agriculture is partly or wholly responsible for environmental concerns such as tropical deforestation and biodiversity loss, fragmentation and loss of habitats, emissions of important greenhouse gases, losses of soil quality through erosion and salinization, decreases in quantity and quality of water resources, alteration of regional climates, reduction in air quality, and increases in infectious diseases [Foley et al., 2005]. On the other hand, agricultural expansion and intensification has provided a crucial service to humanity by meeting the food demands of a rapidly growing population [Cassman and Wood, 2005], and thereby involves a trade-off between food production and environmental deterioration [DeFries et al., 2004; Foley et al., 2005].
 In this paper, we present a critical update to our global agricultural land cover data sets. In particular, we present new global data sets for the year 2000, developed using an order-of-magnitude enrichment of our agricultural inventory data, a combination of two different satellite-derived global land cover data sets for year 2000, and improved methods to merge the satellite data and inventory data. We also present, in addition to an updated map of global croplands for the year 2000, a new map of global pastures, as well as estimated confidence intervals for both of these data sets. These new data sets will form valuable products for the global environmental change community.
2. Data Sets
 In this section we describe how we compiled the two different sources of information used in this study: (1) satellite-based global land cover classification data sets; and (2) ground-based agricultural census/inventory data sets.
2.1. Satellite Data Sets
 We used two different high-resolution (1-km) satellite-based, global land cover classification data sets that are available for circa 2000: Boston University's Moderate resolution Imaging Spectrometer (MODIS) based global land cover product [Friedl et al., 2002] (BU-MODIS hereafter), and the Satellite Pour l'Observation de la Terre (SPOT) VEGETATION based Global Land Cover 2000 (GLC2000) data set [Bartholome and Belward, 2005]. The BU-MODIS land cover product used data acquired from 15 October 2000 to 15 October 2001 to derive 17 land cover classes using a supervised classification scheme (see Table 1 for legend). The GLC2000 data set utilized data acquired from 1 November 1999 to 31 December 2000 to derive 22 global land cover classes on the basis of regional classifications performed with the expertise of regional institutions (Table 1).
Table 1. Land Cover Classification Schemes Used by the Two Global Satellite Data Sets
 We applied a simple set of climatic parameters to mask obviously nonagriculture areas within the satellite data sets, else we obtain some spurious results in the Northern Hemisphere high latitudes (e.g., land classified as open shrublands are found both in the western United States and boreal Canada; because open shrublands have a small fraction of cultivation in the former we end up with croplands in boreal Canada also). This mask included all regions north of 50°N with Growing Degree Day (GDD; base 5°C) less than 1000°C d (see Figure S1). This GDD cutoff approximates that empirically estimated by Ramankutty et al. [2002a] (using their previous cropland data set that does not include a mask, hence there is no circularity), and is conservative compared to the 2000°C d cutoff for agriculture used by Cramer and Solomon . GDD data were calculated according to Ramankutty et al. [2002a], and interpolated to 1km resolution. We also masked out protected/minimal use areas in the central part of the Australian continent (Bureau of Rural Sciences, Land Use of Australia, Version 3: 2000/2001, Department of Agriculture, Fisheries and Forestry, Australian Natural Resources Data Library, Canberra, Australia, available at http://www.nlwra.gov.au/Data_Library/index.aspx), which otherwise gets classified entirely as pasture.
2.2. Agricultural Inventory Data
 We extensively compiled cropland and pasture inventory data for the globe at the national and subnational level (Figures 1a–1c and Table 2; more details in Text S1) for circa year 2000. We compiled data for 15,990 different administrative units of the world, ranging from political units like countries, states and counties, which represents a 46-fold improvement in the richness of our inventory data compared to our previous effort (348 units in the work of Ramankutty and Foley ). For 57 countries, we compiled census data at the subnational level (e.g., “Level 1” indicating states in the United States or India, provinces in Canada or Argentina, departments in Bolivia or Columbia, etc., and “Level 2” indicating smaller units like U.S. counties, Brazilian municipios, or Indian districts). For 159 countries, we used national level statistics from the Food and Agriculture Organization's (FAO) FAOSTAT database (http://faostat.fao.org); for these 159 countries, we calculated an average around the year 2000 using data from 1998 to 2002. For another 19 countries in our database, no FAOSTAT data was available, and we set the data to be missing (see Text S1 for details).
 We compiled the cropland and pasture data to be consistent with the FAO definition of “Arable lands and permanent crops” and “Permanent pastures” respectively. Arable land is defined by FAO (http://faostat.fao.org/site/375/default.aspx) as including “land under temporary crops (double-cropped areas are counted only once), temporary meadows for mowing or pasture, land under market and kitchen gardens and land temporarily fallow (less than 5 years). The abandoned land resulting from shifting cultivation is not included in this category. Data for arable land are not meant to indicate the amount of land that is potentially cultivable.” Permanent crops are defined as “land cultivated with crops that occupy the land for long periods and need not be replanted after each harvest, such as cocoa, coffee and rubber; this category includes land under flowering shrubs, fruit trees, nut trees and vines, but excludes land under trees grown for wood or timber.” Permanent pastures are defined as “land used permanently (5 years or more) for herbaceous forage crops, either cultivated or growing wild (wild prairie or grazing land). The dividing line between this category and the category ‘Forests and woodland’ is rather indefinite, especially in the case of shrubs, savannah, etc., which may have been reported under either of these two categories.”
 The agricultural inventory data were seldom available exactly for the year 2000 because agricultural censuses are only taken every 5–10 years a in most industrialized nations of the world, and less frequently in other countries. We collected inventory data between the years 1998 to 2002 where possible, but in several instances we relied on older data (see Table 2). For nations where the inventory data did not fall within the 1998–2002 period, or where cropland or pasture data were unavailable but some proxies were available (such as harvested area of individual crops or heads of livestock), we estimated cropland and pasture data for circa 2000 by calibrating the available information to national totals from FAOSTAT (see Text S1 for details).
 The quality of our census data is varied. Some regions of the world are not well represented in terms of the resolution of inventory data, with the African continent and the Former Soviet Union being the most underrepresented. Sometimes, the national level census statistics were inconsistent with the FAOSTAT data; in such cases we mostly relied on the national statistics (as recommended by FAO), except for a few cases where we believed that FAOSTAT data were more reliable (see Text S1 for details). Inconsistencies were a result of either unclear definitions of the category, or sometimes poor reporting by the national statistics agency. For example, the cropland census statistics for China has been noted by various studies to be particularly problematic [Crook, 1993; Frolking et al., 1999; Seto et al., 2000; G. K. Heilig, Can China Feed Itself?: A System for Evaluation of Policy Options, International Institute for Applied Systems Analysis, 1999, available at http://www.iiasa.ac.at/Research/LUC/ChinaFood/index_m.htm]. Here we have used the data for China from the study of Verburg and Chen , which seems reliable. The definition of permanent pasture is particularly problematic, as acknowledged by the FAO (http://faostat.fao.org/site/375/default.aspx). Some countries (e.g., the United States) clearly distinguish between grassland pasture and range and forest use land grazed, while most countries do not. So it is not clear whether grazed forest land or semiarid grazing is included under the definition of pastures. One egregious example is Saudi Arabia for which FAOSTAT reports 1.7 million km2 of permanent pasture in 2000, which is 80% of its total land area. However, most of Saudi Arabia is arid land and it is clear that much of the nomadic grazing areas are included under pasture. The Saudi Arabian subnational census data we obtained reports 486 km2 of pasture, which is 3500 times smaller; we have chosen to rely on this lower value. Similarly, FAOSTAT reports 4 million km2 of pasture in China. However, Verburg et al.  report a total of only 2.6 million km2 of grassland in China. In this case, we have used the FAOSTAT value for the total amount of pasture in the country (but our final estimates for China are lower; see section 4.1, paragraph 2). Similar inconsistencies exist in other countries and are reported in Text S1.
 In summary, our agricultural inventory database yields a global total of 15 million km2 of cropland and 31.5 million km2 of pasture. This compares to 15.3 million km2 of cropland and 34.4 million km2 of pasture reported by FAOSTAT; significantly, our census data compilation and interpretation yields about 8% less pasture than FAOSTAT. The difference between FAOSTAT and our own inventory for pasture can mainly be explained by differences in Saudi Arabia (as described earlier in section 2.2, paragraph 4), Australia, Nigeria, Brazil, Mexico, Indonesia, Argentina, Colombia, Russia, and Spain. There were very few countries where the national census pasture area was greater than FAOSTAT; the one exception is Iran, for which FAOSTAT reports 0.4 million km2 of pasture while the national census reported 0.9 million km2. For croplands, while the global total areas are comparable between FAOSTAT and our inventory, there are significant national differences. For example, the national inventories of Australia, Brazil, Canada, China and Turkey report lower cropland area compared to FAOSTAT, while Iran, Argentina, Nigeria, Mexico, and Indonesia report greater cropland area.
 The basic methodology for creating the new cropland and pasture data sets originated from our earlier work [Ramankutty and Foley, 1998], where we calibrated a high-resolution satellite-derived land cover data set against agricultural inventory data to derive a global map of croplands for 1992. In this paper, however, we updated the methodology in three important ways: (1) Instead of a single satellite-derived land cover data set, here we used a merger of two different satellite data sets; (2) instead of calibrating only the a priori identified agricultural land cover classes against inventory data (as in the work of Ramankutty and Foley ), we utilized all the land cover classes in our training procedure (as in the work of Hurtt et al.  and Cardille et al. ); and (3) because we had much higher-resolution census data compared to our previous efforts, we considered the census data sets to represent an approximate “truth”, and used the two-step method developed by Ramankutty  whereby the satellite data is used to spatially locate agricultural lands within an administrative unit, but the total area of agricultural land in the administrative unit is derived from the census data (with some exceptions; see section 3.5). The following section provides a detailed description of the steps taken to create the final data set (see Figure 2 for a flowchart of our algorithm).
3.1. Combining the Two Satellite-Derived Land Cover Classification Data Sets
Giri et al.  compared the GLC2000 and BU-MODIS land cover data sets and found some consistency at the aggregate class level, but widespread disparities in the details. A few other studies have also tried to intercompare and harmonize these two and other global land cover data sets [Herold et al., 2005; Jung et al., 2006; See and Fritz, 2006]. The GLC2000 data set was developed using SPOT vegetation data with the assistance of regional experts and used a flexible classification scheme. The BU-MODIS data set, on the other hand, was developed using a globally consistent procedure with a fixed classification scheme, but without regional expertise. The two data sets therefore bring different kinds of expertise and information that are potentially complimentary. Here we decided to capitalize on both by combining them into a single land cover data set. To do so, we overlaid the BU-MODIS and GLC2000 data sets and developed new land cover classes that contain all possible combinations of the individual land cover classes (see Table 3 for an example). The resulting combined land cover data set consists of 391 possible land cover types and is shown later in section 3.3 to provide more accurate results when calibrated against the inventory data, as opposed to using either data set individually.
Table 3. Examples of Combined Land Cover Categories
1 (evergreen needleleaf forest)
1 (tree cover, broadleaved, evergreen)
1 (evergreen needleleaf forest)
2 (tree cover, broadleaved, deciduous, closed)
1 (evergreen needleleaf forest)
3 (tree cover, broadleaved, deciduous, open)
1 (evergreen needleleaf forest)
4 (tree cover, needle-leaved, evergreen)
11 (shrub cover, closed-open, evergreen)
12 (shrub cover, closed-open, deciduous)
13 (herbaceous cover, closed-open)
14 (sparse herbaceous or sparse shrub cover)
3.2. Step 1: Calibrating the Satellite Data Sets Against the Agricultural Inventory Data
 We followed methods developed in previous efforts [Ramankutty and Foley, 1998; Cardille et al., 2002; Ramankutty, 2004] to merge the satellite data sets and the agricultural inventory data to develop global maps of croplands and pastures for the year 2000. As mentioned in section 3 (paragraph 1), our statistical data fusion procedure is different from Ramankutty and Foley  in a couple of major ways. First, to allow for potential misclassifications in the satellite-derived land cover data sets [e.g, Hurtt et al., 2001; Cardille et al., 2002], and also because pasture is not explicitly identified as a land cover class by the satellite-derived data sets, we utilized all land cover classes in our training procedure, as described below. Second, we used the two-step procedure of Ramankutty  which assumes that the inventory data is the “truth” (except for identified outliers), and uses the satellite data to spatially disaggregate the census data within each administrative unit.
 For each administrative unit, i, the proportion of cropland and pasture area from the inventory data, cfi and pfi respectively, was calculated by dividing the inventory cropland and pasture areas by the total land area (Ai) for the administrative unit. Then, we determined λj,i, which is the proportion of each of the satellite-derived land cover classes, j, within administrative unit i.
 We have = 1.0, where nλ is the number of land cover categories in the satellite data set.
 We formulated a linear model relating the satellite-derived data sets to the agricultural inventory data, as follows:
where αj and βj are unknown parameters associated with each land cover category j, and ɛc,i and ɛp,i are error terms that represent the residual difference between the inventory and linear model predicted cropland and pasture proportions respectively.
 Additionally, (1) and (2) were subject to the following constraints,
These constraints ensured that the cropland or pasture proportions in any pixel (when the model is later applied at pixel level) will be between 0 and 100%, and that the sum of cropland and pasture proportions will be less than or equal to 100%.
 We used a least squares minimization method to solve for the parameters αj and βj. In particular, we specified the weighted least squares error (LSE) to be minimized as:
where ni is the number of administrative units, and ωi is a term that weights the residuals ɛc,i and ɛp,i by the land area, Ai, of each administrative unit, normalized by the maximum value, i.e.,
To estimate the parameters in equations 1 and 2, we used a multiple linear regression model from the STARPAC package (http://www.cisl.ucar.edu/softlib/STARPAC.html). We developed three separate models, first using the BU-MODIS and GLC2000 data sets separately, and then using the combined land cover data set. In each case, we started with a complete model specifying all the land cover classes as potentially being cropland or pasture; the following classes: BU13 (urban), BU15 (snow and ice), BU16 (barren), BU17 (no data), GLC19 (bare), GLC20 (water), GLC21 (snow and ice), GLC22 (artificial surfaces), and GLC23 (no data), and their combinations in our combined land cover data set, were left out of the model. We then used stepwise regression using backward selection to estimate the parameters. The details of this procedure are outlined in Text S2.
 We applied our optimization procedure separately to six different regions of the world (Figure S2), similar to Ramankutty and Foley . These six regions were a compromise between selecting small enough regions with similar agricultural characteristics, but large enough regions to have enough observations within each to obtain robust parameter estimates. Clearly some of the regions extend across different types of agricultural land uses, but subdividing the world into smaller regions resulted in too few observations in some regions to get robust model estimates.
 The estimated parameter values (not shown) were used to make global cropland and pasture maps at 5 min spatial resolution in latitude by longitude for the BU-MODIS and GLC2000 data sets individually, and for the combined land cover data set (Figure 3). For each 5 min grid cell, x, y (latitude by longitude), we determined λj,x,y, which is the proportion of each of the satellite-derived land cover classes, j, within 5 min grid cell x, y. Using the estimates of αj and βj, we calculated cropland and pasture proportions in each 5 min grid cell, cfx,y and pfx,y respectively, using:
3.3. Comparison of the Performance of BU-MODIS, GLC2000, and Combined Data Sets
 We now present a comparison of the inventory agricultural land area to the predicted values for each data set from calibration Step 1 (from equations 6 and 7, aggregated from 5 min resolution to the administrative level) (Table 4). The cropland and pasture areas predicted using the combined satellite data set is better correlated to the inventory data compared to the models using either the BU-MODIS or GLC2000 data sets alone in every region except for a few exceptions (croplands in Africa/Middle East, and pastures in Europe and Russia and Australia/NZ). Further, neither the BU-MODIS nor the GLC2000 data set always performs better than the other; for example, in Africa and Middle East, GLC2000 predicts cropland much better than BU-MODIS, but the reverse is true for pastures. We next present a regional comparison of the inventory and predicted agricultural land areas from the combined data set (Figures 4a and Figures 4a and 4b). The most notable differences lie in the underestimation of pasture areas in Asia and Australia/New Zealand. We will discuss the significance of this underestimation in the next section.
Table 4. Weighted Correlation Coefficient Between Inventory Data and Model Predictions From Step 1a
Each data point is weighted by the area of the administrative unit.
Africa and Middle East
Europe and Russia
 We now consider the spatial patterns of predicted agricultural land from the combined data set versus the individual data sets, using South American pastures as an example (Figure 5). In Figure 5a the inventory data shows large extent of pasture along the arc of deforestation, along coastal and southern Brazil, in Uruguay, and in the Patagonia region of Argentina. Figures 5b and 5c are the BU-MODIS and GLC2000 based pasture maps. Both do a reasonable job of reproducing the inventory data in Figure 5a. The most glaring difference is in the Nordeste (northeast) region of Brazil, where both data sets overestimate the distribution of pasture as compared to the inventory data in Figure 5a. The combined land cover data was able to best reproduce the inventory data, especially in the Brazilian Nordeste region and Patagonia (Figure 5d).
 Thus by using the combined data set we are able to capitalize on whichever satellite-based land cover data set is best suited to each region. If we were to use either the BU-MODIS or GLC2000 data sets alone, we would get reasonably good global results, but would lose accuracy in some regions. The use of the combined land cover data set especially yields marked improvements over Asia and South America (Table 4). Therefore we use the combined data set in the next step of this study.
3.4. Bootstrap Procedure to Estimate Uncertainty
 We further used a bootstrap technique in order to estimate the uncertainty in our parameter estimates. This procedure was applied at this stage for only the combined land cover data set. We performed 1000 bootstrap runs, where the entire census data was sampled with replacement each time, and reestimated our regression model using the combined satellite-based land cover data set (Sampling with replacement used the standard statistical procedure wherein the census data formed the population (of ni values, the total number of administrative units), and for each of the 1000 sample sets we randomly selected ni values from the population, with the sampled value being replaced back into the population. Each sampling outcome is therefore independent of the previous outcome; that is, every census data value has equal probability of being chosen during every sampling event, no matter whether it has been chosen before). This yielded a distribution of values for each parameter αj and βj, and we report the mean and 90% confidence intervals here (see Data Set S1).
 Using the 1000 estimates of αj and βj, we calculated 1000 estimates of cropland and pasture proportions in each 5 min grid cell, cfx,y and pfx,y respectively, using:
From these 1000 estimates, we calculated the mean, 5th percentile and 95th percentile values of the cropland and pasture values, represented as cfx,ymean, cfx,y5th%;ile, and cfx,y95th%;ile, and as pfx,ymean, pfx,y5th%;ile, and pfx,y95th%;ile, respectively (figures not shown from this stage of analysis).
3.5. Step 2: Adjusting the Predicted Cropland and Pasture Data to Match Inventory Data
 In this final step, we followed the methods of Ramankutty , to adjust our spatially explicit predictions from Step 1 (the bootstrap model estimates using the combined data set) to match the inventory data at the administrative level where available. To do so, we first aggregated our 5 min resolution cropland and pasture data sets to the administrative level. We then compared them to the inventory data to derive a correction factor for each administrative unit. The correction factors were, however, constrained to be within a factor of 5 (i.e., to lie between 0.2 and 5.0) for administrative units that were considered outliers in the regression, thereby trusting the satellite-based land cover data more than the inventory data in those cases. Outliers were determined to be those administrative units with residuals (predicted cropland area from the Step 1 calibration procedure minus inventory cropland area) that were greater than 2 standard deviations from the mean. Correction factors were set to 1.0 for administrative units with missing data, thereby relying on the satellite-estimated spatial patterns from Step 1 in these units. We then applied Pycnophylactic Interpolation [Tobler, 1979] to the administrative level correction factors to obtain a smooth surface of correction factors at 5 min resolution (without this smoothing, artificial boundaries between administrative units might appear in the final product; note, however, that only the correction factors were smoothed, so any real boundaries in the original satellite data will remain). The spatial correction factors were then applied to our results from Step 1 to derive our final maps of cropland and pastures at 5 min resolution (Figure 6) and respective confidence intervals (Figure 7). The final equations for cropland and pasture proportions can be represented as
where μcfx,y and μpfx,y are the spatially explicit correction factors for cropland and pasture respectively.
4.1. Total Global Area of Croplands and Pastures in 2000
 Our final results (Figure 6) indicate that there were 15.0 (95% confidence range of 12.2–17.1) million km2 of cropland and 28.0 (95% confidence range of 23.6–30.0) million km2 ofpasture in the world in the year 2000. This compares to 15.3 million km2 of cropland and 34.4 million km2 of pasture reported by the FAOSTAT database. Thus we predict significantly lower extent of pasture (by 6.1 million km2 or ∼18% lower) than reported by FAO. Our own inventory data reports 15.0 million km2 of cropland and 31.5 million km2 of pasture. Thus our inventory data for pasture is already lower than FAO statistics; this difference was explained earlier in section 2.2. Our cropland extent for the year 2000 of 15.3 million km2 is lower than the 18.0 million km2 of cropland in 1992 from our previous study [Ramankutty and Foley, 1998]. This does not mean that cropland extent has decreased from 1992 to 2000; rather the difference is a result of changes in methodology: We plan to reconstruct a consistent historical time series; until then a comparison of our 1992 and 2000 data sets is precluded.
 Our final predicted pasture area is even lower than our inventory data, especially in Asia and Australia/New Zealand (as already evidenced in Figures 4a and 4b). We anticipated that this problem would be overcome in Step 2, when we scaled our spatial cropland and pasture data to match our inventory data, but despite this our final predicted extent of pasture differs significantly from our inventory data. This is because we did not allow pixels with predicted agriculture proportions of 0% in Step 1 to be scaled, and each pixel was limited to have a maximum agriculture proportion of 100%. The biggest difference is in China, where the inventory reports 4 million km2 of pasture, but we estimate a final area of only 2.9 million km2. Our estimate, however, is similar to the 2.6 million km2 of grassland reported by Verburg et al. . Verburg and van Keulen  discuss that much nontraditional grazing occurs in China, with goats grazing along roads, dry river beds, and in fields after harvest, and pigs in back yards; it is not clear how these landscapes should be characterized. It is also likely that grazed forestland and semiarid land are included under the pasture category in the Chinese inventory. Other big differences in pasture area arise in Australia (inventory area of 3.2 million km2 versus our prediction of 2.7 million km2), Mongolia (inventory of 1.3 million km2 versus our estimate of 0.9 million km2), Mauritania (inventory of 0.4 million km2 versus our 0.1 million km2), Iran (inventory of 0.9 million km2 versus our 0.6 million km2), and the United States (inventory of 2.3 million km2 versus our 2.1 million km2). It is interesting to note that all of these countries have significant amount of semiarid land. One potential explanation could be the variability in grazing associated with the interannual changes in climate that can be quite extreme in semiarid regions; the satellite data set only captures a single year of land cover condition, which can vary from year to year. For croplands, while the global totals agree, there are compensating national level differences but these differences are less significant compared to the differences in pasture areas.
 Our final predicted cropland extent of 15.0 million km2 in year 2000 amounts to roughly 12% of the global land area (excluding Greenland and Antarctica), and pasture extent of 28.0 million km2 amounts to 22% of global land area. Thus humans are using 34% of the global land area for their agricultural needs.
4.2. Geographic Distribution of Croplands and Pastures in Year 2000
 We analyzed the distribution of agricultural regions by 14 different regions of the world (Table 5). The greatest proportion of croplands in the world are found in South Asia, Europe, Southeast Asia, and United States, east of the Mississippi, while the greatest proportion of pastures are found in Argentina, Uruguay and Chile, Pacific developed countries, China, Mexico and Central America, United States, west of the Mississippi, and tropical Africa. The smallest proportion of croplands are found in Canada, the Pacific developed Countries, and northern South America, while the smallest proportion of pastures are found in Southeast Asia, Canada, and United States, east of the Mississippi.
Table 5. Cropland and Pasture Areas by Different Regions of the World
Land Area, million km2
Cropland, million km2
Pasture, million km2
Cropland, % of Land Area
Pasture, % of Land Area
United States, east of Mississippi
United States, west of Mississippi
Mexico and Central America
South America, northern Portions
Argentina, Uruguay, and Chile
North Africa and Middle East
Former Soviet Union
Pacific developed countries
 We also examined which potential natural vegetation types of the world have been most affected by agriculture (Table 6). We overlaid our agricultural maps over the global map of potential natural vegetation developed by Ramankutty and Foley . We find that croplands have mostly replaced temperate deciduous forests (in Europe and eastern United States), and tropical deciduous forests (in South Asia), while, pastures have mostly replaced grasslands, savannas, and shrubland. Roughly 30% of temperate deciduous forests have been converted to cropland, while 50% of grasslands have been converted to pasture. However, this global picture varies regionally (Table 7). While forests have been cleared for croplands predominantly in Asia, a substantial amount of savanna and grasslands have been converted to croplands in North America, Africa, and the Former Soviet Union. Also, a significant amount of forests in South America have been cleared for pastures, even though globally most pastures have replaced savanna/grasslands.
Table 6. Cropland and Pasture Areas by Different Biomes of the World
Land Area, million km2
Cropland, million km2
Pasture, million km2
Cropland, % of Biome Area
Pasture, % of Biome Area
Tropical evergreen forest
Tropical deciduous forest
Temperate broadleaf evergreen forest
Temperate needleleaf evergreen forest
Temperate deciduous forest
Boreal evergreen forest
Boreal deciduous forest
Evergreen/deciduous mixed forest
Table 7. Cropland and Pasture Areas by Different Regions and Biomes of the World
Area in million km2
Former Soviet Union
 Next, we examined the amount of spatial overlap between croplands and pastures (Figure 8). Our analysis shows that cropland and pastures are distinct biomes over much of our planet's land surface. The regions of the world with significant overlap lie along the western edge of cultivation in the midwestern United States and in Texas, northeast Brazil, in parts of West Africa, eastern China, Europe, and the former Soviet Union, scattered regions elsewhere. This is not to say that grazing does not occur in cultivated regions of the world; many regions of the world are characterized by multifunctional agricultural lands, subject to different uses during different parts of the year (e.g., grazing occurs following the harvest of a crop) [Reenberg and Fog, 1995]. Therefore this lack of overlap partly reflects the inability of global monitoring systems, including satellite data and agriculture inventory data, to characterize multiple uses, and land is often classified as a single category. Our final estimates likely underestimate the real overlap between cultivation and grazing, especially the multiple uses that occur within a year.
 We also investigated the frequency distribution of croplands and pastures globally, and across the 14 different regions of the world. We calculated probability distributions using all pixels with nonzero cropland or pasture values, and estimated what proportion of total cropland or pasture area can be attributed to different categories of cropland or pasture proportions (Figure 9). Globally, we find that a large proportion of the total cropland area comes from land that is between 60% and 80% cultivated. This pattern is observed in most regions of the world, except in northern South America, tropical Africa, and North Africa and Middle East (where larger number of pixels with low proportion of cultivation contribute most to the total cropland area, although in the latter two regions, there are a few pockets of greater than 90% cropland). With pastures, the total area is dominated by pixels with greater than 90% pasture. This pattern is true over most regions of the world, but is reversed in the eastern United States, Europe, and Southeast Asia (and South Asia, to a lesser extent). However, it is to be noted that the proportion of pasture in a grid cell provides no information on the grazing intensity in that grid cell; an area of grassland with very low stocking density of livestock would have the same proportion of pasture as one with very high stocking densities.
4.3. Evaluation Against Other Independent Data Sets
 While there are no consistent, global spatial data sets of agriculture in the year 2000 to evaluate our products against, there are numerous regional products against which we can compare our global products (see Figures S3–S12). While many of these regional data are not for the year 2000 (and are sometimes a decade older), they nevertheless provide an independent measure of the large-scale spatial patterns of agriculture in these regions. These data were not always available in a consistent digital format; they were often in vector format that is difficult to quantitatively compare to our raster data and sometimes only available as images in publications. Therefore rather than making quantitative comparisons we present detailed regional maps from our data set compared to these independent regional maps, as online supplements (Figures S3–S12).
 Our regional comparisons are mostly for croplands because there are few spatial data sets depicting pastures (likely because it is difficult to distinguish between natural and grazed grasslands). Visual comparison (Figures S3–S12) suggests that our distribution of croplands is reasonable in North America (United States, Canada, and Mexico), and so is the distribution of pastures in the United States (except for noticeable differences in northeastern Texas, and eastern Oklahoma, arising from lower pasture census values). In the Amazon Basin, our distribution of both croplands and pastures shows similar geographic patterns but greater intensity compared to Cardille et al. . This may be a result of our data being representative of 2000, while Cardille et al.  data is for the mid 1990s (changes in the southern Amazon are rapid) or because of differences in statistical methods with the regression tree method of Cardille et al.  not able to deal well with extremes. In China also, our geographic patterns of croplands matches well with that of Liu et al. , but our intensity is lower (the differences are greater than can be explained by the different spatial resolutions of the two data sets). In West Africa, our cropland distribution from this study is a significant improvement over our earlier work [Ramankutty and Foley, 1998], but consistent with our more recent effort [Ramankutty, 2004]. Our distribution of croplands compares reasonably well with Africover data in East Africa [Latham, 2001] (http://www.africover.org/) but is not a significant improvement over our earlier effort, while in South Africa, our estimated patterns are an improvement over our previous effort but of much lower intensity. In Europe, our data set of croplands compares well to the CORINE land cover database. We seem to underestimate croplands in eastern Australia, while our distribution of pastures is significantly underestimated in the Northern Territory. The problem with pastures in Australia actually emerges from Step 2, with the census data in the Northern Territory suggesting very little pastures; our predicted pattern from Step 1 compares better to the data from the Australian Natural Resources Atlas.
5. Discussion and Conclusions
 We have merged two different satellite-based land cover classification data sets with an extensive compilation of national and subnational level agricultural census statistics to develop global maps of croplands and pastures for the year 2000. These maps form the first comprehensive characterization of the distribution of global agricultural lands in the year 2000, describing the spatial extent of croplands and pastures within 5 min resolution (∼10km) grid cells in latitude by longitude; in addition, 90% confident intervals on the mean estimates are presented for the first time. In creating these new data sets, we have built on our earlier work in 1998 [Ramankutty and Foley, 1998], where we developed a statistical “data fusion” method to merge satellite data and census data to map global croplands in 1992. Here we have improved our statistical methods, combined two new satellite data sets, and enriched our agricultural census data to update our global croplands map to the year 2000, and to create a new global pasture data set for 2000. We would like to strongly caution that our two croplands maps from 1992 and 2000 cannot be directly compared to detect changes over that time period: Changes in methodology and data sources preclude such a direct comparison. We plan to develop a consistent historical time series of croplands and pastures in the future.
 The global area of cropland from this study for the year 2000 of 15.0 million km2 is smaller than the area of 18 million km2 for 1992 estimated by our earlier study [Ramankutty and Foley, 1998]. This is not a real decrease, but rather an artifact of change in methodology between the two studies. In this study, we used our Step 2 to scale the calibrated cropland patterns to identically match the census data. In our earlier study our final result was directly out of the calibration and we did not do any scaling. We have changed our philosophy here to trust the total area of agricultural land reported by the census data (unless they are outliers in the Step 1 calibration procedure), and use the satellite data sets for information on the spatial distribution within each census administrative unit.
 The global area of pasture of 28.0 million km2 is 18% lower than the standard FAOSTAT estimate of 34.4 million km2. The major differences are found in Saudi Arabia, Australia, China, and Mongolia. This is likely because the census data on pasture reported to FAOSTAT include grazed forestland and semiarid land. The definition of pasture has always been problematic, as acknowledged by FAOSTAT (http://faostat.fao.org/site/375/default.aspx), and one way to improve the situation in the future may be to develop global maps of livestock density [e.g., Kruska et al., 2003; W. Wint, Global Livestock Distributions, 2005, data archive prepared by Environmental Research Group Oxford Limited for the Pro-Poor Livestock Policy Initiative of the Animal Production and Health Division of the Food and Agriculture Organization of the United Nations, Rome, Italy, available at http://ergodd.zoo.ox.ac.uk/agaagdat/index.htm], and overlay that data with an independent global estimate of herbaceous vegetation.
 Although we now have new global estimates of the distribution of cropland and pastures, several caveats need to be noted. First, there is much misunderstanding and confusion regarding the definitions of croplands and pastures. In this study, we have followed the FAO definition, as described in section 2.2 (paragraph 2). For croplands, this includes temporary fallow lands (less than 5 years), which are not cultivated. It is not clear how strictly this restriction of less than 5 years was applied when accounting for fallow land. For example, the U.S. data on croplands used by FAOSTAT includes idled cropland, which includes land under the Conservation Reserve Program that amounts to roughly 9% of the total cropland area, and is often idled for longer than 5 years [Lubowski et al., 2002]. Secondly, in tropical nations characterized by extensive fallow cropping systems, such a definition may include much land that is not currently cultivated, and therefore portray a misleading picture of what may be commonly thought of as cropland. Finally, these definitions of croplands say nothing about the productivity of the land, but this is the topic of our companion article [Monfreda et al., 2007].
 The definition of pasture is subject to even greater uncertainty. The definition of pasture is not consistently clear in the census data that we compiled in this study, or any of the maps we compared our data against (Figures S3–S12). Where is the dividing line between herding and grazing? For example, is reindeer herding reported under pastureland? Does the data represent both planted pastures and natural pastures? Is grazing underneath a forest cover or in semiarid areas included in the pasture data? In other words, pasture is only a subset of the land (on herbaceous vegetative cover) that is used for grazing. The data on extent of pasture also says nothing about the intensity of grazing: An acre of land with one cow and another acre with 10 cows would both be considered 1 acre of pastureland.
 An additional concern related to the definition of croplands and pastures arises from the existence of multifunctional landscapes, as discussed in section 4.2 (paragraph 3) [Reenberg and Fog, 1995]. In some countries, especially in Asia and Africa, land is cropped for a while, and then after harvest, is grazed for the remainder of the year. Thus during the year the land is put to multiple uses, and it is not clear how to classify these lands, and how these lands were accounted for in the census statistics. Mixed use classifications need to be used to characterize such landscapes rather than discrete classes such as cropland and pasture. It is not clear how much of the global agricultural land area is influenced by such multifunctional land use practices. Future agricultural census data compilation methods need to be encouraged to deal explicitly with multifunctional landscapes.
 Finally, our synthesized data sets have uncertainty related to the fact that we are trying to merge two different observation systems: remote-sensing based and ground based. Remote sensing satellites can only observe land cover, i.e., the top of the vegetative canopy, and have little information on what happens below the canopy. The ground-based land use data, on the other hand, may include different information. For example, the cropland census data include permanent crops such as tree crops. It is not clear whether the remote sensing observations consider tree crops as tree cover or whether they classify them as cropland. Similarly, as discussed in section 2.2 (paragraph 4), census data may not distinguish between grazing on grasslands, forests, and bare ground.
 There is great demand by the global environmental change community to understand how global agricultural lands are changing and evaluate their implications for a sustainable future [e.g., Tilman et al., 2001; Foley et al., 2005]. Therefore despite large uncertainties we need to make progress toward developing new methods to characterize the spatial patterns of global agricultural lands. Here we have obtained the best available global data on agricultural lands and synthesized them to create a single homogeneous database of the world's croplands and pastures. We believe that these data sets would be enormously useful to at least two different communities of scientists/practitioners: (1) global change scientists, interested in the consequences of global agriculture for climate, carbon cycle, water resources, etc., who would use these data sets for global-scale analysis or as inputs to climate and ecosystem models [e.g., McGuire et al., 2001; Myhre and Myhre, 2003; Jain and Yang, 2005]; and (2) ecologists and conservation practitioners, interested in the role of agriculture in modifying natural ecosystems and habitats, reducing biodiversity, and enhancing species extinction [e.g., Green et al., 2005; Vandermeer et al., 2005].
 We are grateful to Peter Verburg from Wageningen University for sharing his China data. We would like to thank Mark Friedl from Boston University for providing feedback on early versions of this work. We would like to thank George Allez for fabulous help with data compilation and entry. The comments from Eric Lambin, Alberte Bondeau, and another anonymous reviewer greatly helped us in improving this manuscript. This work was accomplished through support from a NASA Interdisciplinary Science (IDS) grant, for which we are thankful.