Oil tracking in the Gulf of Mexico in response to the Deepwater Horizon accident requires timely and accurate observations of major circulation patterns such as the Loop Current and LC eddies. When the eastern GOM becomes nearly isothermal at the surface and the use of sea surface temperature imagery is limited, MODIS ocean color data can be used instead. However, frequent and extensive sun glint prevents such an application when glint reflectance, Lg, is >0.01 sr−1. Here, an empirical approach is developed to remove sun glint and clouds based on band ratios between the Rayleigh-corrected reflectance (Rrc) at 469, 555, 645, 859, and 1240-nm. To minimize the effect of residual errors due to variable aerosols and imperfect glint correction, a color index (CI) is derived to represent the color patterns. Comparison between results from adjacent days with different glint and aerosol patterns suggests that the approach is able to derive consistent color patterns under severe sun glint (Lg < 0.15 sr−1). Tests of the approach over the Tropical Atlantic, East China Sea, and ocean waters off South Africa further validate the approach's general applicability. The color index (CI) also shows significant correlation with MODIS band-ratio Chl (<1 mg m−3) for each case examined. The simple design of the approach makes it straightforward to implement for other subtropical and tropical regions when a qualitative MODIS CI is desired to infer circulation patterns and to trace eddies under severe sun glint.
 Tremendous success has been achieved in establishing a long-term data record for several key parameters derived from satellite ocean color measurements [McClain et al., 2004, and references therein]. The data products, starting from the CZCS (1978–1986) era to today's SeaWiFS, MODIS, MERIS, and other sensors, have seen continuously increasing usage in various Earth Science disciplines, from research, management, to education (see technical report series of the International Ocean Color Coordination Group).
 Nearly all previous efforts in ocean color algorithm development have focused on accurate, validated bio-optical data products such as surface chlorophyll-a concentrations (Chl) [O'Reilly et al., 2000, and references therein]. Accuracy is the key because time-series analyses call for the most robust products to study temporal changes. However, there are cases where accuracy should be relaxed to allow for more data coverage. An example is when ocean color patterns are more important than absolute Chl values. In these cases, the current accuracy-driven algorithm/processing design often makes observation difficult under non-optimal conditions, for example when significant sun glint is present (Figure 1a).
 The recent Gulf of Mexico (GOM) oil spill event represents an example of such a case. Following a massive explosion and fire burning on 20–21 April 2010, the resultant Deepwater Horizon oil spill posed an unprecedented threat to the GOM ecosystems. Numerous efforts from Federal and state agencies, academia, environmental groups, and private entities were made to monitor, study, and mitigate the potential impacts. One such effort was oil tracking by combining satellite observations and numerical modeling (Y. Liu et al., Tracking the Deepwater Horizon oil spill: A modeling perspective, submitted to Eos, Transactions, American Geophysical Union, 2010).
 The oil tracking effort has two components. The first is to delineate surface oil slicks using MODIS imagery [Hu et al., 2009] and Synthetic Aperture Radar imagery [Liu et al., 2000]. The second is to model and observed circulation patterns such as the Loop Current and LC eddies that may pull the oil to reach remote regions such as the Atlantic Ocean [e.g., Hu et al., 2005]. During the summer months, the GOM becomes isothermal at the surface with minimal spatial contrast in sea surface temperature. During this period, ocean color imagery could be used to observe spatial patterns to infer circulations (data from altimeters have much lower resolution and are not available in near real-time). Unfortunately, when the default NASA processing software (SeaDAS6.1) is used to process the data, an image mask is created for the areas under significant and extensive sun glint, making the image useless (Figure 1b). This is due to both unreliable glint correction and saturation of the 1-km ocean color bands.
 Recognizing the pressing need to observe ocean circulation patterns in near real-time in response to the unprecedented oil spill, the objective of this work is to develop a new approach to derive MODIS ocean color patterns under severe sun glint. The manuscript is arranged as follows. The background for glint correction is briefly introduced, followed by the approach to derive a color index (CI) using a baseline subtraction method, and the approach for glint correction and cloud masking. Finally, the accuracy and potential application of the MODIS CI imagery for the global ocean are discussed.
2. Current Glint Correction
 The satellite measured radiance from an image pixel, after correction for gaseous absorption, comes from several sources:
where λ is the wavelength, Lr is from Rayleigh scattering, La is from scattering by aerosols and aerosol-Rayleigh interactions, t is the pixel-to-satellite diffuse transmittance, Lw is the water-leaving radiance, T and T0 are the pixel-to-satellite (view angle: θ) and sun-to-pixel (solar zenith: θ0) beam transmittance, respectively, F0 is the extraterrestrial solar irradiance, and Lg is the normalized sun glint reflectance (in units of sr−1) that depends only on the sea state and solar-viewing geometry (Θ). For simplicity, whitecap contribution is omitted here.
 The ultimate goal of the atmospheric correction (including glint correction) is to derive Lw from Lt. The procedure of using near-infrared bands to derive aerosol properties and then correct the visible bands in the absence of sun glint has been described by Gordon and Wang .
 When sun glint is present, Wang and Bailey  developed a procedure (based on Cox and Munk's  model to estimate sea surface roughness) to model Lg using surface winds and Θ, and the correction was implemented in SeaDAS. The correction is supposed to derive reliable Lw for Lg < 0.01 sr−1, beyond which an image mask is created to prevent further processing (the pixels with 0.005 < Lg < 0.01 sr−1 are flagged as “high glint”). However, Figure 1 shows an example where most of the eastern GOM has Lg > 0.01 sr−1. Even after the SeaDAS processing options are forced to bypass the flag checking, most of the pixels are still masked (Figure 1b) due to atmospheric correction failure (the correction was not designed to deal with high reflectance in the near-IR or shortware-IR). Aside from the glint contamination, MODIS 1-km ocean color bands saturate over bright targets. The saturation radiance for bands 547, 667, 748, and 869 nm is about 6.96, 3.50, 2.23, and 1.30 mW cm−1μm−1 sr−1, respectively. For θo = 30°, these values correspond to at-sensor reflectance of 10.1%, 6.3%, 4.75%, and 3.7%, respectively.
 Another correction approach is to derive the glint contribution from the measured near-IR radiance [Hochberg et al., 2003] instead of a wind-dependent model. The relative glint patterns are first derived from the NIR band (scaled by the darkest and brightest pixels), and then used to subtract the glint contribution proportionally from the co-registered visible bands. One limitation of this approach is the lack of the correction term Tλ(θ)T0,λ(θ0) (equation (1)), which depends not only on the pixel location but also on wavelength. For a small Ikonos image the term may be assumed as a constant, but this assumption is no longer valid for large-swath (2330-km) MODIS images.
3. An Ocean Color Index
 Since MODIS ocean bands at 1-km resolution saturate over severe sun glint, MODIS land bands at 500-m and 250-m resolutions must be used. These bands were designed to cover a higher dynamic range at the price of lower signal-to-noise ratio for the ocean, but are sufficient to observe color patterns. Here, a color index (CI) is derived as:
where Rrc is the Rayleigh-corrected reflectance and the numbers denote MODIS wavelengths in nanometers. CI is actually Rrc,555 normalized against a linear baseline between 469 and 645 nm.
Figure 1c shows the MODIS CI image. There is no longer data saturation or image mask. However, two problems exist. The first is that the color patterns are distorted (high CI values) under sun glint. This is because of the relatively lower T469T0,469 and lower Rrc,555′ in equation (2). The second is that cloud pixels show high CI values. These artifacts need to be corrected.
4. Empirical Glint Correction
 Glint contribution to Rrc is proportional to T(θ)T0(θ0)Lg(Θ) (equation (1)), where both T and T0 depend on Lr (known) and La (unknown). Although Lg can be modeled [Wang and Bailey, 2001], the accuracy depends on wind (110-km per pixel resolution) and bottom depth. The model artifacts, independent of the MODIS measurements, can induce large errors for 1-km or higher resolution MODIS data when Lg is > 0.01 sr−1.
 Based on these considerations, an empirical, partial correction was developed. Visual examination of several images in June 2010 in the eastern GOM suggested that Rrc,859 = 0.02 might be used as a threshold to find pixels with glint contamination, and glint contribution in other bands was scaled to Rrc,859 as:
Rg was then subtracted from Rrc, with the latter used in equation (2) to derive the CI. The coefficients were determined through analyzing the statistical relationships between the visible and NIR bands in glint and glint-free (Lg < 0.001 sr−1) regions from the June 2010 images, and through trial-and-error adjustment until consistent color patterns can be observed between adjacent glint and glint-free regions. The coefficients were determined as α = 0.73, β = 0.87, and γ = 0.93, and they were used to process all glint-contaminated images since June 2010. Figure 1d shows the corrected CI image corresponding to Figure 1c.
5. Empirical Cloud Masking
 A simple threshold method of cloud masking will not work because both clouds and sun glint are bright (high Rrc values). More sophisticated methods [e.g., Frey et al., 2008] use 1-km thermal bands and other tests to flag clouds. The 1-km resolution may not be adequate for the 500-m data, and its ability to differentiate sun glint from clouds is yet to be tested.
 Based on the fact that sun glint is more reddish than clouds, an empirical method was developed through examining the statistical relationship between the spectral bands for glint and cloud pixels, respectively, from the June 2010 images. A pixel is classified as cloud if
where the shape factor is defined as: S(Rrc,469, Rrc,555) = Rrc,555 − 1.27* Rrc,469. This method basically lifts the cloud threshold of Rrc,1240 = 0.0235 [Wang and Shi, 2006] to 0.04, because the baseline subtraction method can effectively remove thin-cloud contaminations [e.g., Hu, 2009].
Figure 1d shows that the empirical glint correction led to consistent color patterns between glint and glint-free regions, and the empirical cloud masking was able to distinguish clouds from glint for Lg < 0.15 sr−1. This is 15 times higher than the glint mask threshold used in SeaDAS. Further, the downgrade in resolution from 500-m to 1-km removed small (1–2 500-m pixels) clouds to improve visualization. Other examples from adjacent days in Figure 2 show similar results, where CI patterns in the same regions appear consistent through time, even when glint patterns varied significantly. The circulation patterns of the LC and LC eddies are clearly visible in these de-glinted images. More examples for other regions (Tropical Atlantic, East China Sea, and waters off South Africa) are provided in the auxiliary material.
7. Discussion: Accuracy and Application
 The CI method differs from the traditional band-ratio algorithms because of its use of the linear baseline subtraction. Previous efforts using baseline subtraction focused on the red and NIR bands to detect algal blooms [Letelier and Abott, 1996; Gower et al., 2005; Hu, 2009], and this is the first time that a blue-green-red band subtraction is attempted. Hu  showed that when detecting floating algae, a baseline subtraction is impacted less by atmospheric correction errors or changes in aerosols/observing conditions than is a band-ratio method. This is primarily because those errors are linearly proportional to wavelength to the first order, and therefore can be subtracted. Likewise, the residual errors of the empirical glint correction as well as errors induced by the threshold of Rrc,859 = 0.02 may be removed, at least in part, by the linear subtraction in the CI method.
 Thus, the baseline subtraction method works because 1) most of the aerosol effects and glint effects are removed by subtraction, and 2) in most open oceans the 469-nm band is more sensitive than the 555- and 645-nm bands to changes in biomass (Chl), and CI is effectively a measure of the relative changes between 469 and 555 nm. If Chl can be derived using the 469/555 ratio, in principle it can also be derived using the 469-555-645 CI. Indeed, even though the objective here is to derive the relative color patterns, Figure 3 shows that CI is significantly correlated with Chl for Chl < 1 mg m−3, and the relationship appears to be stable over time. If we were to derive Chl on 17 June using CI on 17 June and the relationship from 12 June, the RMS “error” in the CI-derived Chl would be 20.5% (0.085 in log scale) for all data points with CI ≤ 0.02 (n=298907) (this represents all “clear-water” pixels in the eastern GOM). Further, most of the scatter for Chl < 1 mg m−3 originates from different geographic regions (e.g., LC eddies, NE GOM, central-west Florida), which will not affect the color contrast in the CI image for a given region. For example, much tighter relationships are observed between CI and Chl in two arbitrary nearshore-offshore transects near Charlotte Harbor and Florida Keys (red and green symbols in Figure 3, respectively). Therefore, for most waters (in the 12 and 17 June cases, >85% of the valid pixels), CI may be used as a relatively stable index for Chl.
 The limited examples above show preliminary success of the approach for the GOM since June 2010. Can it be extended to other regions? Although a complete evaluation over the entire global data archive is difficult to achieve due to the data volume, several regions were randomly selected, including the Tropical Atlantic (Amazon River plume and North Brazil Current), the East China Sea (Yangtze River plume), and waters off South Africa. These cases represent a variety of situations on aerosols (optical thickness at 869 nm between 0.02 and 0.3), glint conditions (Lg between 0 and 0.15 sr−1), and solar/viewing geometry (θ0: 15–30°; θ: 0–40°). The results, presented in the auxiliary material (Figures S1–S3), suggest that the approach and the GOM-based correction coefficients may be generally applicable in the global ocean. For all cases, the coefficient of determination (r2) between MODIS CI and band-ratio Chl is always > 0.9 for Chl < 1.0 mg m−3 (slopes and intercepts for the CI-Chl regression, as in Figure 3, range between 0.0078–0.0099 and 0.0125–0.0158, respectively), with RMS difference between CI-predicted Chl and band-ratio Chl < 30% (Figures S1–S3).
 The empirical glint correction is only a partial correction specifically designed to be used with the MODIS CI, where the T(θ)T0(θ0) effect in equation (1) is implicitly removed in equation (3). However, the residual errors from the partial correction, often much weaker in spatial contrast than that of the underlying ocean color patterns, would not affect the interpretations of the various water masses. Indeed, CI images are particularly useful in observing meso-scale ocean circulations because of their significantly improved coverage. For the GOM, the patterns derived from near-daily MODIS data were not only used in qualitative validation of the numerical circulation models (Liu et al., submitted manuscript, 2010), but also used to visually estimate the surface oil trajectory with oil locations determined from MODIS and SAR data and superimposed on the CI images (Figure 2, black outlines). For other subtropical and tropical regions, Figures S1–S3 show the approach's ability to reveal river plume and meso-scale eddy features, including the rings off South Africa associated with the Agulhas Current. Given the extensive sun glint coverage in all subtropical and tropical regions for most time of the year, a similar approach may be implemented for any particular region to derive qualitative ocean color patterns that are otherwise impossible to observe. Such MODIS CI imagery may add important values to any regional ocean observing systems [Weisberg et al., 2009].
 One limitation of the approach, however, is that it is specific to the satellite instrument, i.e., all instrument characteristics including calibration are implicitly included in the empirical coefficients. Indeed, when the current coefficients, derived using the MODIS/Aqua instrument, were applied to MODIS/Terra to remove sun glint, significant residual errors occurred. Therefore, the coefficients in equation (3) need to be adjusted for MODIS/Terra, after the significant striping errors are corrected.
 An empirical approach is developed to derive a MODIS color index (CI) using a baseline subtraction method from the Rayleigh-corrected reflectance. MODIS land bands at 500-m and 250-m resolutions are used to avoid saturation, and empirical coefficients are derived from statistics of several GOM images in June 2010 to remove sun glint contamination and to differentiate sun glint from clouds. The objective, as demanded by the various oil-spill response efforts, is to derive color patterns even under the most significant sun glint (Lg ∼ 0.15 sr−1) to help explain major circulation patterns in the GOM. A preliminary comparison with the “standard” MODIS band-ratio Chl shows that the empirical MODIS CI appears to be a reliable index to represent the surface ocean biomass for most of the GOM ocean waters. Tests in other subtropical and tropical regions suggest that the same approach and empirical coefficients may be applicable in the global ocean.
 This work is supported by the US NASA Ocean Biology and Biogeochemistry program and Gulf of Mexico program. MODIS data are provided by University of South Florida (F. Muller-Karger) and NASA Goddard Space Flight Center. The help from the NASA SeaDAS team (S. Bailey and B. Franz) in an attempt to remove glint in SeaDAS processing is appreciated. The author is also grateful to two anonymous reviewers for their critical comments and suggestions to test the approach's applicability in the global ocean.