Primary and secondary contributions to ambient levels of volatile organic compounds (VOCs) and aerosol organic carbon (OC) are determined using measurements at the Pittsburgh Air Quality Study (PAQS) during January–February and July–August 2002. Primary emission ratios for gas and aerosol species are defined by correlation with species of known origin, and contributions from primary and secondary/biogenic sources and from the regional background are then determined. Primary anthropogenic contributions to ambient levels of acetone, methylethylketone, and acetaldehyde were found to be 12–23% in winter and 2–10% in summer. Secondary production plus biogenic emissions accounted for 12–27% of the total mixing ratios for these compounds in winter and 26–34% in summer, with background concentrations accounting for the remainder. Using the same method, we determined that on average 16% of aerosol OC was secondary in origin during winter versus 37% during summer. Factor analysis of the VOC and aerosol data is used to define the dominant source types in the region for both seasons. Local automotive emissions were the strongest contributor to changes in atmospheric VOC concentrations; however, they did not significantly impact the aerosol species included in the factor analysis. We conclude that longer-range transport and industrial emissions were more important sources of aerosol during the study period. The VOC data are also used to characterize the photochemical state of the atmosphere in the region. The total measured OH loss rate was dominated by nonmethane hydrocarbons and CO (76% of the total) in winter and by isoprene, its oxidation products, and oxygenated VOCs (79% of the total) in summer, when production of secondary organic aerosol was highest.
If you can't find a tool you're looking for, please click the link at the top of the page to "Go to old article view". Alternatively, view our Knowledge Base articles for additional help. Your feedback is important to us, so please let us know if you have comments or ideas for improvement.
 Airborne particulate matter (PM) can adversely affect human and ecosystem health, and exerts considerable influence on climate. Effective PM control strategies require an understanding of the processes controlling PM concentration and composition in different environments. The Pittsburgh Air Quality Study (PAQS) is a comprehensive, multidisciplinary project directed at understanding the processes governing aerosol concentrations in the Pittsburgh region [e.g., Wittig et al., 2004a; Stanier et al., 2004a, 2004b]. Specific objectives include characterizing the physical and chemical properties of regional PM, its morphology and temporal and spatial variability, and quantifying the impacts of the important sources in the area.
 Volatile organic compounds (VOCs) can directly influence aerosol formation and growth via condensation of semivolatile oxidation products onto existing aerosol surface area [Odum et al., 1996; Jang et al., 2002; Czoschke et al., 2003], and possibly via the homogeneous nucleation of new particles [Koch et al., 2000; Hoffmann et al., 1998]. They also have strong indirect effects on aerosol via their control over ozone production and HOx cycling, which in turn dictate oxidation rates of organic and inorganic aerosol precursor species. Comprehensive and high time resolution VOC measurements in conjunction with particle measurements thus aid in characterizing chemical conditions conducive to particle formation and growth. VOC data can also yield information on the nature of source types impacting the study region [Goldstein and Schade, 2000], photochemical aging and transport phenomena [Parrish et al., 1992; McKeen and Liu, 1993], and estimates of regional emission rates [Barnes et al., 2003; Bakwin et al., 1997], all of which can be useful in interpreting other gas and particle phase measurements.
 This paper describes the results from two field deployments, during January–February 2002 and July–August 2002, in which we made in situ VOC measurements alongside the comprehensive aerosol measurements at the PAQS site, with the aim of specifically addressing the connection between atmospheric trace gases and particle formation and source attribution. The data set provides an opportunity to examine aerosol formation and chemistry in the context of high time resolution speciated VOC measurements.
 The specific goals of this paper include: characterizing the dominant source types impacting the Pittsburgh region, their composition and variability; assessing the relative importance of different types of VOCs to regional photochemistry, and the relationship between aerosol concentrations and the chemical state of the atmosphere; and quantifying the relative importance of primary and secondary sources in determining organic aerosol and oxygenated VOC (OVOC) concentrations. For the latter we quantify the primary emission ratios for species with multiple source types, by correlation with combustion and photochemical marker compounds.
2.1. Pittsburgh Air Quality Study (PAQS)
 The field component of the Pittsburgh Air Quality Study was carried out from July 2001 through August 2002. Measurement platforms consisted of a main sampling site located in a park about 6 km east of downtown Pittsburgh, as well as a set of satellite sites in the surrounding region. For details on the PAQS study, see Wittig et al. [2004a] and the references cited therein. Measurements described here were made at the main sampling site.
2.2. VOC Measurements
 A schematic of the VOC measurement setup is shown in Figure 1. To provide information on as wide a range of compounds as possible, two separate measurement channels were used, equipped with different preconditioning systems, preconcentration traps, chromatography columns, and detectors. Channel 1 was designed for preconcentration and separation of C3–C6 nonmethane hydrocarbons, including alkanes, alkenes and alkynes, on an Rt-Alumina PLOT column with subsequent detection by FID. Channel 2 was designed for preconcentration and separation of oxygenated, aromatic, and halogenated VOCs, NMHCs larger than C6, and some other VOCs such as acetonitrile and dimethylsulfide, on a DB-WAX column with subsequent detection by quadropole MSD (HP 5971).
 Air samples were drawn at 4 sl/min through a 2 micron Teflon particulate filter and 1/4″ OD Teflon tubing (FEP fluoropolymer, Chemfluor) mounted on top of the laboratory container. Two 15 scc/min subsample flows were drawn from the main sample line, and through pretreatment traps for removal of O3, H2O and CO2. For 30 min out of every hour, the valve array (V1, V2, and V3; valves from Valco Instruments) was switched to sampling mode (Figure 1, as shown) and the subsamples flowed through 0.03″ ID fused silica-lined stainless steel tubing (Silcosteel, Restek Corp) to the sample preconcentration traps where the VOCs were trapped prior to analysis. When sample collection was complete, the preconcentration traps and downstream tubing were purged with a forward flow of UHP helium for 30 s to remove residual air. The valve array was then switched to inject mode, the preconcentration traps heated rapidly to 200°C, and the trapped analytes desorbed into the helium carrier gas and transported to the GC for separation and quantification.
 As noninert surfaces are known to cause artifacts and compound losses for unsaturated and oxygenated species, all surfaces contacted by the sampled airstream prior to the valve array were constructed of Teflon (PFA or FEP). All subsequent tubing and fittings, except the internal surfaces of the Valco valves V1, V2, and V3, were Silcosteel. The valve array, including all silcosteel tubing, was housed in a temperature controlled box held at 50°C to prevent compound losses through condensation and adsorption. All flows were controlled using Mass-Flo Controllers (MKS Instruments), and pressures were monitored at various points in the sampling apparatus using pressure transducers (Data Instruments).
 In order to reduce the dew point of the sampled airstream, both subsample flows passed through a loop of 1/8″ OD Teflon tubing cooled thermoelectrically to −25°C. Following sample collection, the water trap was heated to 105°C while being purged with a reverse flow of dry zero air to expel the condensed water prior to the next sampling interval. A trap for the removal of carbon dioxide and ozone (Ascarite II, Thomas Scientific) was placed downstream of the water trap in the Rt-Alumina/FID channel. An ozone trap (KI-impregnated glass wool, following Greenberg et al. ) was placed upstream of the water trap in the other channel leading to the DB-WAX column and the MSD (Figure 1).
 Sample preconcentration was achieved using a combination of thermoelectric cooling and adsorbent trapping. The preconcentration traps consisted of three stages (glass beads/Carbopack B/Carboxen 1000 for the Rt-Alumina/FID channel, glass beads/Carbopack B/Carbosieves SIII for the DB-WAX/MSD channel; all adsorbents from Supelco), held in place by DMCS-treated glass wool (Alltech Associates) in a 9 cm long, 0.04″ ID fused silica-lined stainless steel tube (Restek Corp). A nichrome wire heater was wrapped around the preconcentration traps, and the trap/heater assemblies were housed in a machined aluminum block that was thermoelectrically cooled to −15°C. After sample collection and the helium purge, the preconcentration traps were isolated via V3 (see Figure 1) until the start of the next chromatographic run. The traps were small enough to permit rapid thermal desorption (−15°C to 200°C in 10 s) eliminating the need to cryofocus the samples before chromatographic analysis (following Lamanna and Goldstein ). The samples were thus introduced to the individual GC columns, where the components were separated and then detected with the FID or MSD.
 Chromatographic separation and detection of the analytes was achieved using an HP 5890 Series II GC. The temperature program for the GC oven was: 35°C for 5 min, 3°C/min to 95°C, 12.5°C/min to 195°C, hold for 6 min. The oven then ramped down to 35°C in preparation for the next run. The carrier gas flow into the MSD was controlled electronically and maintained constant at 1 mL/min. The FID channel carrier gas flow was controlled mechanically by setting the pressure at the column head such that the flow was 4.5 mL/min at an oven temperature of 35°C. The carrier gas for both channels was UHP (99.999%) helium which was further purified of oxygen, moisture and hydrocarbons (traps from Restek Corp.).
 Zero air for blank runs and calibration by standard addition was generated by flowing ambient air over a bed of platinum heated to 370°C. This system passes ambient humidity, creating VOC free air in a matrix resembling real air as closely as possible. Zero air was analyzed daily to check for blank problems and contamination for all measured compounds.
 Compounds measured on the FID channel were quantified by determining their weighted response relative to a reference compound (see Goldstein et al. [1995a] and Lamanna and Goldstein  for details). Neohexane (5.15 ppm, certified NIST traceable ±2%; Scott-Marrin Inc.) was employed as the internal standard for the FID channel, and was added by dynamic dilution to the sampling stream. Compound identification was achieved by matching retention times with those of known standards for each compound (Scott Specialty Gases, Inc.).
 The MSD was operated in single ion mode (SIM) for optimum sensitivity and selectivity of response. Ion-monitoring windows were timed to coincide with the elution of the compounds of interest. Calibration curves for all of the individual compounds were obtained by dynamic dilution of multicomponent low-ppm level standards (Apel-Riemer Environmental Inc.) into zero air to mimic the range of ambient mixing ratios. A calibration or blank was performed every 6th run.
 The system was fully automated for unattended operation in the field. The valve array (V1, V2 and V3) and the preconcentration trap resistance heater circuit were controlled through the GC via auxiliary output circuitry. The PC controlling the GC was also interfaced with a CR10X data logger (Campbell Scientific Inc.), which was triggered at the outset of each analysis run. The inlet valve, the standard addition solenoid valve and the water trap cooling, heating and valve circuitry were switched at the appropriate times during the sampling cycle by a relay module (SDM-CD16AC, Campbell Scientific) controlled by the data logger. Relevant engineering data (time, temperatures, flow rates, pressures, etc.) for each sampling interval were recorded by the CR10X data logger with a AM416 multiplexer (Campbell Scientific Inc.), then uploaded to the PC and stored with the associated chromatographic data. Chromatogram integrations were done using HP Chemstation software. All subsequent data processing and QA/QC was performed using routines created in S-Plus (Insightful Corp.). Instrumental precision, detection limits, and accuracy for each measured compound during this experiment, along with the 0.25, 0.50, and 0.75 quantiles of the data, are given in Table 1.
Table 1. Concentration Quantiles and Figures of Merit for Measured VOCs
2.3. Aerosol, Trace Gas, and Meteorological Measurements
 Additional measurements which are used in this paper are described briefly below. For a more thorough overview of the gas and particle measurement methods and results from PAQS, the reader is directed to Wittig et al. [2004a] and the references cited therein.
 Semicontinuous measurements of PM 2.5 (i.e., <2.5 μm diameter) particulate mass were made using a tapering element oscillating microprobe (TEOM) instrument (Model 1400a, Rupprecht & Patashnick Co., Inc.). PM 2.5 nitrate and sulfate were also measured on a semicontinuous basis using Integrated Collection and Vaporization Cell (ICVC) instruments (Rupprecht & Patashnick Co., Inc.) [Wittig et al., 2004b]. Aerosol number size distributions (0.003–10 μm) were quantified using an array of particle sizing measurements: a nano scanning mobility particle sizer (SMPS) (TSI, Inc., Model 3936N25), standard SMPS (TSI, Inc., Model 3936L10), and Aerodynamic Particle Sizer (APS) (TSI, Inc., Model 3320). Aerosol number size distribution measurements were made semicontinuously throughout the PAQS campaign [Stanier et al., 2004a]. Aerosol organic carbon (OC) and elemental carbon (EC) were quantified in situ throughout the study with 2–4 hour time resolution using a Sunset Labs in situ carbon analyzer (A. Polidori et al., manuscript in preparation, 2005).
 O3, NO, NO2, CO and SO2 were measured continuously with commercial gas analyzers (Models 400A, 200A, 300 and 100A, Teledyne Advanced Pollution Instrumentation). Measurements of relevant meteorological parameters (incoming radiation, air temperature, wind speed and direction, precipitation, and relative humidity) were also made continuously throughout the experiment.
3. Results and Discussion
3.1. Meteorological Conditions
 Observed wind speed and direction for the two study periods (9 January to 12 February and 9 July to 10 August 2002) are shown as a wind rose plot in Figure 2. Throughout this paper, data collected during the January–February 2002 deployment will be referred to as “winter” data and that collected during July–August 2002 as “summer” data. Winds in the winter were predominantly out of the west (south to northwest), whereas in the summer southeasterly and northwesterly winds were most common (Figure 2). There was a diurnal cycle in wind speed in both seasons, with stronger winds during the day and weaker winds at night (not shown).
3.2. Factor Analysis
 Factor analysis can be used to categorize measured compounds into distinct source groups based on the covariance of their concentrations, creating an understanding of the variety of sources contributing to a broad range of measured species [Sweet and Vermette, 1992; Thunis and Cuvelier, 2000; Lamanna and Goldstein, 1999]. In this section we characterize the dominant source types impacting the Pittsburgh region in summer and winter, based on a factor analysis of the VOC data set, combined with other available trace gas and high temporal resolution aerosol data. Compounds are grouped into factors according to their covariance, and the strength of association between compounds and factors is expressed as a loading matrix. Each factor is a linear combination of the observed variables and in theory represents an underlying process which is causing certain species to behave similarly. Prior knowledge of source types for the dominant compounds is then used to assign source categories to the statistically identified factors.
 The analysis was performed using principal components extraction and varimax rotation (S-Plus 6.1, Lucent Technologies Inc.). Species having a significant amount (>8%) of missing data were excluded from the analysis. Results for the winter and summer data sets are presented in Tables 2 and 3, respectively, and discussed in detail below. Compounds not loading significantly on any of the factors are omitted from the loadings tables.
 Six factors were extracted from the winter data set, which accounted for a total of 83% of the cumulative variance (Table 2). Each of the six factors accounted for a statistically significant portion of the variance (P < 0.01, where P is the statistical probability of incorrectly attributing a nonzero fraction of the variance to a given factor). The analysis was limited to six factors since including more factors failed to account for more than an additional 2% of the variance in the data set.
 Factor 1, explaining 44% of the total variability in the data set, was associated most strongly with short-lived combustion-derived pollutants, such as the anthropogenic alkenes and aromatic species, in addition to NOx and the gasoline additive methyl-t-butyl ether (MTBE). We attribute this factor to local automobile emissions. The diurnal cycle exhibited by this factor (Figure 3a) showed a clear pattern, higher during the day than at night, and with prominent peaks during the morning and evening rush hours. Note that factor 1 accounted for 44% of the data set variability, indicating that automobile exhaust was most strongly responsible for changes in atmospheric VOC concentrations in Pittsburgh in the winter. Note also, however, that none of the aerosol parameters included in the factor analysis (PM 2.5 mass, aerosol sulfate and nitrate mass, and aerosol number density) loaded significantly on this factor, suggesting that this source was a relatively minor contributor to these components of regional PM.
 Factor 2, accounting for 10% of the variance, was associated exclusively with the anthropogenic alkanes (Table 2), most strongly with propane, and probably represents leaks of propane fuel or natural gas. None of the aerosol measurements loaded on this factor. Factor 2 was on average highest with winds out of the south, and the diurnal pattern showed a maximum in the early morning before dawn (Figure 3b), with a minimum in the afternoon.
 The third factor, accounting for 9% of the data set variance, like factor 1 was associated with some gas-phase combustion products (such as CO, benzene and propyne). Unlike factor 1, however, it also contained a significant aerosol component, in particular sulfate and PM 2.5 mass. The diurnal cycle of factor 3 (Figure 3c) was distinct from that of factor 1, with higher concentrations at night, and no noticeable rush hour contribution. The highest levels of factor 3 were seen with winds out of the south-southeast. We attribute this factor to industrial emissions from point sources in the region. In particular, the U.S. Steel Clairton Works, which is the largest manufacturer of coke and coal chemicals in the United States, and is located 11 miles to the south-southeast of Pittsburgh, may have been a significant contributor to this factor.
 Factor 4 was composed of species (acetone, acetaldehyde, methylethylketone (MEK)) that are both emitted directly and produced photochemically. Acetone and acetaldehyde are also known to have significant biogenic sources [Schade and Goldstein, 2001]; however, biogenic emissions are unlikely to be a dominant source of these compounds in the Pittsburgh winter. PM 2.5 mass was also associated with this category, consistent with the importance of both primary emissions and secondary production of regional aerosol. The diurnal cycle of factor 4 (Figure 3d) showed evidence of both primary and secondary influence. Daytime concentrations were slightly higher than at night, and there was a marked increase in the morning which was coincident with sunrise. Unlike factor 1, this factor did not show the distinct morning and evening peaks coinciding with rush hour. The day-night difference was much less than in summer (see following section), likely reflecting weak wintertime photochemistry and a consequently greater relative impact from direct emissions. The relative importance of primary and photochemical sources for these compounds is explored further in section 3.3.
 Factor 5, which explained a further 6% of the variance, was negatively associated with ozone and nuclei mode aerosol number density, and positively associated with total PM 2.5 mass, aerosol nitrate and accumulation mode number density. This factor may represent the combined influences of photochemical activity and mixed layer dynamics. Production of ozone and nucleation mode particles is driven by sunlight, and owing to their relatively short lifetimes their concentrations were highest during the day and lower at night. By contrast, longer lived pollutants less strongly impacted by photochemistry exhibited higher concentrations at night when winds were calmer and vertical mixing limited. In addition, partitioning of semivolatile species such as nitrate into the particle phase is thermodynamically favored by the colder temperatures and higher relative humidity at night.
 The 6th factor, accounting for 6% of the variability, was associated with gas phase SO2, aerosol sulfate, PM 2.5 mass, and accumulation mode number density. Factor 6 showed a diurnal pattern with higher impact during the day than at night, consistent with a photochemically driven process (Figure 3f). However, nucleation mode number density did not load significantly on this factor. This factor may reflect regional coal burning power plant emissions of gases and particles, and the subsequent photochemical aging of those emissions.
3.2.2. Summer Trace Gas and Aerosol Data Set
 Six factors were extracted from the summer data set, which together accounted for 77% of the variability in the observations (Table 3). Each of the six factors accounted for a statistically significant portion of the variance (P < 0.01). Including additional factors explained less than 2% of the remaining variance. The PM 2.5 measurements had a large number (19%) of missing values, and as there was a strong correlation (r2 = 0.92) between PM 2.5 mass and aerosol volume measured with the SMPS, missing PM 2.5 concentrations were estimated by scaling to aerosol volume prior to performing the factor analysis.
 As with the winter data, the dominant factor, explaining 42% of the total variance, was associated with anthropogenic alkenes, aromatics, MTBE and other markers of tailpipe emissions (Table 3). The diurnal cycle of this source type (Figure 4a), however, with a sharp early morning maximum at sunrise and a broad afternoon minimum, was markedly different than in the winter, when traffic patterns determined the diurnal pattern. In summer, a deeper daytime mixed layer and more rapid photooxidation combined to give rise to the observed temporal pattern. The fact that benzene is not associated with factor 1 is due to the influence of a nearby source (not associated with other tailpipe compounds or solvents), which resulted occasionally in extremely elevated benzene levels. If the factor analysis is repeated after removing the highest (>0.9 quantile) benzene values, benzene in fact loads most strongly on this automotive factor.
 Factor 2 encompassed compounds, such as acetone, acetaldehyde, and isoprene, known to have photochemical sources, sunlight dependent biogenic sources, or both. We thus interpret this factor as representing a combination of these radiation-driven source types. The clear diurnal pattern for this source category (Figure 4b) reflected its light dependent nature, and suggests, for the associated OVOCs, that photochemical and/or biogenic production were more important than direct combustion emissions. The association of 1-butene with factor 2 suggests a regional light-driven biogenic 1-butene source, as has been reported for other locations [Goldstein et al., 1996].
 Factor 3, consisting of fine particle number (accumulation mode only; nuclei and aitken mode number densities were not included in the analysis as they contained too many missing values), PM 2.5 mass, sulfur dioxide, and particle sulfate, had a weak diurnal pattern containing a maximum at midday (Figure 4c). The correlation of acetone and MEK with the other species associated with this factor may arise from distinct sources which lie along the same transport trajectory, or may reflect long-range transport of pollution with concurrent photochemical production.
 The fourth factor, which explained 7% of the cumulative variance, associated with combustion markers such as benzene, NOx and CO, is analogous to the source represented by the third factor extracted from the winter data set. The two factors both exhibited diurnal patterns with concentrations elevated at night and early morning (Figures 3c and 4d), and in both cases the highest levels were associated with winds from the south-southeast. Again, we attribute this factor to industrial emissions. PM 2.5 loaded on the analogous factor in the winter data set, but was not significantly associated with this factor in the summer. This may be due to the fact that concentrations of all measured PM components increased significantly during the summer, and so the contribution of this local source to the total PM 2.5 mass was less important during this time. The fifth factor accounted for a further 5% of the data set variance and was associated exclusively with oxidation products of isoprene: methacrolein (MACR), methylvinylketone (MVK) and 3-methylfuran.
 Propane, isobutane and butane grouped together on factor 6, which likely represents propane fuel or natural gas leakage. The diurnal pattern for this factor (Figure 4f) was similar to that of factor 1, with a strong predawn maximum and afternoon minimum. There was also a weak negative association with ozone, as there was with factor 1, owing to the co-occurrence of the maximum mixed layer depth (and lowest levels of factor 1 and factor 6 compounds) with the maximum daily ozone concentrations.
3.2.3. Summary of Factor Analysis Results
 The results of the factor analyses provide a context from which to interpret the combined VOC and fine particle data sets. In both seasons, local tailpipe emissions formed a substantial component of the ambient VOC concentrations. They did not, however, significantly impact the aerosol species that were included in the factor analysis. Nonautomotive combustion emissions, probably from industrial point sources in the area, were an important source of aerosol mass, as well as of CO, NOx and several unsaturated hydrocarbons. There was pronounced photochemical production of OVOCs such as acetone, MEK, and acetaldehyde in summer. Diurnal concentration patterns indicated that this source was more important than primary combustion emissions. In winter this was not the case, although secondary production was still evident. Along with isoprene, 1-butene showed evidence of a local light-driven biogenic source. There was a distinct source of alkanes that did not appear to be a significant source of other compounds, which was likely leakage of propane fuel or natural gas. Finally, ambient PM showed evidence of a significant secondary component even in winter. The importance of primary and secondary sources to OVOC and OC levels is explored in detail in the following section.
3.3. Source Apportionment of OVOCs and Aerosol Organic Carbon
3.3.1. OVOC Source Apportionment
 Oxygenated VOCs can make up a sizable and even dominant fraction of the total VOC abundance and reactivity, in the urban [Grosjean, 1982; Goldan et al., 1995a], rural [Goldan et al., 1995b; Riemer et al., 1998], and even remote marine atmosphere [Singh et al., 1995, 2001]. Many OVOCs, such as acetone, MEK and acetaldehyde, are known to have a diversity of sources, including combustion emissions, photochemical production from both anthropogenic and biogenic precursor species, and direct biogenic emissions. Understanding the magnitudes of these sources in different environments is prerequisite to an accurate representation of odd hydrogen cycling and ozone chemistry in models of atmospheric chemistry and air quality from the local to global scale.
 Here we present a new approach to unraveling source contributions to such species. We define the ambient concentrations of VOC species Y (χy, in ppt) as being the sum of direct combustion (χyc) and other components (χyo), which could represent secondary or biogenic sources, as well as a background concentration (χa),
 For relatively long lived species, such as acetone, χa may be considered to represent a regional background level. In this case, χa will presumably include contributions from both combustion and secondary/biogenic production that has taken place elsewhere and been integrated into the regional background. For acetaldehyde, a compound with an atmospheric lifetime of only a few hours, there was nonetheless a nonzero observed minimum concentration in both summer and winter. Here, the parameter χa may represent a relatively invariant area source that maintains ambient levels of acetaldehyde above a certain threshold. In either case, we operationally define the background concentration of each species as the 0.1 quantile of the measured concentrations [Goldstein et al., 1995b].
 If Y and a combustion tracer, such as toluene, are emitted in a relatively consistent ratio from different types of combustion sources, then χyc can be estimated as
where (Y/TOL)E is the primary emission ratio of Y relative to toluene, and χtol represents toluene enhancements above background (ppt; see the following section for a discussion of the choice of combustion marker). χyo is then given by
In (3), χtol, χy, and χa are known quantities. All that is required to calculate the combustion (χyc) and secondary plus biogenic (χyo) components of species Y is the primary emission ration (Y/TOL)E.
 To determine (Y/TOL)E for each species Y, we make use of the combustion tracers associated with the first factor in the factor analyses (Tables 2 and 3). For a given value of (Y/TOL)E, we can calculate a χyo vector, and the coefficient of determination (r2) between χyo and each of our combustion tracers. By varying (Y/TOL)E over a range of possible values and repeating this calculation, we can derive r2 between the calculated χyo and each of our combustion tracers, as a function of (Y/TOL)E. At low values of (Y/TOL)E, the calculated χyo will still contain a significant combustion component. At high values of (Y/TOL)E, χyo will become dominated by the χtol term. At the correct value for (Y/TOL)E all contributions of combustion emissions should be removed from χyo, and hence correlation of χyo with a pure combustion parameter should be at a minimum. Conversely, if the noncombustion sources of Y are dominantly photochemical, then the correlation between χyo and a photochemically derived VOC should reach a maximum at that same point.
 The results of performing this analysis for Y = acetone, MEK and acetaldehyde are shown in Figure 5. Each solid line shows the coefficient of determination between an individual combustion marker and χyo, as a function of the value of (Y/TOL)E that was used to calculate χyo. The compounds used as markers of combustion (V, with mixing ratios χv) were those VOCs thought to be solely or predominantly derived via combustion processes (propyne, 2-methylpropene, t-2-butene, c-2-butene, 2-methyl-1-butene, 3-methyl-1-butene, t-2-pentene, benzene, ethylbenzene, p-xylene, m-xylene, o-xylene) and NOx. Dashed lines show r2 between χyo and VOCs thought to be solely photochemically produced (MACR and MVK, which were present above detection limit in the summer experiment only), as a function of (Y/TOL)E.
 There is a well defined minimum in the curve for the combustion markers, the location of which, for a given oxygenated VOC species Y, is consistent across all marker compounds. For the summer data, the location of this minimum coincides with the maximum r2 value for the photochemically produced tracer species. We interpret the location of the critical value of r2 as the representative (Y/TOL)E value for that time of year (Table 4).
Table 4. OVOC Combustion Emission Ratios and Source Contributionsa
Note that the median values of the source contributions do not necessarily add up to the median ambient concentration as the median is not a distributive property.
IQR, interquartile range.
 Primary emission ratios, relative to toluene, for acetone, MEK and acetaldehyde were all substantially (1.4–2.4 times) higher in January–February 2002 than in July–August 2002. Since the emission ratio depends on the toluene as well as OVOC emission strength, seasonal changes in the emission ratio can be due to changes in the numerator, denominator or both. This issue is discussed further in the following section. The primary emission ratios calculated in this section are averages over the sources impacting the air masses that were sampled during the course of the study. They therefore represent integrated regional emission ratios for Pittsburgh in January–February and July–August 2002.
 Urban and industrial VOC emission ratios depend on a number of factors, in particular vehicle fleet and fuel characteristics as well as types of industrial activity in the region. Such variability complicates efforts to construct reliable emission inventories for use in air quality modeling, and emphasizes the utility of the approach developed here, which provides top-down observational constraints on regional pollutant emission ratios. On-road studies of motor vehicle exhaust in the U.S. (generally carried out during summer) report emission ratios for acetone, MEK and acetaldehyde relative to toluene ranging from 2–4%, 2–12%, and <1–8% (molar basis) respectively for light-duty vehicles [Kirchstetter et al., 1999; Fraser et al., 1998; Zielinska et al., 1996; Kirchstetter et al., 1996]. Heavy-duty or diesel vehicles emit substantially higher amounts of these OVOCs relative to toluene, with emission ratios frequently greater than unity [Zielinska et al., 1996; Staehelin et al., 1998]. Inventory estimates (including mobile, point and nonpoint sources) of annual acetaldehyde and MEK emissions in Allegheny County are 14% and 10% those of toluene respectively on a molar basis (see http://www.epa.gov/ttn/chief/net/index.html), substantially lower than the values determined here (Table 4). If inventory estimates of toluene emissions are accurate, this suggests that acetaldehyde and MEK emissions are underestimated by factors of approximately 3.8 and 2.6 (from the average of the summer and winter ratios, Table 4).
 For the summer data, χyo for both acetone and MEK exhibited a well-defined maximum correlation with MACR and MVK (Figure 5), indicating that the other, noncombustive, source represented by χyo is likely to be largely photochemical. For acetaldehyde, the poor correlation of χyo with MACR and MVK suggests that χyo is not exclusively photochemical in nature, and may contain another significant component such as biogenic emissions.
 For comparison, Figure 6 shows results of the same analysis for Y = MACR and MVK, species whose only significant known source is from photochemical oxidation of isoprene. In this case, the minimum correlation of χyo with combustion derived VOCs (and maximum correlation with MVK or MACR) occurs at a combustion emission ratio (Y/TOL)E of zero, showing that there are no significant primary emissions of these compounds.
 With (Y/TOL)E determined by the critical points in Figure 5, the contributions to the concentration of species Y from background (χa), combustion emissions (χyc), and other sources (χyo) as a function of time can then be calculated from (2) and (3). Contributions of χa, χyc, and χyo to the ambient levels of acetone, MEK, and acetaldehyde in summer and winter are summarized in Table 4. Negative values of χyo were assumed to contain no secondary or biogenic material and were set to zero.
 Ambient concentrations of acetone, MEK and acetaldehyde during summer were on average 3–4 times higher than winter (Table 4). Increases in background concentrations were responsible for a significant portion of this winter to summer difference, with summer background levels on average 2.5–5 times higher than in the winter. However, the fraction of the total concentration due to the background was comparable in summer and winter. In both seasons, the background made up, on average, slightly over half of the overall abundance for all three compounds (Table 4). The higher summer background concentrations for these species were probably due to increased nonlocal photochemical production and biogenic emission during that time of year.
 The absolute contribution from combustion to atmospheric mixing ratios was very similar in summer and winter, despite the large changes in emission ratios (which were higher in winter by factors of 2.4, 2.0 and 1.4 for acetone, MEK and acetaldehyde; see discussion in following section). However, total concentrations were substantially higher in summer, and combustion emissions were a significantly smaller fraction of the total source (Table 4).
 Other sources, which we assume to be predominantly photochemical but which also likely include some biogenic emissions in summer, were substantially higher in summer for all three compounds. Median summer values of χyo were over 5 times higher than in winter for acetone and MEK and nearly 4 times higher for acetaldehyde.
 With the exception of MEK, combustion was not the major source of these compounds, even in winter. For MEK, combustion emissions were more important than other sources (χyo) in the winter (a median of 23% versus 13%). This was not the case in the summer, however, nor was it true for acetone or acetaldehyde in either season. For acetone, other sources were twice as important as combustion emissions in the winter and ten times as important in the summer. For acetaldehyde, other sources were 50% larger than combustion emissions in winter and 4 times larger in summer.
 Diurnally averaged OVOC source contributions, overlaid with ozone concentrations, in winter and summer are shown in Figure 7. For the summer data set, the other OVOC sources (χyo) showed a strong photochemical signature: low at night, increasing after sunrise and peaking in the afternoon. For each compound, acetone, MEK and acetaldehyde, the χyo term tracked ozone quite closely. For the winter data set, the χyo terms for each OVOC showed a much weaker photochemical signal, and the relative contribution from combustion was substantially larger than in the summer. Note that since χyc for each OVOC is defined as χtol(Y/TOL)E, i.e. the observed toluene enhancements multiplied by a primary emission ratio, diurnal patterns in χyc shown in Figure 7 reflect that of toluene.
3.3.2. Choice of Combustion Marker and Seasonal Patterns in Emission Ratios
 Repeating the above analysis using other combustion derived compounds instead of toluene as the primary emission tracer resulted in only minor changes to the calculated OVOC partitioning (Table 4) and did not alter any of the conclusions. This gives us confidence that this approach to partitioning VOC source contributions is robust. For a given primary emission tracer, the calculated OVOC emission ratios, given by the critical r2, were consistent using compounds that are solely combustion derived (e.g. alkenes and alkynes) and compounds that have additional anthropogenic noncombustion sources, such as evaporative losses and chemical processing (e.g. benzene and toluene). Hence the approach is not sensitive to slight differences in source profiles for the marker compounds. We conclude that the calculated emission ratios represent an integrated regional primary pollution signal, rather than one specific source type.
 In addition, we note that the combustion markers employed to calculate the OVOC primary emission ratios relative to toluene (Figure 5) have lifetimes that vary by nearly a factor of 50, yet they give consistent emission ratio estimates. This may indicate that much of the variability observed is relatively local and not driven by photochemical lifetime or by the different sampling footprints for species of different lifetimes.
 Seasonal differences in the OVOC primary emission ratios, however, calculated relative to the tracer compound, changed dramatically depending on the tracer used. This is to be expected since the primary emission ratios are sensitive to changes in both the numerator and denominator, and different combustion tracers do not necessarily have identical seasonal patterns in emission strength. While the primary OVOC emission ratios relative to toluene were all higher in the winter, OVOC emission ratios calculated relative to alkenes and alkynes were generally 2–3 times higher in the summer. It is possible that noncombustion toluene sources, i.e. evaporative emissions, are higher in summer which would decrease the OVOC emission ratio for that time of year. However, the short-lived alkenes are oxidized much more rapidly in the summer months due to higher concentrations of OH and ozone. Over a given source-receptor distance, then, the alkenes would be more depleted relative to the OVOCs in the summer than in the winter. This would lead to higher OVOC:alkene emission ratios in the summer, as observed. While this effect would also occur with toluene, either the effect was small due to toluene's longer lifetime (10 times that of t-2-butene) and/or it was offset by increased emissions.
3.3.3. Quantification of Secondary Organic Aerosol
 Organic carbon (OC) constitutes a significant fraction of atmospheric aerosol [Lim and Turpin, 2002; Cabada et al., 2002, 2004; Tolocka et al., 2001]; however, its origin and composition remain poorly understood. OC consists of hundreds or thousands of individual organic compounds. Both anthropogenic sources (e.g. combustion) and biogenic sources (e.g. plants) can contribute to aerosol organic carbon via direct emission of particles (primary OC), and via emission of gas-phase precursor compounds that partition into the aerosol phase upon oxidation (secondary OC). Clarifying the roles of primary and secondary OC production is an important step toward an improved understanding and modeling of the sources, morphology and effects of aerosol OC. The technique of minimizing (maximizing) the correlation between combustion (photochemical) tracer compounds and the photochemical component of a species of interest, developed in the previous section, also has utility in determining the primary emission ratio for pollutants other than VOCs. Here, we apply the method to quantify the relative importance of primary and secondary OC sources in the study region.
 As above, aerosol organic carbon concentrations (Moc, in μg of carbon per cubic meter, μgC/m3) are defined as being composed of combustion (Mc) and other (Mo) components, plus a regional background (Ma) [Turpin and Huntzicker, 1995]:
 Elemental carbon (EC, or soot) is an aerosol component whose only source is direct emission from combustion. If both OC and EC are emitted from primary sources according to a characteristic averaged OC:EC emission ratio (OC/EC)E, then the combustion-derived organic carbon can be estimated as
where Mec represents elemental carbon enhancements above background (in μgC/m3), and Mo is given by
 The background term, Ma, represents noncombustion primary OC (e.g. from biogenic sources) as well as any regional aerosol organic carbon background. As above, we estimate Ma as the 0.1 quantile of the measured OC concentrations. Mo is then assumed to be exclusively secondary OC. It should be pointed out, however, that if there exist significant sources of primary OC which do not correlate with EC and are highly variable through time (and thus are not entirely captured by the Ma parameter), then Mo may also contain some primary influence.
 One challenge associated with the EC tracer method as it has been applied in the past involves defining the OC:EC ratio of primary emissions, as this can vary significantly between sources and consequently as a function of time. In addition, defining (OC/EC)E from ambient OC and EC concentration data requires that there be a subset of data with no significant secondary contributions to the measured OC concentrations. The typical approach is to qualitatively eliminate data points that are likely to be impacted by significant secondary production or other factors such as rain events, and regress OC on EC for that subset of data dominated by primary OC [Turpin and Huntzicker, 1995; Cabada et al., 2004]. This then gives a regression slope that is in theory reflective solely of primary emissions. The parameter Ma, reflecting primary noncombustion OC, is then assumed to be constant and given by the intercept, enabling the calculation of Mo. In the event of significant temporal variability in the primary OC:EC ratio impacting the sampling site, this process may be repeated on subsets of the data.
 Here we employ the technique developed in the previous section, using the range of markers for primary and secondary processes provided by the VOC data set to define the characteristic OC:EC primary emission ratio for the Pittsburgh region in summer and winter. This approach avoids the need to carefully select time periods that will yield the “correct” value of (OC/EC)E. In addition, the suite of primary and secondary VOCs available provides bounds on the value of (OC/EC)E appropriate to a given time period. The secondary organic aerosol is then calculated according to (6).
 The coefficient of determination between Mo and combustion and photochemically derived VOCs is shown in Figure 8 as a function of (OC/EC)E for winter and summer. Again, the critical point of the curves gives the representative value of (OC/EC)E for that time of year.
 The median value of (OC/EC)E determined for the winter data set was 1.85 (IQR: 1.82–1.86), whereas that for the summer data set was lower with a median of 1.36 (IQR: 1.27–1.48) (Table 5). Substantially higher particulate concentrations of levoglucosan were observed in the winter, indicative of increased wood combustion. More widespread wood burning is a likely cause of the higher primary OC:EC emission ratio at that time of year. Colder engines and less efficient combustion may have also contributed to the higher wintertime ratio.
Table 5. OC Combustion Emission Ratios and Source Contributionsa
Note that the median values of the source contributions do not necessarily add up to the median ambient concentration as the median is not a distributive property.
IQR, interquartile range.
 Using the derived values of (OC/EC)E for summer and winter, we can then calculate Mo, the secondary OC, according to (6). Timelines of the total (Moc), combustion (Mc), and secondary (Mo) aerosol organic carbon concentrations (in μgC/m3) for winter and summer 2002 are plotted in Figure 9, and quantiles of these quantities are given in Table 5. Note that since Mc is defined as Mec(OC/EC)E, there are occasional episodes where Mc > Moc. Negative values of Mo were assumed to contain no secondary material and were set to zero.
 Ambient concentrations of aerosol organic carbon in the summer experiment were on average twice as high as in the winter (Table 5). Background levels (Ma) made up a significant fraction of the total ambient aerosol OC concentrations in both seasons. Background aerosol OC concentrations were slightly higher in summer but a larger fraction of the total in winter (median of 49% versus 35%). Similarly, combustion OC was slightly higher in the summer, however, it made up a larger fraction of the total OC in winter (median of 30% versus 19%). Secondary organic carbon (Mo) accounted for a median of 16% (IQR: 0–35%) of the aerosol OC in winter, and 37% (IQR: 15–56%) in summer (Table 5).
 A. Polidori et al. (manuscript in preparation, 2005) carried out an analysis of the primary and secondary components of OC in Pittsburgh during the PAQS study using the Turpin and Huntzicker  EC tracer method. For overlapping time periods (10 January to 12 February and 10–31 July 2002), they calculate median secondary OC concentrations of 0.21 μgC/m3 (15% of total OC) and 1.04 μgC/m3 (47% of total OC) respectively. These values are in good agreement with those calculated here for the same periods: 0.20 μgC/m3 (16% of total OC) and 1.15 μgC/m3 (43% of total OC) (note that these values differ slightly from those in Table 5 since they do not reflect identical time periods).
3.4. Characterization of the Chemical State of the Atmosphere: VOC Contributions to OH Loss
 Photochemical production of secondary organic aerosol (SOA) depends on the chemical state of the atmosphere, both in terms of oxidative capacity, and in term of the quantity and nature of gas phase organic material that is present to form aerosol. In this section we describe the relative importance of different classes of VOCs to tropospheric photochemistry in the Pittsburgh region in summer and winter, and show that higher levels of photochemically active compounds are present in summer, when SOA levels are highest, due to biogenic emissions and photochemical production of OVOCs.
 A useful measure of air mass chemical reactivity is the OH loss rate (LOH, s−1), defined as
where ki is the reaction rate constant for species i with the hydroxyl radical [Atkinson, 1994], and χi is the concentration of i in molec/cm3. LOH has units of s−1 and represents the inverse lifetime of the hydroxyl radical with respect to reaction with the measured compounds.
 Daytime (1000–1600 EST) values of LOH were calculated for the following groups of compounds: total (all measured VOCs plus CO); alkanes; alkenes + alkynes; aromatics; OVOCs; isoprene plus its oxidation products methacrolein, methylvinylketone, and 3-methylfuran; and CO (Figure 10; Tables 6 and 7).
Table 6. Quantiles of Daytime OH Loss Rate: Winter Dataa,b
 Due to analytical challenges, VOC measurements in many field studies of air quality and atmospheric chemistry comprise only the anthropogenic nonmethane hydrocarbons (NMHCs; alkanes, alkenes, alkynes and aromatics). In Pittsburgh during January and February 2002, these species accounted for a substantial portion (approximately 60%) of the total measured OH loss rate. However, while their collective OH reactivity was only slightly less in summer (0.68 s−1 versus 0.89 s−1), their importance relative to other VOCs was dramatically lower, as they accounted for only 11% on average of total LOH during summer. Similarly, the CO reactivity was comparable in both seasons (median of 0.49 s−1 in winter and 0.53 s−1 in summer), but its relative contribution to the total measured OH loss rate was much greater in winter (median of 23% versus 7% in the summer). It should be pointed out that these calculations do not include the C2 hydrocarbons ethane, ethene and ethyne, which were not measured. Based on published ratios of these compounds to other species [Parrish et al., 1998], we estimate that they would cause an OH loss rate of approximately 0.05 s−1 and 0.13 s−1 for summer and winter.
 Despite the comparable NMHC and CO reactivity in the two seasons, the total measured daytime OH loss rate underwent a more than fourfold increase from winter (median = 1.42 s−1; IQR: 1.12–2.30 s−1) to summer (median = 7.25 s−1; IQR: 4.60–9.38 s−1). This was due to the presence of high levels of isoprene and its oxidation products in summer, and also to the three-fold increase in oxygenated VOC concentration and reactivity from winter to summer (Tables 6, 7). Isoprene plus its oxidation products accounted for a median of 62% (IQR: 52–70%) of the daytime OH loss rate in summer, with the OVOCs accounting for an additional 20% (IQR: 15–26%). Formaldehyde measurements were not made during the PAQS study, and including the effects of this compound would result in an increased contribution to the calculated OH loss rate from the OVOCs in both seasons.
 The PAQS sampling site was located at the north end of Schenley Park, a 456 acre urban park with substantial tree cover. To test whether the observed isoprene concentrations were biased by the presence of a large nearby source, the daytime OH loss due to isoprene and its oxidation products was calculated for time periods when the wind was only from the northern sector and the wind speed was greater than 1 m/s. The resulting OH loss rate (median 4.55 s−1; IQR 2.85–5.73 s−1) was not significantly different from that calculated using all of the daytime data (median 4.71 s−1; IQR 2.77–6.20 s−1). Hence we find that in winter, the total daytime OH loss rate is dominated by the nonmethane hydrocarbons and CO, whereas in summer it is dominated by isoprene, its oxidation products, and oxygenated VOCs. The above calculations do not include methane, which was not measured during the experiment. Based on background concentrations of methane at this latitude (see http://www.cmdl.noaa.gov/info/ftpdata.html), we estimate the OH loss rate in Pittsburgh due to methane at approximately 0.21 s−1 during the winter and 0.32 s−1 during the summer.
 Daytime OH loss rates were also calculated on the following subsets of the data: high ozone days, high aerosol OC days, and days in which moderate to strong nucleation events were observed [Stanier et al., 2004b] (Tables 6 and 7). High ozone and high OC days were defined as days in which the maximum value of these quantities was above the 0.8 quantile for all observed daily maxima.
 For July and August, the 0.8 concentration quantile for the daily maximum ozone was 84 ppb. On high ozone days, the total measured OH loss rate was higher (median: 8.53 s−1) than otherwise (days not exceeding this threshold had a median OH loss rate of 6.74 s−1), and this increase was distributed relatively evenly among the different compound classes (Table 7). During January and February, the 0.8 quantile for daily ozone maxima was only 30 ppb. Days during which ozone exceeded this amount had a lower overall OH loss rate (median 1.00 s−1) than days on which it did not (median 1.57 s−1). These higher ozone days in the winter may have occurred during periods of enhanced vertical mixing. Comrie and Yarnal  analyzed the dependence of surface ozone in Pittsburgh on synoptic climatology. They concluded that high ozone levels in summer developed under stagnant anticyclonic conditions, whereas in winter high ozone concentrations were associated with tropopause folding and vertical transport of stratospheric ozone.
 In both seasons, days with high OC loadings were associated with higher levels of all VOC compound categories and CO. In January–February, the median OH loss rate was 2.70 s−1 on days with high OC versus 1.28 s−1 on days without. In July–August, the median daytime OH loss rate was 9.26 s−1 on high OC days and 6.81 s−1 on other days. By contrast, days on which nucleation events occurred had lower overall OH loss rates (medians of 1.04 s−1 and 4.96 s−1 in winter and summer, respectively) than days without nucleation events (medians of 1.54 s−1 and 7.93 s−1 in winter and summer). Stanier et al. [2004b] found that the occurrence of nucleation events during PAQS was dependent on the preexisting aerosol surface area available for condensation. The lower OH loss rates on nucleation days may be due to a positive correlation between gas phase reactivity and aerosol surface area (r2 = 0.48 in winter and 0.28 in summer); i.e. less polluted days with low OH loss rates also had lower particle surface area available for condensation of semivolatile aerosol precursors, which increased the likelihood of new particle nucleation.
 High temporal density speciated VOC measurements provide a useful framework for interpreting aerosol measurements. Statistical analyses such as factor analysis on combined VOC-aerosol data sets can test precepts used in source-receptor modeling. The range of combustion and photochemical markers in the VOC data set also enables us to deconvolve the relative contributions to ambient levels of OVOCs and aerosol OC from different source types. We calculate that secondary plus biogenic sources accounted for 24%, 12% and 27% of the ambient concentrations of acetone, MEK and acetaldehyde respectively in the winter and 29%, 26% and 34% respectively in the summer. Aerosol OC was found to be composed of 16% secondary carbon in the winter and 37% secondary carbon in the summer. The importance of the background contribution to observed concentrations of both OVOCs and aerosol OC emphasizes the role of longer-range transport and the need for a regional perspective in addressing air quality concerns. While local automotive emissions were the primary factor driving changes in VOC concentrations in Pittsburgh, they did not contribute significantly to variability in the aerosol species included in the factor analysis (PM 2.5 mass, aerosol sulfate and nitrate mass, and aerosol number density).
 VOC concentration data can also help define chemical conditions that are conducive to particle formation and growth. We find that while aerosol OC loadings are highest when VOC concentrations and reactivities are high, nucleation events tended to occur on days when levels of VOCs and CO were low. High ozone days in the summer were associated with high OH loss rates due to the VOCs and CO, whereas in the winter the highest ozone levels occurred on days with low levels of CO and nonmethane hydrocarbons but slightly higher OVOC concentrations.
 One of the overall objectives of the PAQS study is to develop the ability to predict changes in PM characteristics and atmospheric composition due to proposed changes in emissions. Reaching this objective will require accurate modeling of the chemical and dynamical processes controlling atmospheric composition in the Pittsburgh region. The results presented here should help to provide a basis upon which to test mechanisms included in such models.
 This research was conducted as part of the Pittsburgh Air Quality Study, which was supported by the US EPA under contract R82806101 and the US DOE National Energy Technology Laboratory under contract DE-FC26-01NT41017. DBM thanks the DOE for a GREF fellowship. The authors thank all of the PAQS researchers for their help; in particular, Allen Robinson, Beth Wittig, and Andrey Khlystov. Thanks also to Megan McKay for her considerable help.