Paleofire events obtained from the statistical treatment of sedimentary charcoal records rely on a number of assumptions and user's choices, increasing the uncertainty of reconstructio\ns. Among the assumptions made when analyzing charcoal series is the choice of a filtering method for raw Charcoal Accumulation Rate (CHARraw). As there is no ultimate CHARraw filtering method, we propose an ensemble-member approach to reconstruct fire events. We modified the commonly used procedure by including a routine replicating the analysis of a charcoal record using custom smoothing parameters. Dates of robust fire events, uncertainties in fire-return intervals and fire frequencies are derived from members' distributions. An application of the method is used to quantify uncertainties due to data treatment in two CHARraw sequences from two different biomes, subalpine and boreal.
 Sedimentary charcoal records have proven useful for identifying long-term trends in fire activity at local to global scales, studying interactions with vegetation [Clark, 1990; Blarquez and Carcaillet, 2010] and the role of biomass burning in the carbon cycle [Clark et al., 1996; Bremond et al., 2011], and understanding the linkages with climatic changes [Whitlock et al., 2007; Marlon et al., 2008; Ali et al., 2012; Nelson et al., 2012]. Information contained in sedimentary charcoal records may reflect three different components: (i) amount of biomass burned (i.e., long-term trends in Charcoal Accumulation Rate (CHAR) or “background” trends) [Marlon et al., 2006; Higuera et al., 2011], (ii) fire frequency [Gavin et al., 2006], and (iii) noise [Carcaillet et al., 2007]. In the last decade, improvements were made in the methods used to examining charcoal records and in the quality of fire reconstructions. Noteworthy is the reconstruction of fire events by decomposing charcoal records into noise that mirrors taphonomical and sampling effects, and high-frequency charcoal peaks that could be related to fire events [Long et al., 1998; Carcaillet et al., 2001b]. The use of sieved charcoal (ø > 150 µm) has led to spatially precise fire event reconstructions [Carcaillet et al., 2001a; Lynch et al., 2004]. Additionally, the use of a locally defined threshold for charcoal peak detection instead of a “globally” defined one has improved peak detection for records with varying mean and variance [Higuera et al., 2008]. Despite the progresses in paleofire research, the underlying methods behind fire reconstructions from sedimentary charcoal records rely on a number of analytical parameters that are determined by user choices. Here we present an approach based on ensemble members, i.e., a set of fire events reconstructions based on different analysis parameters, without a priori parameter choices to obtain robust history. By minimizing user analytical choices, the method we propose extends the capabilities of fire event reconstruction methods for sites located in all biomes such as the Mediterranean, tropical, temperate, and boreal.
2 Methods and Data Analysis
 The analysis of sedimentary charcoal records is composed of four main analytical steps [Higuera et al., 2009]: (1) Charcoal records are decomposed into Charcoal Accumulation Rates (CHAR) using the sediment age ~ depth model to obtain charcoal values for equivalent time steps and volumes along sedimentary sequences [Long et al., 1998]. CHAR are typically calculated at the median temporal resolution interval; (2) CHAR (subsequently CHARraw) are filtered using diverse smoothing methods to model the background component (CHARback), which is subtracted from the CHARraw to obtain the residual high-frequency CHARpeak; (3) CHARpeak is decomposed into two subpopulations using a Gaussian mixture model, with the first part being related to noise in the CHARpeak population (CHARnoise), while the second part could represents the occurrence of one or more local fire events [Gavin et al., 2006]. For simplification, the CHARpeak exceeding CHARnoise was noted CHARfire. Additionally, we used the term fire events to describe the particular events from the CHARfire that could represent one or more fires occurring in the surroundings of the lakes (< 1 km distance) [Higuera et al., 2007]. A given percentile of the CHARnoise is commonly used as a threshold to separate the CHARfire from the CHARnoise. The signal-to-noise index (SNI) is used to evaluate the effectiveness of the discrimination [Kelly et al., 2011]. The goodness-of-fit (GOF) metric is used to assess peak detection quality (i.e., the degree of contrast between the CHARnoise and the CHARfire) by comparing the empirical CHARnoise component with the one derived from the Gaussian mixture model. Significance is represented by the p-value of a Kolmogorov-Smirnov test, where large p indicates that the Gaussian mixture model adequately models empirical CHARnoise. (4) CHARfire is used to reconstruct local fire-return intervals (FRI) and fire frequency (FF) trends.
 Within this analysis, the second step (CHARraw filtering) is particularly prone to inducing bias due to user choice. There is no consensus in the literature about the most appropriate method for filtering the CHARraw series: They may be filtered using moving mode, median, average, inverse Fourier transform, locally weighted scatterplot smoothing (LOWESS) smoother, or LOWESS corrected for outliers, etc. [Higuera et al., 2009]. Additionally, these filters are applied using varying temporal windows (mostly ranging from circa 300 to 1000 years), and the choice is generally guided by maximizing the SNI and GOF metrics (for a review on filtering methods in recent papers: Higuera et al. ). To address the issue of parameter choices, we modified the CharAnalysis software [Higuera et al., 2009] to include a routine that replicates the analysis of a charcoal record using custom smoothing parameters. We used the five chief available smoothing procedures: Moving Mode (MMo), Moving Median (MM), Moving Average (MA), LOWESS (LOWESS), and robust LOWESS (rLOWESS) filters, and for each procedure, we ran 470 analyses using various smoothing windows t, where t = 100, 103, …, 1501. In total, an ensemble member of 2350 (470 × 5) fire event reconstructions is produced for each sedimentary charcoal record. As in the original procedure, the SNI and GOF are used as a selection criterion for fire event reconstructions. Given that the focus of this study was to test the effect of different filtering methods, we always calculated the CHARpeak component as a residual series (i.e., CHARpeak = CHARraw − CHARback), and we used a locally defined threshold to separate the CHARnoise and CHARfire components. These two procedures protect from biases related to variation in both the mean and variance of the empirical and simulated charcoal series [Higuera et al., 2010]. We did not screen peaks based on the original counts of charcoal peaks [see Higuera et al., 2009], because this procedure is specific to charcoal count data only [Ali et al., 2009]. At the final stage, the FRI distributions are modeled including the median FRI value, the mean number of reconstructed fire events and the Weibull b parameter that is estimated from the FRI distributions for each filtering method. For each reconstruction member fire date, a kernel density function [Mudelsee et al., 2004] is used to calculate FF. The median FF and percentiles of the reconstruction ensemble are estimated based on the distribution of all reconstruction members.
 For the studies requiring precise dating of fire events, we recommend that reconstructions should be guided by a consensus. Therein, within each reconstruction the number of fire events are determined and compared with the number of reconstructions identifying events for each time unit. The 75th percentile of this distribution is computed and selected as a threshold for defining the minimal number of members showing a fire by time unit, for a date being considered as a Robust Fire Event (RFE). This procedure eliminates fire dates related to analytical treatment and bias.
 To demonstrate our procedure, we applied it on sedimentary charcoal records from the subalpine Lago Perso (hereafter “Perso,” 44°54′21″N 6°47′50″E) located at 2000 m above sea level (asl) within the municipality of Cesana Torinese, in the Susa Valley (Italy), and the boreal Lac à la Pessière (hereafter “Pessière,” 49°30′30″N 79°14′25″W) located at 280 m asl in the black spruce forest of Quebec (Canada). The Perso and Pessière charcoal records spans the last 7900 and 7600 years, respectively. Charcoals were obtained using standard laboratory methods [Carcaillet et al., 2001b] and reported as areas (mm2 cm−3). CHARraw was calculated using median temporal resolution intervals, i.e., 14 and 13 years for Perso and Pessière. The Perso and Pessière records were chosen for their high-temporal resolution, the small lake areas (0.4 and 4.5 ha, respectively), and watersheds (0.27 km2 at Perso and, unestimated at Pessière because the area is flat). The statistical properties of CHARraw series required for peak analysis were satisfied in both cases [Carcaillet et al., 2001b; Blarquez et al., 2010]. For a complete description of the sites, see Carcaillet et al. [2001b] and Blarquez et al. . Codes and procedures presented in this paper are available as supporting information and online at http://blarquez.com/page/Codes/.
3.1 Fire-event Reconstructions
 We applied the ensemble-member approach to the Perso and Pessière CHARraw sequences to produce 2350 reconstructions. The member selection involved first excluding the members that have a GOF first quartile < 0.1 and thus showing a poorly modeled CHARnoise. Within the resulting members the 1000 first showing the highest SNI were selected to produce the final ensemble member (corresponding to 43% of the initial members).
 Figure 1 illustrates the SNI and GOF distributions for the Perso and Pessière charcoal analyses. Smoothing windows ≥ 200 years gave a generally high SNI value for the Perso record (Figure 1a). This window size corresponds approximately to a width of 15 charcoal samples, which is below to the theoretical recommended 30-sample width for accurate peak detection [Higuera et al., 2009]. For the Pessière charcoal record, this sample size was generally not sufficient for accurate peak detection; an optimal window width was at least 57 to 76 samples (corresponding to circa 750 to 1000 years depending on the filtering method, Figure 1b). The GOF distribution is marked by the occurrence of low scores indicating that the Gaussian mixture model sometimes fails to accurately model the CHARnoise population, notably for the rLOWESS, MM, and MMo filters at Perso (Figure 1a). Based on the GOF criterion, the excluded reconstructions concerned window widths superior to 500 to 1000 years (Figures 1a and 1b). The rLOWESS, MM, and MMo filtering method appeared particularly prone to induce a poor distinction between CHARfire and CHARnoise populations at Perso (only 149, 135, and 122 reconstructions left, respectively). Contrary, at Pessière, the lower number of reconstruction left is from the LOWESS and MA filtering methods (110 and 136 reconstructions).
 The RFE for each lake ensemble member analysis are shown in Figure S1 in the auxiliary material. From the Perso ensemble members, we were able to determine 33 fire events shared by the majority of the members (RFE) and not emerging from analytical bias (Figure S1a). At Pessière, 45 RFE were determined from the members' analysis (Figure S1b).
3.2 Variations Within the Reconstructed Fire Events
 Figure 2 shows FRI distributions by filtering method for the members shown in Figure 1, minus the excluded reconstructions (section 3.1). For both Perso and Pessière reconstruction members, we observed that, on average, the rLOWESS, MM, and MMo filters detected more events compared with the LOWESS and MA methods. For Perso, the rLOWESS, MM, and MMo filters gave similar mean FRIs of 189, 193, and 185 years, respectively (p > 0.05, Wilcoxon-Mann and Whitney U-test, Figure 2a). The situation is similar at Pessière, where the maximum event numbers were recorded using the rLOWESS, MMo, and MM filters (respectively, 62, 60, and 60 fires on average, with 122, 122, and 124 years associated mean FRIs, Figure 2b), while lower fire numbers were detected for the MA and LOWESS filters (52 and 54 fires on average, with 145 and 138 years associated mean FRIs, Figure 2b).
3.3 Reconstructing Fire Frequency (FF) History
 Fire frequency (FF) is treated here as the median of the ensemble-members FFs (Figure 3). When the ensemble FF is compared with the FF obtained with a single analysis of the Perso charcoal record (rLOWESS with a 700 year bandwidth), we noted only slight differences until circa 1000 cal yr B.P.; after this date, the two reconstructions diverged, with the rLOWESS reconstruction giving a lower fire frequency (Figure 3a). The Pessière ensemble-member median FF was similar to the FF obtained from the 700 years rLOWESS up to circa 2000 cal yr B.P.; after this date, the two reconstructions diverged, with the rLOWESS reconstruction at the upper limits of the ensemble-member reconstructions (Figure 3b). The uncertainty around the ensemble-member median for Perso is two times lower compared with the rLOWESS 90% confidence intervals (CI), until 2000 cal yr B.P. (i.e., a mean departure of 0.0025 fire yr−1 for the ensemble member 5th to 95th percentiles (90% CI), compared with 0.0055 fire yr−1 for the bootstrapped 90% CI). After 2000 cal yr B.P., ensemble-members uncertainty increase up to 0.0033 fire yr−1 (a value close to the 0.0042 fire yr−1 mean departure for the bootstrapped 90% CI). Uncertainty is more defined for the ensemble-member reconstruction from Pessière, where FF are highly conservative between 8000 and 4000 cal yr B.P., resulting in even lower ensemble-member percentile departures (i.e., a departure of 0.0020 fire yr−1 for the ensemble members, compared with 0.0067 fire yr−1 for the bootstrapped 90% CI).
 Uncertainties related to user analytical choices may lead to different fire event reconstructions, even if the primary assumptions of statistical accuracy are met during the analysis process, i.e., good differentiation of charcoal peaks (high SNI values) and adequate modeling of CHARnoise (high GOF values). A single fire frequency history reconstruction related to an ultimate statistical treatment does not exist. We aim to select quality fire event reconstructions from a pool of members based on rational criteria (SNI and GOF). A consensus may be determined using the ensemble-member approach, which shows a fixed degree of statistical accuracy, from which fire history may be reconstructed. By diminishing the influence of user analytical choice, the main trends in data are highlighted (Figure 3).
 While the aim of most paleofire studies is to analyze general trends in frequency, some approaches require precisely dated fire events. This is the case when the goal is to analyze the effect or response of several variables (e.g., climate, land-use, and vegetation) following fire events using, for instance, superposed epoch analysis [Blarquez and Carcaillet, 2010] or when the goal is to calibrate the sedimentary charcoal signal using trapped charcoal [Clark et al., 1998] and fire scars [Higuera et al., 2011]. We showed here that robust fire events (RFE, Figure S1) could be determined using fire dates from the ensemble members. Fire events that are detected across all filtering methods and window widths, therefore, could be related to major events likely to have profoundly impacted ecosystems. The analysis of an event proxy and its response variable (e.g., in dendrochronology) is generally preceded by a sensitivity analysis aimed at finding a given event (e.g., fire) recorded by the majority of individuals taken from a sampled population (e.g., fire scars) [Dieterich and Swetnam, 1984], or to detect these significant events by comparison with a control population (e.g., detecting insect outbreaks) [Speer et al., 2001]. Sedimentary charcoal analysis generally lacks such control populations: when archives are available in the form of actual fires (from textual archives or dendrochronological series), the comparison between actual fires and fires assessed from sedimentary charcoal is usually limited to a few decades or centuries. While the ensemble member could be considered a population from which significant events can be found, overall sedimentary charcoal analysis will always benefit from calibration or verification using independent data.
 Paleofire assessments generally yield a unique set of possible fire dates, from which confidence intervals (CI) can be calculated by random resampling of these dates. The ensemble-member percentiles are not based on the probability that a detected fire actually occurred but on the detection probability of fires. If fire events are present in all members, then the associated fire frequency CI should diminish (e.g., Pessière record from 8000 to 4000 cal yr B.P., Figures 3b). In contrast, large variability in fire event detection should result in larger CI. Our approach should not only permit users to more accurately model fire frequency but also make it possible to assess CI directly related to fire event detection and thus the variability extant within charcoal record analysis. These new strategies have been successfully applied for two sedimentary charcoal sequences from the subalpine and the boreal biomes, indicating that they could be generalized to other biomes. We caution, however, that despite these statistical improvements, there are situations in which these analyses of sedimentary charcoal records are not recommended. Notably, results may be ecologically inaccurate if they are applied to a system where charcoal peaks tend to be small, or where the sedimentation rate is slow, and therefore, peaks are difficult to distinguish from background trends. Analyses proposed here should only be applied to robust charcoal records, i.e., with high enough sample sizes, good chronology, little sediment mixing, small lakes with a strong local source and small watersheds, not much sediment focusing and, a CHARpeak described by a mixture of two Gaussian distributions with different means and variances [Gavin et al., 2006].
 Financial support was provided by the Forest Complexity Modelling program (CRSNG-NSERC, Canada) to OB, the Canadian Forest Service to MPG, the FIREMAN program (ANR/ERA-net BiodivERsA) to CC, the NSERC of Canada to YB and AAA, the FRQNT to YB, and by grants from the École Pratique des Hautes Etudes to JA and BL and, from the Centre d'étude de la forêt to CC. The research was carried out within the framework of the International Associated Laboratory (LIA France-Canada) CNRS-UM2-EPHE-UQAT-UQAM-UQAC entitled MONTABOR: Forêts montagnardes et boréales: chrono-écologie et aménagement écosystémique durable.
 The Editor thanks two anonymous reviewers for their assistance in evaluating this paper.