Improving geographically extensive acoustic survey designs for modeling species occurrence with imperfect detection and misidentification

Abstract Acoustic recording units (ARUs) enable geographically extensive surveys of sensitive and elusive species. However, a hidden cost of using ARU data for modeling species occupancy is that prohibitive amounts of human verification may be required to correct species identifications made from automated software. Bat acoustic studies exemplify this challenge because large volumes of echolocation calls could be recorded and automatically classified to species. The standard occupancy model requires aggregating verified recordings to construct confirmed detection/non‐detection datasets. The multistep data processing workflow is not necessarily transparent nor consistent among studies. We share a workflow diagramming strategy that could provide coherency among practitioners. A false‐positive occupancy model is explored that accounts for misclassification errors and enables potential reduction in the number of confirmed detections. Simulations informed by real data were used to evaluate how much confirmation effort could be reduced without sacrificing site occupancy and detection error estimator bias and precision. We found even under a 50% reduction in total confirmation effort, estimator properties were reasonable for our assumed survey design, species‐specific parameter values, and desired precision. For transferability, a fully documented r package, OCacoustic, for implementing a false‐positive occupancy model is provided. Practitioners can apply OCacoustic to optimize their own study design (required sample sizes, number of visits, and confirmation scenarios) for properly implementing a false‐positive occupancy model with bat or other wildlife acoustic data. Additionally, our work highlights the importance of clearly defining research objectives and data processing strategies at the outset to align the study design with desired statistical inferences.


| INTRODUC TI ON
Remotely deploying acoustic recording units (ARUs) to survey cryptic animals is an important tool for ecology and conservation biology (e.g., Blumstein et al., 2011;Newson, Bas, Murray, & Gillings, 2017;Parsons & Szewczak, 2009). Acoustic recording units are capable of collecting detection/non-detection data on focal species noninvasively and with minimal effort across broad geographic extents, making coordinated monitoring practical and feasible, even for sensitive species (e.g., Acevedo & Villanueva-Rivera, 2006;Loeb et al., 2015). Despite these advantages, the sheer volume of data collected by ARUs often necessitates automated species identification via classification software, resulting in the potential for two types of detection errors: imperfect detection and misidentification. Imperfect detection occurs when the focal species is present but no calls are recorded, or calls are recorded but none are identified as the focal species. Misclassification errors result from the classification software incorrectly assigning at least one recorded call to the focal species, when in fact, the species is absent from a site. Statistical analyses of ARU datasets can provide decision-makers with critical baseline information about the probability of species occurrence (site occupancy) and species distributions for at-risk species (e.g., McClintock, Bailey, Pollock, & Simons, 2010;Rodhouse et al., 2015).
Standard occupancy models can be used to address these research goals while accounting for imperfect detection, but the modeling framework assumes no misidentification errors (MacKenzie et al., 2006). When misidentification errors are ignored, estimators from standard occupancy models can be biased (e.g., McClintock et al., 2010;Miller et al., 2011;Royle & Link, 2006) and lead to unreliable management and conservation decisions.
Bat acoustic surveys that set out ultrasonic microphones for recording bat echolocation calls provide acute examples of how misidentification and imperfect detection may arise and complicate statistical inferences. Challenges associated with traditional capture and visual methods coupled with the increased risk of multiple threats (e.g., Hammerson, Kling, Harkness, Ormes, & Young, 2017;Jones, Jacobs, Kunz, Willig, & Racey, 2009;O'Shea, Cryan, Hayman, Plowright, & Streicker, 2016) have accelerated the widespread use of ARUs for surveying bats. Broad-scale monitoring programs have been initiated across Europe, North America, and elsewhere (e.g., Barlow et al., 2015;Jones et al., 2013;Loeb et al., 2015;Roche et al., 2011;Walters et al., 2012) and rely in part on coordinated acoustic surveys. The complication is that shared echolocation call characteristics from morphologically and ecologically similar bat species can result in incorrect species assignments from automated identification software and misidentification errors for the focal species (Russo, Ancillotto, & Jones, 2017;Russo & Voigt, 2016;Rydell, Nyman, Eklof, Jones, & Russo, 2017). Imperfect detection can occur when all echolocation calls from the focal species are of such low quality that they are filtered out during call processing. Another source is when all calls from the focal species do not exhibit enough distinguishing characteristics to receive a single-species classification resulting in no species assignment or a group classification such as low-or high-frequency bat. Additionally, the focal species could be present despite having none of its calls recorded.
Detection error rates arising from automated identification software are impacted by the focal species' call characteristics and behavior, the choice of detector (e.g., type and model), detector settings (e.g., gain level), detector placement in relation to environmental clutter (e.g., vegetation that may alter call behavior), the classification software used, and the call processing workflow employed. To promote coherency between these important considerations and the eventual modeling framework used, we diagram a generalized workflow for recording, processing, and verifying bat echolocation call files in Section 2.2. This workflow diagramming strategy highlights how typical practices may influence the appropriateness of certain modeling approaches for bat acoustic data and serves as a conceptual model for practitioners interested in designing ARU-based surveys for any taxa.
Standard occupancy models can account for imperfect detection if multiple within-season visits are made to each site (e.g., MacKenzie et al., 2002(e.g., MacKenzie et al., , 2006, a sampling design commonly used for bat acoustic surveys (e.g., Gorrensen, Miles, Todd, Bonaccorso, & Weller, 2008;Rodhouse et al., 2015;Weller, 2008). Furthermore, automated species identifications can be manually verified by a human to remove misidentification errors and provide confirmed detections prior to analysis (e.g., Wright, Irvine, & Rodhouse, 2016). For species that can be verified consistently and truly, the amount of effort and expertise required for this approach is impractical for large-scale coordinated monitoring. The difficulty posed by this verification burden can lead to the naive modeling approach of applying a standard occupancy model to unverified bat acoustic data, effectively ignoring misidentification errors. We propose an alternative option of explicitly modeling misidentifications in a false-positive occupancy model.
Three classes of false-positive occupancy models are outlined in Chambert, Miller, and Nichols (2015): site confirmation models (Miller et al., 2011(Miller et al., , 2013, calibration models (Chambert et al., 2015;Ruiz-Gutierrez, Hooten, & Grant, 2016), and the observation confirmation (OC) model (Chambert et al., 2015). Chambert, Waddle, Miller, Walls, and Nichols (2018) recently introduced another type of OC model that has the potential to extend inferences to include estimates of relative abundance of some taxa. False-positive occupancy models require auxiliary information about true site occupancy from a subset of sites or calibration information about the detection device's misidentification rate to ensure estimates of detection probabilities are unique. To our knowledge, Clement, Rodhouse, Ormsbee, Szewczak, and Nichols (2014) provide the only application of a falsepositive occupancy model to a bat acoustic survey. Clement et al. (2014) drew on mist-netted bats as true detections from a subset of sites to inform the probability of misidentification using a multiple method site confirmation model (Miller et al., 2011). However, capturing bats is invasive, costly, and quickly becomes impractical for geographically extensive surveys. It is also debatable whether hand captures constitute true detections for certain bat species, as many species are morphologically cryptic (e.g., Rodhouse, Scott, Ormsbee, & Zinck, 2008;Rodriguez & Ammerman, 2004;Weller, Scott, Rodhouse, Ormsbee, & Zinck, 2007). Bat acoustic data pose challenges for the calibration model as well, as the libraries of echolocation calls used to build classification software for automated species identifications are not always made under realistic conditions . The OC models, on the other hand, show promise for leveraging information from bat acoustic surveys while potentially reducing the manual verification burden.
We extend Chambert et al.'s (2015) OC model to accommodate known sources of heterogeneity in occupancy and detection probabilities and allow for spatially explicit estimates of occurrence and detection probabilities. Otherwise, ignoring potential sources of heterogeneity in detection probabilities could result in biased estimators . Further, our OC model extension allows for more flexibility in allocating confirmation effort than is afforded by the original formulation of the OC model in Chambert et al. (2015). We focus on whether using our extended OC model for analyses can provide a way to increase efficiency of ARU-based surveys through reduced confirmation effort. Our investigation into confirmation effort is complementary to the one presented in Chambert et al. (2018). Here, we focus on exploring a simpler OC model that does not rely on specifying an appropriate statistical distribution for nightly bat activity.
We use simulation to compare the three approaches for addressing misidentification errors in statistical modeling of ARU-based surveys: (a) REMOVE, removing them and applying a standard occupancy model; (b) IGNORE, ignoring them and applying a standard occupancy model; and (c) MODEL, using our extended OC model to explicitly account for them in the modeling framework. We compare approaches with respect to their estimator properties such as bias, precision, and coverage. We also explore how the allocation of confirmation effort affects OC parameter estimator properties.
Our simulations were based on species-specific parameter estimates from real bat acoustic data and illustrate the importance of using available pilot data when considering sample size questions.
Importantly, we provide a fully documented R (R Core Team, 2016) package, OCacoustic, for conducting customized investigations.
To improve access and applicability for practitioners, all of our functions incorporate the common r-formula syntax used in glm and occu. The package is bundled with an extended vignette providing instructions and guidelines for its use (Appendix S3).

| General terminology
The unit of analysis or sample unit for occupancy models is commonly a predefined spatial unit (MacKenzie et al., 2006). In our application, sites are defined as 10-km × 10-km grid cells within the state of Oregon, USA, Rodhouse et al. (2012), where the area was chosen based on focal species behavior and analysis objectives. In general, we suggest a probabilistic sampling design for choosing sites (e.g., a design based on the generalized randomized-tessellation stratified (GRTS) algorithm; Stevens & Olsen, 2004). Observations arising from different sites are assumed to be independent, as are those arising from different visits to the same site. We define a single visit as a one-night deployment of an ARU to a unique location within a site.
The replication needed to account for imperfect detection could be spatial replicates with multiple ARUs deployed within a site (if a spatial unit) on the same night, or temporal replicates with one ARU deployed for multiple nights at the same location within a site (although see Wright et al. (2016) for potential drawbacks). It is assumed that the occupancy status of a site is the same for all visits. The standard occupancy model and the OC model both use this terminology and require these assumptions.
Many bat echolocation call files can be recorded during a visit and detection/non-detection could be considered at two different levels: the observation level (i.e., individual recordings of echolocation calls) or the visit level (i.e., aggregating individual recordings up to a visit). For both levels, we define two types of detections: ambiguous detections which can include misidentifications of the focal species, and unambiguous detections without misidentification errors. Species identifications made by automatic software constitute ambiguous detections, whereas, those that are a posteriori verified by a qualified expert are unambiguous. We define verification as the process for obtaining unambiguous observation-level detections, confirmation as that for unambiguous visit-level detections, and the confirmation design as the visit-level detections that are chosen to be confirmed. Although the confirmation of visit-level ambiguous detections is done through verification at the observation level, the verification strategy employed will impact modeling options. Therefore, an important step in the planning phase of any ARU-based survey is diagramming the acoustic data workflow (e.g., Figure 1).

| Bat acoustic data workflow
Diagramming the conceptual workflow for any study begins with clearly articulated inferential goals and objectives (start of Figure 1).
Here, our objective is to use information from bat acoustic surveys to estimate site occupancy probabilities for one focal species with uncertainty. Most importantly, the focal species must be detectable acoustically and the number of sites, number of visits, and visit design should be based on species-specific behavioral characteristics (e.g., MacKenzie et al., 2006). Before deployment, detector locations and settings should be chosen specifically for species of interest following a consistent protocol (e.g., Loeb et al., 2015;NPS, 2016).

| The OC model applied to bat acoustic data
In the context of bat acoustic surveys, the OC model (Chambert et al., 2015;pg. 336) uses observation-level Auto IDs and Manual IDs to estimate visit-level detection and misclassification probabilities (concept diagram in Figure 2) conditional on the ARU workflow employed (e.g., Figure 1). This model assumes the occupancy status of the ith site (i = 1, 2, ···, n) is a Bernoulli random variable with a constant probability of ψ (Z i ∼ Bernoulli(ψ): Z i = 1 if the focal species occupies the ith site, Z i = 0 otherwise). It also assumes species detections during visits occur at occupied sites (visit-level true detections) with probability p 11 and at unoccupied sites (visit-level misidentifications) with probability p 10 . That is, visit-level ambiguous detections during the jth (j = 1, 2, ···, J i ) visit to the ith site also arise from where y ambig ij = 1 if at least one Auto ID is identified to the focal species during visit j to site i, y ambig ij = 0 otherwise (y ambig in Figure 2).
The visit-level unambiguous detection (ν in Figure 2) is assumed to be a multinomial random variable with levels defined by the (dis) agreement between the observation-level Auto IDs and their corresponding verified Manual IDs: (ν = 0) "no detections"-no Auto IDs F I G U R E 1 A bat acoustic survey workflow diagram. This workflow begins with goals and objectives (occupancy modeling in this example) and ends with inferences and conclusions. All intermediate steps influence downstream tasks (blue boxes). A critical step in any analysis is outlining the workflow to facilitate conversation among collaborators and ensure consistency among data collection, analysis, and dissemination of results prior to deploying ARUs. The focus of this diagram is occupancy modeling using bat acoustic data, but similar workflow diagrams can be created for different analysis objectives and/or different animals of interest (e.g., insects, frogs, and birds). The style of this diagram was inspired by the business workflow modeling software, Bizagi Modeler (www.bizagi.com)  We provide more flexibility in how much verification effort is allocated to each site by allowing for a combination of ambiguous (y ambig ) and unambiguous (ν) detections within a site. The likelihood for our extended OC model is written as, where I(C) ij = 1 if the jth visit to the ith site is confirmed and I(C) ij = 0 otherwise. Relationships between site-level covariates (X i ) and site occupancy (ψ i ) and site-and visit-level covariates (W l ij ) and detection probabilities (s l ij ; for l = 0,1) are modeled using the logit link function as, logit( i ) = X i and logit(s l ij ) = l W l ij , for l = 0,1, respectively. The occupancy and detection coefficients are represented by the and l vectors, respectively.
One source of heterogeneity in the observation process for bat acoustic data is the overall quality of calls obtained during a visit.
We expect overall call quality to be a function of detector and microphone placement, the amount of "environmental clutter" (e.g., (1)

VISIT LEVEL, STANDARD
vegetation, rocks, and water surfaces) near the detector, weather (e.g., wind and rain), and potentially other sources. A reasonable proxy to use for call quality is the total number of Auto IDs recorded during a visit, as only calls of a certain quality will ultimately receive single-species Auto IDs. We let K ij be the total number of Auto IDs recorded by the detector deployed at visit j in site i, and we note that K ij is always greater than or equal to zero. We expect that at some value of K ij , the proxy for quality ceases to substantially influence detection probabilities (i.e., the difference between observing 1 and 50 Auto IDs is more reflective of a change in call quality between visits than is the difference between observing 1,001 and 1,050 Auto IDs). Therefore, for our application and simulations, we assume a logit-linear relationship between the natural log of K ij + 1 (adding one to ensure the value being logged is greater than zero) and the detection probabilities, If available, visit-level covariates explaining heterogeneity in detection probabilities (e.g., software packages, regional classifiers, and "clutter") could be directly incorporated into the mean structure through Equation 4. Similarly, covariates also could be included at the site level (e.g. habitat type and average elevation) to account for heterogeneity in site occupancy using a logit link function on ψ i .

| Simulation study design
We were interested in whether the extended OC model was a viable alternative to the REMOVE approach. We conducted a simulation study comparing estimation of occupancy (ψ) among REMOVE, IGNORE, and our extended OC model with a confirmation design where all visits within all sites were confirmed (i.e., all unambiguous data). We included the IGNORE approach to corroborate that ignoring misidentifications from bat species classification software in a standard occupancy model can result in erroneous conclusions (e.g., Clement et al., 2014). To explore the impacts of the confirmation design on parameter estimation using our extended OC model, we also investigated nine different confirmation designs: all, half, or a quarter of sites with all, half, or a quarter of the visits being confirmed (i.e., contributing information in terms of ν (unambiguous) rather than y ambig (ambiguous) in Figure 2). We explored all possible combinations for a range of parameter values ( ,( 0,s 1 , 1,s 1 ), and ( 0,s 0 , 1,s 0 )), chosen to represent realistic speciesspecific characteristics based on empirical data (see Appendix S1 for empirical results). For this study, we specified ψ to reflect narrowly and widely distributed species (Label: low (L), high (H); "Occupancy" in Table 1). We set the regression coefficients associated with correct automated identifications (at least one call recorded and correctly classified to the focal species during a visit ( 0,s 1 , 1,s 1 )) to represent species that were hard, average, or easy to detect (Label: low (L), medium (M), high (H), "Baseline detect" in Table 1).
Similarly, we set regression coefficients associated with automated misidentifications ( 0,s 0 , 1,s 0 ) to represent species that were more easily or less easily confused with other species by the classification software (i.e., harder to misidentify or easier to misidentify; labeled low (L) or high (H) for "Baseline misID" in Table 1, respectively).
For each parameter combination, we generated 500 realizations of data (datasets) assuming the same sampling design as that used in our empirical data (n = 84 sites, and J i = 4 visits for all i), and assuming unambiguous visit-level observations arose from the datagenerating process described by the extended OC model. That is, we assumed a confirmation design where all visits within all sites were confirmed resulting in unambiguous data (ν-values) for all visits. We first generated true occupancy states (Z i ) from a Bernoulli(ψ) distribution. Then, we obtained realistic covariate values for each visit by sampling with replacement from K-values in the empirical data. Using Z i and the K ij -values, we produced visit-level detection probabilities (4) logit(s l ij ) = 0,s l + 1,s l ( log (K ij + 1)); l = 0,1.

Label
Occupancy ( (s 0 ij ∕s 1 ij ) following Equation 4 and generated the unambiguous dataset by taking random draws from the appropriate multinomial distribution for each site (ν ij |Z i = 1 or ν ij |Z i = 0, Equations 1 and 2). Using the unambiguous dataset, we obtained a dataset for the REMOVE approach (y confirmed -values) and a dataset for the IGNORE approach (y ambig -values) according to the relationships conveyed in Figure 2 (e.g., 11 = 1 ⇒ y ambig 11 = 1, y confirmed 11 = 0 for site 1 and visit 1).
We then investigate reduced confirmation efforts for the MODEL

| S IMUL ATI ON S TUDY RE SULTS
For brevity and clarity in Sections 3.1-3.3, we focus on results from the five parameter combinations in Table 1 and a subset of confirmation designs. General patterns for other parameter combinations and confirmation designs were similar, and results are included in Appendix S2.

| Directly comparing IGNORE, REMOVE, and MODEL
The average 95% CIs for occupancy probability were nearly iden-  Figure 3). The IGNORE approach (middle CIs, For all three approaches, precision, coverage, and bias of the occupancy estimator varied based on the species-specific characteristics assumed during data generation. In particular, F I G U R E 4 Average approximate 95% confidence intervals for each OC model parameter (column), computed from 500 simulated datasets generated assuming five parameter combinations (rows) for the OC model, assuming five confirmation designs (denoted C p,d on the y-axis, where p indicates the proportion of confirmed sites and d indicates the number of confirmed visits within those sites). Threeletter row-labels indicate assumed occupancy (L = narrowly distributed, H = widely distributed), baseline detection (L = hard to detect, M = average, H = easy to detect), and baseline misidentification (L = hard to misidentify, H = easy to misidentify). Assumed parameter combinations are shown with large black vertical tick marks along with corresponding average estimates in colored tick marks. Coverage is indicated by color, note that the scale for coverage ranges from 0.9 (brown) to 1 (green), rather than 0 to 1 Figure created using  Coverage precision was sensitive to detectability. For example, we observed wider confidence intervals for occupancy in widely distributed species that were also assumed to be hard to detect (HLL in Figure 3) than we did when their detection was assumed to be average or easy (HMH and HHL in Figure 3; see also,

| Comparing confirmation design scenarios within the MODEL approach
Fitting the OC model to all unambiguous data (OC [1,4] in Figure 4) provides a reference case producing similar results to the verification intensive REMOVE approach. Here, we compare different confirmation designs to assess the OC model's potential to reduce verification effort. We observed that for a fixed number of confirmed visits (e.g., compare CIs for OC [1,2] , OC Therefore, we found the properties of the extended OC model estimator depended on the confirmation design and the species-specific characteristics assumed during data generation. For harder-to-detect species, we observed less than nominal coverage probabilities (<0.95) for all parameters when only two visits were confirmed within a quarter of the sites. For narrowly distributed species, we observed estimator bias for the partial regression coefficients associated with true detection. The widely distributed species, on the other hand, showed bias for partial regression coefficients associated with misidentification. This bias was most pronounced for confirmation designs with half or a quarter of the sites confirmed (e.g., rows HLL, LLL, LLH, and top three CIs). Similarly, for a given occupancy and misidentification rate, we observed wider average CIs for occupancy with harder-todetect species and less confirmed data (HHL vs. HLL, all CIs; Figure 4).
Interestingly, however, the baseline misidentification rate (when moving from L to H, or 0.05 to 0.10 on the probability scale) did not largely affect the confirmation design required to produce an unbiased and precise extended OC model estimator (LLL vs. LLH in Figure 4).   Figure S2.3). This pattern suggests there is a minimum confirmation effort required for fitting the extended OC model and the minimum will vary based on the assumed species characteristics.

| D ISCUSS I ON
We outline an occupancy modeling framework that accounts for misidentification errors and simultaneously reduces the manual verification burden required for defensible inferences to inform conservation and management. This framework also allows for variability in detection probabilities related to field deployment conditions and the classification software used for automated identification; making it a flexible approach for geographically extensive surveys. Standard occupancy models require all visitlevel misidentification errors to be eliminated from a dataset (i.e., REMOVE approach). In comparison, our MODEL approach accounts for possible misidentification errors while still allowing for some ambiguous detections to be included in the analysis. In this way, the extended OC model inherently provides increased efficiency of ARU-based surveys by maintaining tolerable levels of uncertainty with reduced confirmation effort. Generally, we found confirming more sites resulted in more precise extended OC model estimators compared with confirming more visits within sites, but the "optimal" confirmation design was dependent on assumed species characteristics (detectability and occupancy).
We suggest diagramming the workflow (Section 2.2) as a first step in the survey design process because it highlights key deci- However, more methodology that directly uses observation-level information and also accounts for the idiosyncrasies (e.g., volume, data processing pipeline, ad correlation among recorded calls) of acoustic data could be the focus of future work.
Consistent with other work (e.g., Clement et al., 2014;Newson et al., 2017), we found evidence of species misidentification errors from classification software and, if ignored, occupancy was severely overestimated, supporting the need for verification. We assumed the Manual IDs were consistent and true, but if acoustic data are verified by more than one expert, this assumption becomes more tenuous, such that standardizing the workflow is key.
Currently, human verification provides the most reliable source of unambiguous detections for bat acoustic data, but this is subject to change. If call libraries for classification software used to obtain automatic species identifications improve and become more representative of conditions observed in the field, the calibration model (see Chambert et al., 2015;Ruiz-Gutierrez et al., 2016) has the potential to eliminate verification from the acoustic data workflow entirely, effectively removing all costs associated with a manual confirmation design. As discussed in Russo and Voigt (2016), quality calibration information is not currently found among the published literature. Until then, we advocate using the extended OC model to reduce costs associated with the verification process for bat acoustic data, when appropriate.
The OC model rests on the assumption that verification is consistent and true. This appears to be a reasonable assumption for most bats species that occur within a given faunal assemblage (see Fritsch & Bruckner, 2014;Russo & Voigt, 2016;Rydell et al., 2017), but becomes tenuous for rare species, species that are difficult to manually verify, or when multiple verifiers with different levels of experience are working on the same region of interest (Fritsch & Bruckner, 2014;Rydell et al., 2017). For rare bat species or less experienced verifiers, an entirely different survey design and analysis strategy should be pursued (e.g., using captured bats as in Clement et al., 2014) and the assumptions of that approach must also be carefully assessed.
Diagramming the workflow associated with wildlife acoustic data can help facilitate coherency among acoustic call processing and species identification decisions and subsequent statistical analysis and interpretations. In addition to providing functions like those available in unmarked (e.g., occu) for fitting the extended OC model, our r package (OCacoustic) provides the capacity for researchers to conduct their own investigations into design requirements (e.g., sample size and a desired level of uncertainty) prior to collecting data. OCacoustic facilitates exploring trade-offs in extended OC model estimator precision and bias related to number of sites, number of visits, covariate structures at the site level and visit level, assumed data-generating parameter values (ideally coming from estimates from pilot data), and confirmation designs.
Importantly, OCacoustic provides a design tool to increase efficiency of future animal surveys that rely on ARUs to collect detection/non-detection data for estimating spatially explicit occurrence probabilities (species distribution maps) to inform conservation and management.

ACK N OWLED G M ENTS
Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the US Government. R. Rodriguez wrote critical sections of Supporting S1 and manually reviewed the bat acoustic data. K. Banner created the R package and vignette. K. Banner led the writing of the manuscript with significant support from K. Irvine, A. Litt, and T. Rodhouse. All authors contributed critically to the drafts and gave final approval for publication.

DATA ACCE SS I B I LIT Y
Our data are included in our R package which is archived in Katharine