Spherical or spheroidal particle shape models are commonly used to calculate numerically the radar backscattering properties of aggregate snowflakes. A more complicated and computationally intensive approach is to use detailed models of snowflake structure together with numerical scattering models that can operate on arbitrary particle shapes. Recent studies have shown that there can be significant differences between the results of these approaches. In this paper, an analytical model, based on the Rayleigh-Gans scattering theory, is formulated to explain this discrepancy in terms of the effect of discrete ice crystals that constitute the snowflake. The ice crystals cause small-scale inhomogeneities whose effects can be understood through the density autocorrelation function of the particle mass, which the Rayleigh-Gans theory connects to the function that gives the radar reflectivity as a function of frequency. The derived model is a weighted sum of two Gaussian functions. A term that corresponds to the average shape of the particle, similar to that given by the spheroidal shape model, dominates at low frequencies. At high frequencies, that term vanishes and is gradually replaced by the effect of the ice crystal monomers. The autocorrelation-based description of snowflake microstructure appears to be sufficient for multi-frequency radar studies. The link between multi-frequency radar observations and the particle microstructure can thus be used to infer particle properties from the observations.
If you can't find a tool you're looking for, please click the link at the top of the page to "Go to old article view". Alternatively, view our Knowledge Base articles for additional help. Your feedback is important to us, so please let us know if you have comments or ideas for improvement.
 Precipitation remote sensing instruments in the microwave regime are commonly used to observe falling snow. Recently, the motivation to understand the scattering properties of snowflakes has been driven by the growing interest in using millimeter-wave radars to measure snowfall remotely. Such radars are used, in particular, for global observations by satellites such as the currently orbiting CloudSat, operated by the National Aeronautics and Space Administration (NASA), Global Precipitation Measurement (GPM) under development by NASA and the Japan Aerospace Exploration Agency (JAXA), and EarthCARE, which is being developed by the European Space Agency (ESA) and JAXA.
 A number of recent studies using accurate computational scattering models and detailed shapes [Ishimoto, 2008; Botta et al., 2010; Petty and Huang, 2010; Tyynelä et al., 2011] have indicated that at sizes that are large compared to the wavelength, the use of homogeneous spherical or spheroidal shape models leads to a misestimation of the backscattering cross section of snowflakes. On the other hand, at the small-particle limit, such models are valid (except for the effect of particle non-sphericity, especially with regard to polarimetric variables) under the Rayleigh scattering law, and their applicability has also been successfully extended further using other theoretical methods and found valid in practice [e.g., Leinonen et al., 2011; Hogan et al., 2012]. Leinonen et al.  found that the spheroid-snowflake model is sometimes, though not always, incompatible with experimental results. In the worst case, the error in the backscattering cross section can be as large as orders of magnitude.
 The cross section that is obtained by using the spheroidal shape model depends on the selection of the effective size of the equivalent spheroid. There are a number of different ways to determine the effective size, including the maximum diameter, the equivalent-volume diameter, and the radius of gyration [see, e.g., Hogan et al., 2000; Donovan et al., 2004; Westbrook et al., 2006]. In practice, the maximum diameter has been commonly used because it is readily measurable. However, the results by Petty and Huang , Kneifel et al. , and Leinonen et al.  indicate that an effective radius cannot be chosen consistently with any method if more than two wavelengths are used.
 Because of the complexity of the shapes of snowflakes, parametrization of their scattering properties in terms of their physical properties is needed in order to interpret remote sensing measurements. The number of particles contained in the typical volume of interest (a single radar bin) is large, and thus, it is often sufficient to consider the average scattering properties. A very large number of parameters is usually required for a complete description of an aggregate particle, but it may be possible to simplify the particle model greatly as it is enough to consider only the average backscattering.
 One hypothesis that has been made in various forms [e.g., Fabry and Szyrmer, 1999] is to make spheroidal shape models more realistic by accounting for the decreasing average density of the snowflake as a function of the distance from its center. This feature is not captured by the fully homogeneous sphere and spheroid models. We show in this paper that this is not sufficient at large wavelength-size ratios; instead, the small-scale inhomogeneities of the internal structure of the snowflakes must be considered in order to reproduce the backscattering cross section. The structure can be understood in terms of the density autocorrelation function. Based on the Rayleigh-Gans approximation, we demonstrate a simple parametrization of this function in terms of the particle structure and provide theoretical justification for its form.
2 Theory of Scattering by Aggregates
2.1 Rayleigh-Gans Theory
 The Rayleigh-Gans approximation (RGA) can be used to compute the scattering properties of a particle in the absence of interactions between parts of the particle. Following the notation of Bohren and Huffman , the RGA gives the backscattering cross section (also called the radar cross section) of a particle as
The notation is as follows: k = 2π ∕ λ is the wave number, λ is the wavelength, the dielectric factor K = (n2 − 1) ∕ (n2 + 2) where N is the complex refractive index of the material, m is the mass of the scatterer, ρ is the density of the material, and f is the Rayleigh-Gans form factor. The latter, a complex number, can be defined as a volume integral
where is the imaginary unit, R is a vector from a reference point to the given point (we use the center of mass as the reference point, but this is not mandatory), is a unit vector in the incident beam direction (defined here as being along the z-axis), and is a unit vector to the scattering direction. We define the probability density function (PDF) of the mass distribution of a particle, p(R), such that
where V is the total volume of the particle; this definition satisfies the normalization property
 The form factor contains information about the shape and the orientation of the particle, acting as a modification to the Rayleigh scattering law; at the small-particle limit, f → 1 and the RGA converges to the Rayleigh approximation for the equi-voluminal sphere. The form factor and, consequently, the RGA are intuitively understood as the integration of the independently scattered waves from all parts of the scatterer, neglecting any electromagnetic interaction between its parts.
 In the backscattering direction, . Then the form factor is given by the Fourier transform of the mass distribution in the z direction, p(z):
where we used the common notation for the Fourier transform of a function f (z), which is defined in this paper as
This elegant feature of the RGA has been pointed out and analyzed by, e.g., Sorensen ; that author also presents the generalization to other scattering directions that is omitted here for the sake of brevity.
Tyynelä et al.  showed that the accuracy of the backscattering cross section given by the RGA is almost always within 6dB, usually within roughly 2dB, of the corresponding results of volume integral calculations performed with the discrete dipole approximation (DDA) method (see section 3.5 for a confirmation of these assumptions for the snowflakes used in this paper). Although these differences are not negligible and larger errors can possibly be found in cases that were not considered by Tyynelä et al. , the RGA can be expected to reproduce the important features of snowflake radar backscattering, the effects of the interactions being minor corrections. This is particularly true if only qualitative analysis is required. The purpose of this paper is to understand the mechanisms and conditions that lead to the applicability (or lack thereof) of average-particle models, so the RGA is highly suitable for our purposes.
 Another convenient feature of the formalism given by (1) and (5) is that it is very general and independent of assumptions about the particle shape. The particle model can be a set of cubes in a regular lattice (such as with DDA calculations), a cluster of analytically defined shapes (as with cluster Mie calculations), a continuous and possibly inhomogeneous mass distribution, or even a fractal. For these, and indeed for practically any realistic ways to describe the particle, the distribution function of the particle mass is square integrable, and thus, f is well defined.
 We remark that there is some ambiguity in terminology between different authors using the RGA and related methods. We use f to denote the form factor, as defined in (2) and (5) according to Bohren and Huffman , consistently throughout this paper. However, we also adopt the “structure factor” S = | f |2, which is used by some authors especially in statistical physics, as it turns out to be a convenient shorthand. The structure factor is closely related to the concept of power spectral density in signal processing. Westbrook et al.  used f and the term “form factor” to denote the same quantity as our S; this difference in notation should be noted while comparing these studies. Due to the generality of the method, the name of the RGA itself also varies by author and context; see, e.g., Bohren and Huffman  for an overview.
2.2 Average-Particle and Second-Order Models
 As mentioned before, the number of scatterers measured simultaneously with a radar is very large, and thus, we recognize that the law of large numbers permits us to only consider ⟨σb⟩, the expected value of the backscattering cross section. Considering particles of a fixed mass and material, it can be seen from (1) that to obtain ⟨σb⟩, we need to know ⟨S⟩ = ⟨ | f | 2⟩, which is defined as
The quantity on the right-hand side of this equation is the average power spectral density of the mass distribution. As it is well known especially in signal processing literature, the power spectral density is simply the Fourier transform of the autocorrelation function [Mitra, 2002]. As the Fourier transform is a linear operator on functions, the expected value can be moved inside the transform, and thus, ⟨S⟩as a function of k is simply related to the Fourier transform of the average density autocorrelation function ⟨R(z)⟩:
where R(z) of a particle is defined in terms of its mass distribution in the beam direction, p(z), as
The connection between the form factor and the density autocorrelation function has been given previously in an aggregate scattering context by, for example, Sorensen .
 For an ensemble of aggregates of various sizes and masses, an examination of (1) shows (as noted by, e.g., Westbrook et al. ) that the above result can be generalized as follows: the deviation of the effective backscattering cross section from the Rayleigh assumption as a function of the wave number k can be determined from the m2-weighted average of the S(k) functions of the individual particles. We use the shorthand throughout this paper to refer to this function of the particle ensemble. The inverse Fourier transform of this gives the m2-weighted average of the density autocorrelation functions, . This means that with knowledge of the average structure factor over the whole spectrum, it is possible to reconstruct the average density autocorrelation function perfectly. Furthermore, the radar reflectivity is proportional to σb ∕ k4 and therefore (except for small variations in refractive index as a function of frequency) also to ; this means that gives the change of reflectivity with frequency relative to the low-frequency limit.
 Although radar measurements cannot determine over the full frequency spectrum, multi-frequency radars can still be used to measure it at discrete points. Interpolating and extrapolating with the help of a proper model, it is possible to estimate ; in fact, this function is the only physical property of the snowflake ensemble that can be directly retrieved using multi-frequency methods, with the assumption that the RGA is valid. Thus, we argue that physical and scattering properties of snowflakes should be related to each other through understanding how the physical properties affect the autocorrelation . In the rest of this paper, we examine how the density autocorrelation function behaves in realistic snowflakes and how it can be inferred from measurements.
3 Model Experiments
3.1 Aggregate Snowflakes
 For analysis of snowflakes, model particles were generated with an iterative algorithm based closely on that by Westbrook et al.  and Westbrook . The algorithm begins with an initial population of N randomly oriented monomers built from small volume elements. Since snowflakes aggregate primarily because of differences in their vertical velocities, the collision probabilities of particle pairs are computed on each iteration from the aerodynamic cross section and the mass of the particles, and according to these probabilities, a pair to collide is randomly selected. The larger particle in the pair impacts the smaller one directly from above at a random position on the horizontal plane; the particles stick together rigidly at first impact (the collisions are not perfectly efficient in nature [Pruppacher and Klett, 1997], but this can be neglected as the purpose of the algorithm is merely to build an aggregate from a given set of monomers). The resulting particle is oriented randomly and returned into the population; the process is repeated until only one particle remains.
 The monomers used in this study were dendrite ice crystals generated using the algorithm of Reiter . Their thickness was determined using an empirical relation given by Pruppacher and Klett .
 For testing under different conditions, we generated five data sets consisting of aggregates generated using varying sizes and numbers of crystal monomers. The first three data sets consist of aggregates made up of monodisperse crystal monomers, each 0.5mm in diameter. Although the crystals in natural snowflakes are not monodisperse, these data sets were included in order to carry out a simplified analysis of the microstructure, so that the essential features of the structural model can be demonstrated without the complicating factor of variable monomer size. The other data sets consist of aggregates where the monomer maximum diameter was sampled from an exponential distribution with a rate parameter Λ = 1 mm − 1, truncated at small and large sizes such that the maximum diameter of the monomers is in the range 0.2 mm < Dmax < 3 mm. In one of the polydisperse data sets (PMIX), a mixture of differently sized aggregates was used. The properties of the data sets, along with the abbreviations used for them later in this section, are summarized in Table 1. The number of aggregates per data set varies because for some data sets, a larger number of aggregates was generated to test the convergence properties better.
Table 1. An Overview of the Data Sets Used in Section 3. The Rightmost Columns Show the Average Maximum Dimension Along the z Axis ⟨Dmax⟩ and the Average Mass ⟨m⟩
3.2 Average Mass Distribution
 The physical properties of a single snowflake are shown in Figure 1. It can be seen from Figure 1b that the single-particle mass distribution has no simple form. For better insights about the distribution of the typical positions of the monomers in the aggregates, we can instead examine the average mass distribution of a number of particles. In Figure 2, we show the probability distribution function (PDF) of the mass distribution (around the center of mass) of the data set M50. It is clear that the average mass PDF corresponds with a high accuracy to the normal (Gaussian) distribution, even though the mass distribution for the individual particles can be very different from it.
 Aggregate formation is a complex random process. Thus, the normal mass distribution is perhaps not surprising considering that according to the central limit theorem of probability theory [Jaynes, 2003], the PDF of the sum of random variables tends toward normal as the number of random variables increases. We posit, consistently with this result, that at the limit of a large number of aggregates, the average three-dimensional distribution of the positions of the monomers within an aggregate follows a three-variable normal distribution. The marginal distributions of a multivariate normal distribution are also normal, reproducing the above result.
 In Figure 2, we also show a mass distribution that corresponds to a homogeneous sphere with the same standard deviation as the normal distribution. It is a reasonably good approximation of the correct distribution, which shows that one way to justify the spheres or spheroids as models of aggregate particles is to consider them as models of the average mass distribution. For this reason, we shall refer to the Gaussian, spherical, and spheroidal shape models together as “average-particle models”.
3.3 Modeling the Density Autocorrelation Function
 In Figure 3a, we show the average density autocorrelation function for the model snowflakes of data set M50, normalized such that . Figure 3b depicts the corresponding function, derived by taking the discrete Fourier transform (DFT) and given as a function of frequency ν for easier interpretation (ν = ck ∕ 2π where c is the speed of light). The functions determined computationally from the data are shown as the thick gray line. In Figure 3b, we also show, for various commonly used frequencies, dots corresponding to the m2-weighted average of S as calculated directly using (2). This verifies that the results of the indirect determination of S (by computing the autocorrelation function and taking the DFT) match those of the direct computation.
 At short distances (≲1.0 mm), the autocorrelation function seems to fall off linearly as a function of distance. At longer distances, the slope of the function becomes less steep, gradually approaching zero and reaching it at the maximum snowflake diameter of M50, 4.7mm. Closer examination reveals that at distances shorter than approximately 0.4mm, the shape deviates slightly from linear, giving a narrower peak than a purely linear relationship would. The main features of are a Gaussian shape at frequencies up to ν ~ 65 GHz (note the logarithmic scale in Figure 3b) and a gradual widening beyond that. In the following analysis, we construct a model that explains the features of and, consequently, of .
 As a starting point, we use the assumption that the density autocorrelation function is a zero-centered Gaussian, as it would be if the mass distribution was normal for each individual aggregate. This function is given by
where σm is the standard deviation of the monomer mass distribution and σa is the standard deviation of the monomer positions (see appendix A). This function is plotted as a dashed line in Figure 3. However, the Fourier transform of a Gaussian function is also a Gaussian [Jaynes, 2003], which corresponds to a parabola in Figure 3b. The autocorrelation function computed from the model aggregates is clearly not parabolic, and it can be seen that the Gaussian of Figure 3b falls off rapidly after approximately 90GHz and quickly starts to deviate drastically from the computed value. Additionally, this assumption does not reproduce the shape of the function at short range: a Gaussian function would be wider around z = 0 than the derived from synthetic data in Figure 3a.
 The discrepancy at high frequencies can be resolved by taking the effect of the monomers into account. As a simplistic model, one can consider an aggregate as a collection of Ne monomers whose mass distribution is normal with a standard deviation σm and whose positions within the aggregate are independent and identically distributed, also according to the normal distribution. Then the expected R(z) is as a weighted sum of two Gaussian functions (see appendix B). Ne should be interpreted as an “effective” number of particles and is not equal to N, as the positions of the monomers are not truly independent as we assume in the derivation: nearly equally validly, the aggregate could be considered to consist of N ∕ 2 two-monomer clusters, N ∕ 3 three-monomer clusters, and so on. For the same reason, the size of the monomers needs to be corrected with a factor b to give an effective monomer size. Our model autocorrelation function is then
This model modifies the average-particle model of (10) with a correction that is relevant at high frequencies; at low frequencies, the average-particle term dominates. The correction term corresponds to the effect of the inhomogeneities that the monomers introduce.
 The linear behavior at z = 0 can be reproduced by using a truncated Gaussian. We approximate the average autocorrelation corresponding to a truncated normal mass distribution as G(z), a Gaussian multiplied by a triangular function:
where σ is the standard deviation of the mass distribution, a is a correction factor that is included to compensate for effects introduced by, e.g., truncation of the Gaussian, D is the maximum dimension of the snowflake in the z direction, and c is a normalization constant that ensures that the function is a PDF. The variables σ and D are both measures of the size of the particle. By analogy to the pure Gaussian model, the following autocorrelation function can be justified:
 Simply choosing representative σ and D is not sufficient to reproduce the shape at longer distances: for instance, using the average ⟨D⟩ = 2.5 mm of the particle ensemble would force G(z) = 0 at distances longer than that; this is obviously not correct. Although the particles were constructed with identical parameters of the aggregation code, the size of the particles is highly variable, D ranging from 1.5mm to 4.7mm. The length measures are also highly correlated: the correlation coefficient of σ and D was ρ = 0.90 with D = 3.3σ on average. In order to account for the joint distribution properly, we computed this function for each σ,D pair and took an m2-weighted average of the functions (this roughly amounts to an integration over a size distribution). The parameters b and Ne were determined by nonlinear least-squares fitting such that the model matched as well as possible. The correction factor a is another free variable, but in practice, we found a = 2.0 to work very well for every tested case. There may be an underlying theoretical reason for this, but we did not investigate further in the current study. The parameters σm and Dm were sampled from the monomers using the same method. For the model of (11), the same approach was used, except for ignoring the maximum dimensions D and Dm.
 Using the Gaussian mixture model of (11) resulted in least-squares estimates Ne = 49 and b = 1.12 for the snowflakes of M50; this function is shown as the dash-dot line. The Gaussian-triangular mixture model (13) resulted in Ne = 142 and b = 1.00; the corresponding estimate is shown as the thin solid line. Both functions reproduce and quite accurately.
 The same models were applied on the data set M300 as shown in Figure 4. For this data set, we obtained Ne = 40 and b = 3.88 for the Gaussian mixture model (11) and Ne = 235 and b = 3.20 for the triangular-Gaussian mixture (13). For P100 (Figure 5), the convergence of the functions to the averages was not as good and thus there is some noise left in the structure factor. We achieved a good fit to data with Ne = 50, b = 3.66 for (11) and Ne = 42, b = 5.89 for (13).
 To simulate what might happen in the usual situation where particles of various sizes are contained in a radar volume, we applied the model to PMIX (Figure 6). The same model also seems to be applicable in this case, with Ne = 45, b = 6.68 for (11) and Ne = 92, b = 7.57 for (13). It is encouraging that there appears to be no need to increase the number of free parameters when differently sized aggregates are present.
 Finally, to demonstrate a case where the average-particle models are approximately applicable for all commonly used radar frequencies, we used the same approach for the data set M5 (Figure 9). In this case, the estimated parameters were Ne = 2, b = 2.48 for (11) and Ne = 3, b = 1.50 for (13).
 In practice, we found b and Ne to be difficult to decouple. Similar results were obtained with considerable increases of b if Ne was decreased to compensate, or vice versa. This is perhaps not surprising considering the ambiguity of interpreting these parameters and the ad hoc nature of their introduction. Less intuitively, values of Ne larger than N were found in a few cases. It is not clear at present if there is a physical reason for this or if it is an artifact of the fitting procedure.
3.4 Implications for Multi-Frequency Radar Measurements
 Determining the physical properties of snowflakes from radar measurements is an important practical measurement problem. Multi-frequency measurements can measure at discrete points, from which the whole function can be interpolated, preferably with the aid of a suitable model. Unfortunately, the shape of the function is only partially predictable and can exhibit variation outside the range of measurements. The highest frequencies that would be required for full reconstruction would often be higher than the current frequencies used in cloud radars, around 95GHz at maximum, although the frequency range needed in practice would depend on the size of the snowflakes and the monomers. Attenuation constraints limit the usability of radars heavily at higher frequencies.
 From the number of free parameters of the function, one can estimate the minimum number of collocated radar measurements required to reconstruct the function. Because different masses and sizes are present in a radar volume, the size or mass distribution requires 2–3 parameters, including the number concentration. Additionally, a mass-dimensional relationship of (typically) two parameters is needed. Those quantities are also needed separately for the effective monomers in order to model their effects properly. Then apparently the number of free parameters can be quite large, perhaps 8–10 depending on how many simplifications can be made. This is more than is practically ever simultaneously available from real measurements (although it may be possible to obtain more information from polarimetric and Doppler data), but since the functions in Figures 3– 6 do not seem to be particularly complex in shape, we consider it unlikely that the number of parameters required to represent them would be that high. It is not evident how this function should be constructed using fewer parameters; we think that this problem should be treated thoroughly in a later work. As an observation from our current results, however, we note that at low frequencies, the form of follows a Gaussian function even for ensembles of differently sized aggregates. This allows a rough estimation of the representative particle size from low-frequency measurements, as demonstrated in Figure 7. This finding is also consistent with Guinier's law, which states that at low frequencies, for a constant c, | f (k) | 2 ~ 1 − ck2: these are the first two terms of the series expansion of a Gaussian function.
3.5 Comparison to Exact Models
 The performance of RGA calculations on snowflakes was examined in detail by Tyynelä et al. . The snowflake models used in that study were, however, slightly different from ours, and thus, we made our own comparisons to ensure that the RGA is also valid for computing the backscattering from our snowflakes. As a reference, we used DDA calculations performed using the ADDA software [Yurkin and Hoekstra, 2011], as well as various models that are exact for analytical shapes.
 First, we tested the RGA against a sphere model, where each sphere was assigned the same mass and radius of gyration as the corresponding snowflake. The backscattering cross section was computed using Mie scattering, which is exact for spheres. The electromagnetic properties of the snowflake were modeled using the Maxwell-Garnett effective medium approximation (EMA) computed from the snow density, using ice as the inclusions and air as the matrix (which is probably justified at low ice densities, characteristic of snowflakes). Second, due to the popularity of layered spheres as snowflake models [e.g., Fabry and Szyrmer, 1999; Szyrmer et al., 2012], we used a two-layered sphere, again conserving the mass and the radius of gyration and optimizing the remaining parameters such that the density reproduces that of the corresponding Gaussian model as accurately as possible. For a Gaussian particle with a radius of gyration Rg, this yields the radius of the inner layer as a = 1.08Rg, the radius of the outer layer as b = 1.36Rg, and a density ratio of 4.91 between the inner and outer layers. The layered model corresponded to Model 4 of Fabry and Szyrmer ; the inner layer density was low enough to require the use of the same EMA (Maxwell-Garnett with ice as inclusions) for both layers. Third, because the Gaussian model was earlier found to be the most realistic average-particle model, we also compared to that, but because the inhomogeneity of the Gaussian model makes it intractable for Mie solvers, we used the RGA solution for Gaussian models instead.
 In Figure 8, we show the backscattering cross section as a function of frequency for the data sets M50 (Figure 8a) and P100 (Figure 8b), calculated using the above-mentioned models. For Figure 8b, only a 20-aggregate subset of P100 was used due to the high computational requirements of the DDA with such large targets. The agreement of the DDA and the RGA is quite good, especially when compared to their difference to the average-particle models. There is no significant difference in the performance of the single- and dual-layered sphere models. On the other hand, the backscattering cross section from the Gaussian model vanishes rapidly to extremely small values. At high frequencies, the difference of the average-particle backscattering cross sections and those from the DDA is much larger than the difference of the DDA and the RGA. From this, we can conclude that the use of the RGA is valid for the purposes of this paper and that the difference of average-particle and detailed snowflake models is primarily due to the difference in the complexity of the shapes, with electromagnetic interactions (which are neglected by the RGA but modeled by the DDA and Mie methods) playing a comparatively minor role.
 The description of scattering from aggregate snowflakes presented in this paper provides a physical basis for why, and when, the average-particle models of snowflakes are able (or unable) to describe backscattering from snowflakes. If the average-particle model is used, it is assumed that the mass density functions of similar-sized particles average out. We have demonstrated why it is the density autocorrelation function, and not the mass density, that is averaged when a large number of particles are considered together. The difference of the two approaches can be quite dramatic, especially when the structure factor is close to zero on average, but exhibits variability between individual particles.
 It can be summarized from the detailed treatment of the above sections that the following approach can be used to calculate the average backscattering cross section of an ensemble of simultaneously measured snowflakes:
 Determine density autocorrelation functions R(z) of each snowflake using (9).
 Calculate the mass-squared weighted average of the autocorrelation functions, .
 Determine the mass-squared weighted average structure factor as a function of the wave number k by taking the Fourier transform of according to (8).
 Calculate the average backscattering cross section as a function of the wave number as
 We formulated a particle model based on the Gaussian function, which, with its tailed structure, was found in section 3.2 to be a more realistic model of the average mass distribution than a spherical model. The Gaussian function also has a number of extremely convenient properties, among others the central limit theorem, being its own Fourier transform, and the convolution of two Gaussian models producing another Gaussian. These properties not only support the model theoretically but also allow the derivation of analytical results that facilitate the understanding of the observed features of the density autocorrelation function and the structure factor. However, the same general reasoning about the difference of autocorrelation-based and average-particle models can be expected to apply to the spherical and spheroidal particle models even if the details are different. The departure from the average-particle models is due to small-scale inhomogeneities (the monomer crystals) that are not accounted for in the Gaussian or spherical descriptions. Although the decrease in the structure factor with frequency exhibited by the spherical models is not as drastic as that of the Gaussian model, and thus, the spheres seemingly perform better at high frequencies (see Figure 8); this is attributable mainly to a mathematical coincidence rather than better modeling of the particle microphysics by the spherical models.
 Other authors [e.g., Schmitt and Heymsfield, 2010] have treated the inhomogeneities in aggregate snowflakes by using fractal geometry. The connection between the fractal dimension and the density autocorrelation function has been given by, for example, Sorensen . We have not used the fractal description explicitly in this paper for a number of reasons. Most importantly, the results obtained by constructing the density autocorrelation function statistically are more quantitative and yield more insight into the role of monomers as the inhomogeneities that cause the failure of the average-particle model at high frequencies. Additionally, describing the particles with a single fractal dimension ignores the fact that because of the different growth processes of the aggregates and the monomers, different length scales may exhibit different fractal behavior: at high frequencies, it is the fractal dimension of the monomers that is relevant (and different high-frequency behavior would be expected for monomers with a different fractal dimension). Furthermore, when using the fractal description for particles of limited size, the variation in average-particle density with distance from the center must be accounted for by using a cut-off function [Sorensen, 2001]. In the description presented here, these limited-size effects arise naturally.
 Our approach can also be compared with that of Westbrook et al. , who constructed a fit between two known extremal behaviors of backscattering. Their model matched the directly calculated values of the structure factor well, but the parameters of their polynomial fit lack physical context, which was sought with the model presented here; even though empirical parameters are also found in our model, they are formulated as adjustments to a physically based model instead. It is interesting to note that an approximately Gaussian shape of is apparent in their Figure 5.
 The derivation of the density autocorrelation function shown in appendix B shows that the effects of monomers emerge as a correction to the average-particle model. This correction is relevant at short distances and, correspondingly, at high frequencies. This also shows that the average-particle models can be expected to be accurate at low frequencies, whereas they are likely to fail at high frequencies where the monomers become relevant. As the Gaussian function is a better average-particle description than the sphere, but diverges extremely rapidly above some given frequency, we suggest that it could, in some cases, be used as an indicator for the applicability of average-particle models: if the values given by the Gaussian model are implausibly small at a given frequency, the spherical model can also be expected to be inaccurate at that frequency.
 The same derivation also demonstrates a physical explanation as to why dual-frequency measurements can retrieve the size of aggregate snowflakes: the shape of the structure factor as a function of frequency is dependent only on the size of the aggregates at low frequencies; the monomers are only relevant at higher frequencies. The particle size is determined from the m2-weighted average density autocorrelation function ; this results in a similar “representative size” measure as the mass-weighted mean diameter (although it is not obvious how exactly these two measures are related to each other). On the other hand, the mass-dimensional (m-D) relationship only affects by determining the weighting of differently sized snowflakes from the ensemble. This suggests that the effect of the m-D relationship on is difficult to discern, and thus, the coefficients of the m-D relationship may not be directly retrievable from multi-frequency measurements.
 It is noteworthy that the large dynamic range of the average structure factor means that the effects of the monomers are essential at high frequencies, even though they are almost unnoticeable in . This unfortunately means that would have to be determined with extremely high precision in order to reconstruct properly, requiring measurements of the mass PDF at a spatial resolution on the order of the spatial scale of the monomer microstructure. Conversely, there is a positive implication for multi-frequency measurements: , and thus, the retrieved particle size is not very sensitive to errors in the estimate of , as could be seen in Figure 7.
 Since the sizes of the snowflakes we used were fairly realistic, we are able to make some predictions regarding the applicability of average-particle models. In all cases that we tested, the C- and Ku-band measurements are in the aggregate regime, which corresponds to the average-particle description. For cases other than that of Figure 9, the W-band measurements, and with larger particles also the Ka-band measurements, were in the transition or monomer regimes. While the commonly used spherical and spheroidal models may not give correct results when measuring large snowflakes with millimeter-wavelength radars (Ka band and higher frequencies), they are applicable at lower frequencies. It should also be emphasized that our usage of the term “lower frequencies” extends well beyond the Rayleigh regime.
 In this paper, we have constructed a model that explains the main features of the radar backscattering properties of aggregate snowflakes. Our model is based on treating the aggregates as collections of monomers that are distributed within the aggregates according to the normal distribution, which we found to be realistic in the average case. The model is based on the Rayleigh-Gans approximation that, while not exactly correct, has been found to be quite accurate for snowflakes and allows for much more straightforward mathematical analysis of the connection between microphysical and backscattering properties of the aggregates, facilitating the understanding of the scattering properties at various frequencies.
 Most importantly, the new description addresses the question: when and why are spherical or spheroidal snowflake shape models applicable for calculating the radar backscattering properties of aggregate snowflakes? We approached the question by comparing the density autocorrelation functions resulting from these particle models with those computed for realistic aggregates. Under the Rayleigh-Gans approximation, it can be shown that the backscattering intensity of multiple simultaneously observed scatterers is equal to that of a single particle whose density autocorrelation function is the mass-squared weighted average of the density autocorrelation functions of the individual particles. Therefore, it is justified to define the “effective” scatterer in terms of the autocorrelation.
 Spheres or spheroids are commonly used as simplified shape models, but we observed that the average mass distribution of aggregate snowflakes is very close to the Gaussian distribution. Therefore, we also included a Gaussian mass distribution in our analysis, using the term “average-particle model” for these types of particle shape models together. The Gaussian distribution may be more realistic as a mass distribution model, but more importantly, it allowed us to construct a simple model of an aggregate as a collection of a finite number of discrete monomers. Computing the autocorrelation function, and the resulting backscattering cross section, showed that the Gaussian average-particle model emerges as a low-frequency approximation to the aggregate model. At high frequencies, the differences between the two approaches can be very large. The difference of “low” and “high” frequencies depends on the size of the snowflake, but the mass-squared weighting of density autocorrelation functions of different simultaneously observed aggregates means that large snowflakes, if present, should not be ignored even if the number of small snowflakes is much higher.
 Computation of the density autocorrelation function from our aggregate model assumptions results in a weighted sum of two terms. The first term is equal to the Gaussian average-particle model; the weight of this term is generally much larger than that of the second, but the term diminishes rapidly as the frequency increases. The second term corresponds to the monomers; its smaller weight means that it is irrelevant at low frequencies, but this Gaussian is wider in the frequency domain, and thus remains significant after the effect of the first, narrower, term vanishes.
 When we compute the autocorrelation function given by the model for each aggregate in a data set of 50 or more, and then take the mass-squared weighted average of the contributions, the directly calculated autocorrelation function is reproduced well, as is the corresponding function that gives the structure factor (the square of the absolute value of the Rayleigh-Gans form factor) as a function of frequency. As the clustering of monomers violates our assumption that their positions are independent, we introduced additional parameters to the model that correct for this effect; these are derived through numerical optimization. We also considered an optional correction that includes particle edges, with mixed results.
 This paper has built a framework that illuminates a number of features of radar backscattering from snowflakes but also raises a number of questions and goals for future study. The main weakness of the presented model is its dependence on numerically optimized parameters. The determination of these (or some equivalent parameters) from the physical properties of the particle is perhaps the most important goal of future work. The implications for retrieving the particle microphysical properties from measurements, either by multi-frequency radars or in situ instruments, should also be investigated more thoroughly.
Appendix A: Gaussian Average-Particle Model
 We denote a Gaussian PDF with mean μ and variance σ2 as N(μ,σ2). We shall use the result that the convolution of two Gaussian PDFs and is itself a Gaussian PDF, [Jaynes, 2003].
 Assume that an aggregate particle is made up of N monomers whose locations along the beam axis are distributed according to . For simplicity, we assume that the monomer positions are independent and that the mass within each monomer is distributed according to identical normal mass distributions . The PDF of the average mass density of the aggregate particles is the cross correlation of these two distributions. For symmetric mass distributions, this is equal to the convolution, which is . Thus, the average mass density PDF is
The density autocorrelation function is computed as the cross correlation of pavg(z) with itself. Again, the distribution is symmetric so we can directly write
where “ ⊗ ” denotes convolution.
Appendix B: Average Autocorrelation for Gaussian Monomer Distribution
 Here we shall calculate the average autocorrelation function for the particles described in appendix A. We shall use the theorems given above, and additionally, the fact that if two random variables are distributed according to and , their difference is distributed according to (this results directly from the rule for the convolution and the symmetry of the PDFs).
 The mass density PDF for a single particle is a normalized sum of the mass density PDFs of the monomers:
The autocorrelation for this mass density PDF is
We can compute the convolution pairwise as
where we use the rule of the convolution of Gaussian models. We can divide the sum over i and j in two parts: i = j and i ≠ j. In the former case, the monomers are the same. In the latter case, their positions are independent, and we have
Taking the average over possible locations of particles i and j, and recognizing that dij = zj − zi is the distance between two monomers, we can write
The average on the right-hand side can be computed from the normal difference distribution. As the monomer positions zi are distributed according to, the average over possible positions is
Then the resulting average autocorrelation function is
that is a weighted mixture of Gaussian. For similar applications of the RGA on stochastically shaped particles, see Muinonen [1996, 2000].
 This work was supported by the Academy of Finland (contracts 125180, 128255, and 128328). We also thank Karri Muinonen for his helpful comments.