The increasing ecological and economical importance of Populus species and hybrids has stimulated research into the investigation of the natural variation of the species and the estimation of the extent of genetic control over its wood quality traits for traditional forestry activities as well as the emerging bioenergy sector. A realized kinship matrix based on informative, high-density, biallelic single nucleotide polymorphism (SNP) genetic markers was constructed to estimate trait variance components, heritabilities, and genetic and phenotypic correlations.
Seventeen traits related to wood chemistry and ultrastructure were examined in 334 9-yr-old Populus trichocarpa grown in a common-garden plot representing populations spanning the latitudinal range 44° to 58.6°. In these individuals, 9342 SNPs that conformed to Hardy–Weinberg expectations were employed to assess the genomic pair-wise kinship to estimate narrow-sense heritabilities and genetic correlations among traits.
The range-wide phenotypic variation in all traits was substantial and several trait heritabilities were > 0.6. In total, 61 significant genetic and phenotypic correlations and a network of highly interrelated traits were identified.
The high trait variation, the evidence for moderate to high heritabilities and the identification of advantageous trait combinations of industrially important characteristics should aid in providing the foundation for the enhancement of poplar tree breeding strategies for modern industrial use.
Populus species (poplars, aspens and cottonwoods; referred to as poplar throughout) are ecologically and economically important tree species that span a substantial geographic area in temperate and boreal regions across the Northern Hemisphere (Burns & Honkala, 1990; Mansfield & Weineisen, 2007). Although gymnosperms are predominantly employed for conversion to commodity products, poplars are becoming increasingly important to the forest products industry and in the new green economy, particularly as a short-rotation species for the provision of fiber for pulp and paper production, oriented strand-board (OSB), moldings and trim, and furniture components (Semple et al., 2007). More recently, these species have emerged as prospective bioenergy crops because of their inherent rapid growth rates and favorable cell wall chemistry (Dinus, 2000). Lignocellulosic feedstocks are considered to be more sustainable than starch- and sugar-based feedstocks for bioenergy production in terms of ecological benefits, energy and carbon dioxide balance (Yuan et al., 2008). Although poplar can generate relatively high yields and significant energy output (Yuan et al., 2008; Carroll & Somerville, 2009), its innate cell wall composition requires costly pretreatment methods to access the complex cell wall carbohydrates (Hendriks & Zeeman, 2009; Mansfield, 2009). Many factors have been proposed to contribute to this natural recalcitrance of plant cell walls to deconstruction (Chang & Holtzapple, 2000; Dinus, 2000; Mosier et al., 2005; Studer et al., 2011) and, at present, several research strategies are underway to attempt to optimize and enhance saccharification efficiency (Mansfield et al., 1999, 2012; Yuan et al., 2008; Carroll & Somerville, 2009; Shen et al., 2011).
Biomass accumulation, wood composition and ultrastructural traits are known to be highly complex (Dinus, 2000; Novaes et al., 2009). It is also known that poplars show substantial variation in many important wood properties, such as lignin content and syringyl to guaiacyl (S : G) lignin monomer ratio (Wegrzyn et al., 2010; Studer et al., 2011), cellulose and hemicellulose composition (Sannigrahi et al., 2010), cellulose crystallinity (Kumar et al., 2009) and wood density (Zhang et al., 2012). Poplar tree improvement programs can capitalize on both the existing natural variation in wood traits as well as the vast genomic resources that have become available for this species in recent years (Tuskan et al., 2006; Jansson & Douglas, 2007; Ralph et al., 2008; Geraldes et al., 2011; Geraldes et al., in press; Slavov et al., 2012) to accelerate breeding strategies for improved cell wall and growth traits. Quantitative trait locus (QTL) identification and other traditional quantitative genetics approaches have been implemented to study the complex genetic architecture underlying wood formation (Pot et al., 2002; Kirst et al., 2004; Novaes et al., 2009; Thumma et al., 2010; Wegrzyn et al., 2010; Beaulieu et al., 2011). However, knowledge on the mode of genetic control (heritability) of the wood traits and their genetic relationships is essential for effective selective breeding programs. The aim of breeding programs is to maximize the genetic gain in industrially important wood traits and, concurrently, to effectively manage undesirable trait correlations whilst sustaining genetic variability for future selection and adaptation. For this, a better understanding of the inheritance and relationships among traits of interest is needed. With the exception of wood density, which is commonly studied and generally shows moderate to high heritability (Yanchuk et al., 1984; Pliura et al., 2007; El-Kassaby et al., 2011), little is known about the genetic parameters for other wood-related traits in poplar. This article provides a comprehensive assessment of wood traits considered to be key to improve the utility of Populus spp. for industrial applications.
The availability of informative molecular markers provides the opportunity to estimate variance components and genetic parameters, such as heritability and genetic merit, in wild populations (Ritland, 1996; Frentiu et al., 2008; Blonk et al., 2010). In the past, simple sequence repeats (microsatellites) have been used to construct marker-based relationship matrices (i.e. co-ancestry) to infer effective population size in conservation programs (Oliehock et al., 2006; Nomura, 2008), as well as to estimate heritability in wild populations (Van de Casteele et al., 2001; Frentiu et al., 2008; Blonk et al., 2010), with mixed results. The main drawbacks of employing microsatellite markers to estimate reliable genetic parameters is their limited number, and thus limited information, for the construction of reliable relationship matrices (Frentiu et al., 2008). Generally, hundreds of polymorphic markers are needed to precisely estimate relatedness (Visscher et al., 2008). In addition, factors such as sample size, complexity of the population structure, level of relatedness in the population, nonrandom mating and the implementation of phenotypic selection (i.e. truncation selection), individually or in concert, can affect the precision of the relatedness coefficient and, consequently, all ensuing genetic parameters, including heritability estimates (Van de Casteele et al., 2001). However, regardless of the accuracy of genetic parameters, breeding value estimates based on a molecular-based relationship matrix can still provide useful information for selection (Blonk et al., 2010). The development of high-throughput single nucleotide polymorphism (SNP) genotyping platforms promises drastic improvements in the estimates of genetic parameters generated from relationship-based methods. The constantly increasing number of available SNPs is expected to yield more accurate genetic parameters, thus increasing the expected selection efficiency of breeding programs in either wild or artificial populations with completely unknown or partially known pedigrees.
Furthermore, once a reasonable number of SNPs are available, the resulting, more accurate, genomic relationship matrices can, in turn, substitute the conventional pedigree matrices in best linear unbiased prediction (BLUP) analyses. Hence, the genetic parameters, such as additive genetic variance, heritability and genetic correlations, can be estimated with greater accuracy. This is known as the genomic best linear unbiased prediction (GBLUP) estimation method (VanRaden, 2008; Nadaf & Pong-Wong, 2011). However, it should be stated that the efficiency of selection programs that are focused on multiple traits is not only dependent on the extent of genetic control of these traits individually (i.e. heritability estimates), but also on the genetic correlations among them (Falconer & Mackay, 1996), which are driven by the relative fitness of the traits (Lande & Arnold, 1983). This study employs a dense SNP marker array to genotype 9342 SNPs to construct a genomic relationship matrix among 334 P. trichocarpa individuals with unknown genealogy, and to simultaneously estimate the narrow-sense heritabilities and genetic correlations among 17 cell wall chemistry and ultrastructural attributes that are important to the use of poplar as an industrial feedstock. Our results clearly demonstrate the substantial variation in these diverse traits in natural populations, and provide the underpinnings for breeding strategies to capitalize on genetic and phenotypic correlations among traits.
Materials and Methods
Black cottonwood population and wood core sampling
Complete details of the studied black cottonwood (Populus trichocarpa Torr. & Gray) population are described in Xie et al. (2009). In 2008, a 10-mm increment core was taken at breast height from each of 334 9-yr-old trees (originating from 105 subpopulations) in a north–south direction. The northern half of the core was used for ultrastructure determinations, as well as for wood chemistry, whereas the southern half of the core was used for fiber quality assessments. The geographical distribution of the subpopulations spanned 15° in latitude (range, 44.0–58.6°).
Phenotypic trait data
Wood chemical composition
Holocellulose, alpha cellulose and hemicellulose amounts were determined according to Browning (1967), with minor modifications. In brief, the woody material was ground using a Wiley mill to pass a 40-mesh screen, and then extracted overnight in hot acetone and allowed to dry overnight at 50°C. Aliquots of 3.5 ml of solution A (60 ml glacial acetic acid + 1.3 g l−1 NaOH) and 1.5 ml of 20% sodium chlorite solution (20 g NaClO2 in 80 ml distilled water) were added to exact amounts (150 mg) of extract-free wood meal to initiate the chlorite delignification. The reaction tube was tightly sealed and then gently shaken at 50°C for 16 h. The reaction was quenched by placing the tubes in an ice bath, and the reaction solution was then thoroughly removed by pipetting whilst not disturbing the settled reacted wood meal. This procedure was repeated. Finally, the reacted wood meal was transferred to a preweighed coarse sintered crucible, and washed twice with 50 ml of 1% glacial acetic acid (under vacuum suction), followed by a wash with 10 ml of acetone under applied vacuum, and then dried at 50°C overnight to obtain the holocellulose yield. To obtain the alpha cellulose content of the woody material, alkaline extractions using two different sodium hydroxide extractions were performed sequentially to remove the hemicelluloses. Exact weights of 100 mg of holocellulose were transferred to a small beaker and left at room temperature for 30 min to allow moisture equilibration. To this, 8 ml of 17.5% NaOH (from sodium hydroxide 50% w/w) was added and the material was left to react for 30 min. Then, 8 ml of distilled water were added and the material was stirred for 1 min, and left to react for 29 min. The reaction solution was carefully removed, and the process was repeated with fresh reactants. After the second reaction, all retentate was filtered through a preweighed coarse sintered crucible by washing with distilled water (3 × 50 ml). Subsequently, the reaction was neutralized by soaking in 1.0 M acetic acid for 5 min. After washing with distilled water (3 × 50 ml), the material was dried at 50°C overnight to obtain the alpha cellulose content. Subtracting the alpha cellulose fraction recovered from the holocellulose provided the hemicellulose yield.
Cell wall carbohydrates and total lignin were determined according to Huntley et al. (2003), with the concentrations of neutral cell wall-associated carbohydrates (glucose, xylose, mannose, galactose, rhamnose and arabinose) being quantified by high-performance anion exchange liquid chromatography using a Dionex (DX-600; Sunnyvale, CA, USA) high-performance liquid chromatography system equipped with a PA1 (Dionex) column, a pulsed amperometric detector with a gold electrode and a SpectraAS3500 auto injector (Spectra-Physics, Santa Clara, CA, USA). The column was equilibrated with 250 mM NaOH and eluted with deionized water at a flow rate of 1 ml min−1. The total lignin content was calculated as the sum of the Klason lignin and soluble lignin fractions. Lignin and cell wall sugar contents were determined as a percentage of dry wood, based on the initial weight of the wood sample analyzed.
The lignin monomer composition (S : G ratio) was determined according to Robinson & Mansfield (2009), and was analyzed by gas chromatography on a Hewlett Packard 5890 series II instrument (Agilent Technologies, Santa Clara, CA, USA), equipped with an autosampler, splitless injector, flame ionizing detector and a 30-m 5% diphenyl–95% dimethyl polysiloxane-coated RTX-5MS capillary column (inner diameter, 0.25 mm).
Wood physical properties
Fiber length was analyzed using the woody material isolated from the southern half of the increment core. In brief, each sample was macerated in a solution consisting of 30% hydrogen peroxide and glacial acetic acid in a 1 : 1 ratio (Franklin solution) at 70°C for 48 h. Following the reaction, the remaining fibrous material was washed with deionized water until the samples had been neutralized. Subsamples were resuspended in 10 ml of deionized, distilled water and analyzed on a fiber quality analyzer (FQA; Optest Equipment, Inc., Hawkesbury, Ontario, Canada). The fiber length was recorded in millimeters.
Wood density was analyzed using the northern half of the increment core. A 1.67-mm-thick section was precision cut from each increment core using a custom-built twin-blade pneumatic saw. After extraction with acetone, the wood sections were acclimated to 7% moisture content before density analysis. The samples were then scanned from pith to the bark by X-ray densitometry (QTRS-01X; Quintek Measurement Systems Inc., Knoxville, TN, USA). Wood density was obtained by averaging the density measures across all growth rings. Ten randomly selected P. trichocarpa samples from the same site, that were not part of the study, were sectioned as described above, their density determined directly by accurately recording the weight over volume and then scanned by X-ray densitometry to establish an extinction coefficient empirically, which was used to estimate the wood density for all samples from the population.
The microfibril angle (MFA) was measured on two growth rings, the first growth ring adjacent to the pith and the most recent growth ring immediately adjacent to the bark, on the precision-cut samples employed for density determination. MFA was determined using a Bruker D8 Discover (Bruker AXS Inc., Madison, WI, USA) wide-angle X-ray diffraction instrument equipped with an area detector (GADDS) to collect the diffraction patterns of the 002 crystal planes of cellulose. These strong 002 diffraction arcs of the cellulose I reflections contain information on all the orientations of microfibrils in the wood sample. The measurements were collected in transmission mode using a CuKα radiation source emitting at a wavelength of 0.154 nm, with X-ray optics θ1 = θ2 = 0° (θ1 = source, θ2 = detector). The diffraction intensities were collected at Bragg angles 2θ = 20–24°. The X-ray diffraction profile was integrated at Chi between −180° and +180°. We used the poplar-specific standard curve (MFA = 0.9583T − 3.8364, R2 = 0.9686) that related the empirically assessed MFA (as described in Ukrainetz et al., 2008) to that quantified by the X-ray diffraction pattern. The mean T value of the two 002 diffraction peaks was used in the equation to determine the MFA.
The specifications of the crystallinity measurements using X-ray diffraction were the same as for MFA measurements, except that θ1 = 17°. Diffraction intensities were collected at Bragg angles 2θ = 4–40°. The X-ray diffraction profile was integrated at Chi between −180° and 0° using GADDS software (Bruker AXS Inc., Madison, WI, USA). The data were normalized and resolved using the crystallinity calculation method of Vonk (1973). First, the background diffraction signal was subtracted, and then the amorphous curve was fitted to the diffraction pattern; finally, linear regression analysis was conducted to obtain the cell wall crystallinity.
The SNP genotyping of 334 individuals from the P. trichocarpa population was carried out using an Illumina Infinium® genotyping array with a set of 34 131 SNPs in 3543 candidate genes, as described by Geraldes et al. (in press). We eliminated SNPs within candidate genes with a minor allele frequency below 0.05, > 10% missing values and an Illumina GeneTrain score below 0.5. These three selection criteria reduced the number of SNPs to 29 233 for the 334 trees genotyped, 9342 of which were used to estimate the genomic pair-wise kinship. These 9342 SNPs were also selected because of their conformity to Hardy–Weinberg expectations. The genotypes of the 334 individuals for 9342 SNPs on the genotyping array are given in Supporting Information Table S1. Details regarding each locus and the genotyping methods can be found in Geraldes et al. (in press).
Estimation of heritabilities and genetic correlations between wood traits
The analysis of variance and trait heritability estimation followed the animal model in Henderson (1984) following:
(y, vector of observations; X and Z, incidence matrices relating fixed effects in vector β and random effects in vector u (breeding values) assuming and ; e, vector of residual effects assuming ). K is the kinship matrix computed on the basis of marker information following Loiselle et al. (1995), which substituted the commonly used average numerator relationship matrix A based on pedigree information (Henderson, 1976) after substituting the negative values with zeros (Yu et al., 2006).
Genetic correlation between pairs of traits was estimated as:
where covAB, and are the genetic covariance between traits and the additive genetic variance of both traits estimated in the bivariate model (Falconer & Mackay, 1996). Assuming as the response vector for the individual i and that the bivariate model is performed as follows:
where , the covariate matrix of measurements errors is defined by R = ∑ ⊗ I, where and symbol ⊗ represents the Kronecker product. The covariance matrix of random effects is , where K is the kinship matrix. With the assumption of mutual independence between u and e, . Analyses were performed in ASReml (Gilmour et al., 2002). The test of relationship between matrices of genetic and phenotypic correlations was performed using the ‘mantel.test’ function in R package ‘ncf’ computing (Bjornstad, 2009).
Variation in wood chemistry and ultrastructural traits
To measure trait variation in P. trichocarpa, we quantified wood cell wall traits in an unrelated population of 334 individuals (Fig. 1) from 105 subpopulations, grown in a common garden for 9 yr, with provenances spanning 15° latitude. The range of variation in 17 cell wall traits was considerable, and provides substantial opportunity for selection (Table 1). For example, the alpha cellulose and hemicellulose contents ranged between 35% and 49% and 22% and 34%, respectively, whereas total lignin ranged from 15% to 26%, and the molar composition of syringyl lignin monomers ranged from 63% to 79%. Similarly, the glucose content ranged from 41% to 62%, whereas the xylose content ranged from 14% to 24%. The mean average wood density in the population, an important measure for the expected yields, was 407 kg m−3. However, across the population, the density ranged between 320 and 528 kg m−3 (Table 1). Significant correlations were observed between subpopulation means for MFA, % galactose content in the cell wall and density with latitudinal gradients (r =0.34, P =0.0004; r =−0.27, P =0.0063; and r =0.20, P =0.0401, respectively). Wood samples with lower MFA and higher cell wall galactose content tended to originate from the northern region of the population range. More southern individuals, however, tended to display a higher wood density than individuals originating from the north. No other traits showed significant trends with latitude.
Table 1. The mean, standard deviation (SD), minimum (MIN) and maximum (MAX) values for 12 xylem chemistry and five ultrastructural traits in 9-yr-old Populus trichocarpa grown in a common-garden plot
Composite trait (total lignin = soluble + insoluble lignin; holocellulose = alpha cellulose + hemicelluloses).MFA bark, microfibril angle at most recent growth ring; MFA pith, microfibril angle at first growth ring; n, sample size; %, dry wood-based percentage of wood components.
With the large number of SNPs (9342) used to generate the marker-based relationship matrix ('Materials and Methods') and the sample size (n = 334) studied, we were able to estimate significant narrow-sense heritability (h2) for 16 of the 17 traits examined (Table 2). The significance of h2 estimates was determined by the magnitude of their standard errors (SEs). In addition, no sex effect was considered, as it was assumed that this effect did not appreciably contribute to the variance in the traits under study (as the trees were only 9 yr of age, not all individuals were sexually reproductive and, hence, their genders were unknown; Q. Cronk, University of British Columbia, pers. comm.). With the exception of the holocellulose (combination of alpha cellulose and hemicellulose) content, which showed a low h2 (0.11; SE = 0.135), the remaining h2 estimates (Table 2) ranged from 0.24 (fiber length) to 0.97 (soluble lignin). High h2 estimates (> 0.6) were detected for many of the cell wall chemistry traits, including arabinose, galactose, xylose, insoluble lignin and total lignin, whereas the wood average density showed the highest h2 estimate among the ultrastructural traits (0.56).
Table 2. Estimated narrow-sense heritability (h2) with standard error (SE) for 12 xylem chemistry and five ultrastructural traits in 9-yr-old Populus trichocarpa grown in a common-garden plot
Composite trait (total lignin = soluble + insoluble lignin; holocellulose = alpha cellulose + hemicelluloses). MFA bark, microfibril angle at recent growth ring; MFA pith, microfibril angle at first growth ring; %, dry wood-based percentage of wood components.
We next determined the extent of trait–trait genetic (rg) and phenotypic (rp) correlations. These analyses showed that 78 (57%) and 61 (45%) of 136 possible phenotypic or genetic trait pairs, respectively, were significant (Tables 3–6). We identified 50 trait pairs for which both phenotypic and genetic correlations were significant; in addition, 11 trait pairs showed significant genetic correlations, but no significant phenotypic correlations, and 28 showed significant phenotypic correlations, but no significant genetic correlations (Tables 3–6). Matrix similarities between genetic and phenotypic correlations were determined using the Mantel test with 1000 replications to uncover the patterns of correlation for 12 wood chemistry (r =0.82) and five ultrastructural (r =0.80) traits, and all tests were significant, indicating similar magnitude and trajectory between genetic and phenotypic correlations. To compare the correlation pattern between wood chemistry and ultrastructural traits, we conducted an overall Mantel test for the entire correlation matrix (i.e. 17 traits), which included both phenotypic and genetic correlations, and obtained a similar correlation pattern (r =0.76). This was required as the wood chemistry and ultrastructural trait matrices are not symmetrical; hence, the overall matrix test was performed.
Table 3. Phenotypic (below diagonal) and genetic (above diagonal) correlations (rp and rg, respectively) for xylem chemistry traits
We then used network analysis (Waitt & Levin, 1998) to visually represent the structures of phenotypic and genetic correlations (Fig. 2a,b). Strong and positive phenotypic and genetic correlations existed between clusters of lignin chemistry traits (insoluble lignin, soluble lignin, total lignin) and hemicellulose traits (xylose, mannose, hemicellulose) (cluster I, Fig. 2). A cluster of positive correlations involving glucose, density, alpha cellulose and holocellulose contents was also observed (cluster II), and the genetic correlations for this cluster (Fig. 2b) were much stronger than their respective phenotypic correlations (Fig. 2a).
Although glucose and xylose traits were (weakly) positively correlated, both phenotypically and genetically (rp = 0.119, rg = 0.295), alpha cellulose and density were negatively correlated with xylose, and total and insoluble lignin traits (Fig. 2a,b). By contrast, strong positive genetic correlations were found between alpha cellulose and wood density. The strongest correlations observed among wood chemistry traits were the negative correlation between alpha cellulose and hemicellulose content (rg = −0.809; rp = −0.374) and the positive correlation between xylose and mannose content (rg = 0.699; rp = 0.484). Strong genetic correlations were also found between MFA (most recent growth ring) and holocellulose (negative; rg = −0.532), between MFA (both growth rings) and arabinose (positive; rg = 0.561 and 0.671), and between cellulose crystallinity and fiber length (positive; rg = 0.588). In addition, cellulose crystallinity showed strong positive correlations with insoluble and total lignin (both phenotypically and genetically; Fig. 2a,b). MFA (most recent growth ring) again showed a positive genetic correlation with density (rg = 0.260), and a negative genetic correlation with glucose (rg = −0.479), whereas density and glucose were positively correlated (rg = 0.527; Fig. 2b), suggesting a trade-off between increased MFA and glucose content.
To our knowledge, this is the first forest tree study to comprehensively analyze hundreds of natural accessions of Populus spp. (poplar) for xylem chemistry and ultrastructural traits simultaneously. Previous studies evaluating wood trait variation in large poplar populations have employed other analytical techniques to assess some of the wood chemistry traits evaluated in this study. These include near-infrared reflectance (NIR) and pyrolysis molecular beam mass spectrometry (PyrMBMS) (Wegrzyn et al., 2010; Studer et al., 2011). Although such techniques have the advantage of being rapid and amenable to high-throughput analysis, a drawback of both is that they provide only a limited number of trait estimates, and are based on training sets derived from wet chemistry analyses similar to the techniques used in the current study. For example, PyrMBMS requires precalibrations using wet chemistry and quantifies wood chemistry traits indirectly, and, although rapid and accurate, does not disclose the make-up of five- and six-carbon sugars, and only offers total amounts of each (e.g. glucose derived from hemicellulose and alpha cellulose). Although commonly applied, neither offers comprehensive insights into the chemistry of cell wall traits (Novaes et al., 2009). By contrast, we applied the direct measurements of wood chemistry traits using wet laboratory approaches and direct measures of ultrastructure. Although laborious, these approaches provide the most direct and accurate phenotypic estimates of quantitative wood chemistry and ultrastructural traits and trait variation, making them highly amenable to the study of wood phenotype–genotype correlations in the P. trichocarpa populations studied.
Wood chemistry and ultrastructural traits show considerable variability
Wood quality is defined by the intrinsic morphological and chemical components of wood and, as such, plays a critical role in determining the overall economic value of a woody stem. In addition to growth rate, properties such as wood density, fiber length, MFA and cell wall chemistry are all industrially important traits, and ultimately can be used to select superior clonal material. However, trait selection is ultimately dependent on the end use of woody biomass. In this study, we present a comprehensive assessment of 334 P. trichocarpa genotypes spanning much of the native range of the species and, based on extensive phenotypic and genotypic diversity, conclusively demonstrate the potential for the selection of superior genotypes for targeted breeding for both bioenergy applications and traditional industrial uses.
Traditionally important wood quality characteristics showed substantial variation, with wood density ranging from 320 to 528 kg m−3 and MFA from 10.4 to 25.5°. Wood density has a considerable influence on the strength, machinability and rates of conversion of solid wood products, and affects both the yield and fiber properties of the pulp produced (Mansfield et al., 2007). Moreover, wood density influences significantly the overall biomass yield of trees (when growth parameters, such as volume, remain constant), which can affect the bioenergy applications of the total harvested material from a given stand. The S2 MFA has been shown to influence the longitudinal and transverse shrinkage in sawn lumber during drying operations, resulting in a substantially greater proportion of defects (when the angle is larger). In addition, MFA is also an important determinant of the overall wood strength and elasticity (Mansfield et al., 2007, 2011). For example, in both Pinus taeda (Cramer et al., 2005) and Eucalyptus delegatensis (Evans & Ilic, 2001), MFA has been shown to explain a large portion of the variation of the longitudinal modulus of elasticity.
Fiber length also varied dramatically, from 0.44 to 1.17 mm. It is well recognized that paper properties are highly correlated with inherent fiber morphology, such as fiber length, fiber diameter and wall thickness (coarseness), with fiber length contributing the greatest influence (Mansfield & Weineisen, 2007). Hardwood fibers generally impart good optical properties and act as a filler at a small percentage to softwood-derived carrier, or reinforcing fibers, in the manufacture of fine paper. It has been shown that paper strength is generally enhanced by longer fibers, and the importance of long fibers is predominant in sheets with low bonding strength, such as those made of only slightly beaten pulp or wet sheets (Mansfield et al., 2004). Variations in wood chemistry also impart significant effects on overall pulp yield and quality, as well as the requisite energy and chemicals required to attain target residual lignin (H-factor) content before bleaching during chemical pulping (Stewart et al., 2006).
We also found substantial variation in total lignin content (14.7–25.7%) and mol.% syringyl monomers (62.8–79.2), both of which have been shown to influence the pulping efficiency of poplar (Huntley et al., 2003; Stewart et al., 2006). As shown previously (Stewart et al., 2009), poplar lignin with elevated syringyl content is less condensed, has more β-O-4 bonds per aromatic subunit and has a lignin fraction with a lower average molecular mass. Wood with such lignin generates pulp with higher yield and lower kappa values (residual lignin). Similarly, wood traits that are of particular interest for chemical modification during pulping operations are also of interest in the production of biofuels from lignocellulosic feedstocks, particularly those that aim to use biochemical unit operations (i.e. enzymatic hydrolysis following a pretreatment). For example, biomass pretreatment efficiency is influenced by total lignin content, S : G ratio and hemicellulose content and distribution, whereas carbohydrate saccharification efficacy and yield are affected by wood density, % alpha cellulose, % glucose and cell wall crystallinity (Wegrzyn et al., 2010; Studer et al., 2011; Mansfield et al., 2012). Thus, the variation in these lignin traits provides a potential opportunity for genetic improvement of poplar for both traditional and bioenergy uses.
The observed level of variability in wood traits in the poplar populations studied (Table 1) mirrors that reported in previous studies of poplar (Kumar et al., 2009; Sannigrahi et al., 2010; Wegrzyn et al., 2010; Studer et al., 2011). The 334 individuals in the P. trichocarpa collection have provenances spanning almost 15° latitude (Xie et al., 2009). Unlike one study of Eucalyptus globulus accessions collected over a geographical range of 5° latitude (Stackpole et al., 2011), we observed no significant correlation between the molar composition of syringyl lignin monomers, and thus the S : G monolignol ratio, and the geographical distribution of the subpopulation provenances from which the wood was sampled, with respect to latitude. However, both lignin content and syringyl subunit composition are important for the adaptation of trees to the environment, because lignin performs important roles in stem integrity (strength), hydraulic conductivity and defenses against biotic challenges (Stackpole et al., 2011). Thus, these properties may represent important fitness traits. We identified a significant trend for average wood density with latitude, as reported in the same Eucalyptus study (Stackpole et al., 2011). Wood density, however, was shown to be of less importance for a tree's fitness (Pliura et al., 2007). Density also has the advantage of high repeatability in clonal trials; hence, high genetic gains can be relatively easily achieved in breeding and selection for this trait. Interestingly, both MFA and % cell wall galactose showed a clear geographical cline. Based on the high correlations on an individual tree basis (r = 0.5) of the monomeric sugars galactose, arabinose and rhamnose, we assumed that the determined galactose fraction in the cell wall originated from arabinogalactan (AG), arabinogalactan proteins (AGPs) and rhamnogalacturonan I (RGI, pectin), but not from galactomannan (hemicellulose). Recently, it has been shown that a noncellulosic polysaccharide matrix, consisting of hydrophilic AG, AGP and RGI, is present within the G-layer of tension wood (reviewed in Mellerowicz & Gorshkova, 2012). It is likely that these polymers contribute to the higher moisture retention associated with these cells. However, when the tension wood is dried and the water is removed rapidly, the resultant wood displays a greater level of shrinkage and causes substantial deformation of the cellulosic microfibrils, imparting negative effects on wood quality (Bowling & Vaughn, 2008). In addition, the MFA of the G-layer of tension wood is characteristically low, often approaching 0° in extreme cases (Mellerowicz & Gorshkova, 2012). Consistent with these features, the current findings clearly demonstrate a high negative genetic correlation between MFA and galactose content (Table 6). Furthermore, there is a strong negative correlation between MFA and alpha cellulose (Table 6), and a strong positive correlation between alpha cellulose and galactose (Table 3). Thus, our results suggest that individuals from the north, which have lower MFAs and higher cell wall galactose content, may contain a higher proportion of tension wood, which is also characterized by higher cellulose crystallinity (Mellerowicz & Gorshkova, 2012). In general, northern trees display faster growth rates, but have a shorter growing season (Soolanayakanahally et al., 2009), and deposit tension-like wood with more AGP, more alpha cellulose and lower MFA. By contrast, southern trees show slower growth rates, but have a longer growing season, resulting in the deposition of a greater proportion of latewood-type fibers known to have higher density, which is a function of the smaller lumen diameters, as growth slows and the demand for water dissipates.
Wood chemistry and ultrastructural traits show moderate to high heritability
The use of the ‘animal model’ has been widely applied in evolutionary genetics studies to obtain reliable estimates for the additive genetic effects in natural populations with a mixture of relationships between individuals (Kruuk, 2004). Estimates of the derived strict-sense heritabilities provide accurate assessments of the proportion of the total phenotypic variance explained by the underlying genetic factors, as this genetic variance is mainly additive for a quantitative trait (Hill et al., 2008). Trait heritability with unknown pedigrees can be inferred by inferring relatedness between pairs of individuals and correlating this genetic information with information on their phenotypic similarities. The (assumed) relationships between individuals dictate the amount of genetic markers needed to reconstruct the relatedness matrix (Visscher et al., 2008). In our case, individuals were largely unrelated (Fig. 1); thus, a large number of polymorphic molecular markers had to be employed to infer the relationships with higher precision. This is feasible with high-throughput genotyping techniques, which were readily available for our study. This approach has been successfully applied in other evolutionary studies, and also in plants (Ritland & Ritland, 1996), to estimate heritability ‘in the wild’.
A knowledge of heritability is important and central to the direct prediction of the response of a quantitative trait to selection, its ‘evolvability’ (Houle, 1992). Changes in allelic frequencies and environmental conditions affect the heritable variation (Hoffmann & Merilä, 1999), and thus heritabilities are not necessarily constant estimates, but are environment specific (i.e. dependent on population sampling) and can even vary over time (Visscher et al., 2008). If selection occurs on multiple traits, the genetic correlations between these traits also need to be known and taken into account. An understanding of the extent of genetic control over a particular trait (i.e. its heritable component) is also invaluable for studies that depend on strong phenotype–genotype correlations to identify individual loci with a significant effect on the trait or aim at the elucidation of the genetic architecture underlying the trait, as in association genetic studies; for example, when a larger percentage of a trait's variance is under genetic control (i.e. high heritability), an SNP marker-associated polymorphism is expected to be more meaningful than an equivalent association with a trait of lower heritability, even if the observed association expresses the same percentage of the trait's variance. Thus, our study represents an excellent resource for P. trichocarpa tree improvement strategies and facilitates the prediction of an advantageous phenotype in individuals through SNP markers.
With the exception of holocellulose, for which narrow-sense heritability was low (h2 < SE), all other wood traits investigated were under moderate or high genetic control (Table 2). Moderate to high heritabilities in traits suggest that satisfactory genetic gains could be obtained in these traits and that there exists a prospective potential for genetic tree improvement. The reasons for low h2 could be higher contributions of nonadditive genetic variance and substantial genome by environment (G × E) interactions. Traits that are highly responsive to environmental variation, such as life history traits, are well known to have low h2, because the genetic variance in total fitness is low (Price & Schluter, 1991). Specifically, fitness-related traits have greater contributions of dominance and epistatic variance (Hoffmann & Merilä, 1999), and fitness traits have confirmed low heritability with high residual variability (Houle, 1992). By contrast, the amount of genetic variation for many phenotypes related to fitness can be substantial (Visscher et al., 2008). This suggests higher adaptability of the population for increasing population fitness, as genetic variation is related to environmental variation (Dobzhansky, 1951). However, the response of every trait to directional selection depends on both estimates: the heritability and the genetic variation. We found that c. 60% of the xylem chemistry traits showed high heritability (h2 > 0.5). By contrast, only one of five measured ultrastructural traits had h2 > 0.5 (density), but moderate levels of narrow-sense heritability were observed for all other ultrastructural traits. Previous studies on E. globulus found heritability estimates for density, cellulose and lignin that were comparable with our P. trichocarpa results (Stackpole et al., 2011). As with our results, very high heritability for acid-soluble lignin was reported previously in E. globulus (Poke et al., 2006). These findings suggest that heritabilities, at least for certain wood traits, allow meaningful comparisons across hardwood species.
Different patterns of correlations among wood traits in P. trichocarpa
Genetic correlations between traits can arise from pleiotropic effects or from strong linkage disequilibrium of causative genes. Generally, genetic correlations are expected to be higher than phenotypic correlations because G × E interactions can be prevalent in the latter. However, low trait heritabilities can manifest extreme genetic correlations relative to their phenotypic counterparts (Cheverud, 1988). The average matrix correlations between genetic and phenotypic correlations observed here were similar to those reported previously for plants (Waitt & Levin, 1998). Comparisons between chemical and ultrastructural traits, as opposed to comparisons within chemical and ultrastructural traits, indicated the presence of stronger correlations within each trait set, as expected, as they tend to be more biologically related. Similarly, a lack of strong correlations between certain traits could be an indication of traits that are functionally or developmentally less related, and are therefore genetically and phenotypically less integrated, but functionally or developmentally less related traits can yield higher degrees of phenotypic correlation (Waitt & Levin, 1998). In any case, information on genetic correlations is invaluable for understanding the correlated response of traits to selection, that is trait co-evolution (Cheverud, 1988). From a practical perspective, this knowledge can be applied to indirectly select for and predict genetic gain for an advantageous phenotype that is difficult or costly to assess otherwise. It can also be applied to select for multiple traits simultaneously. However, the influence of environment on the expression of genetic covariation has also been highlighted (Sgro & Hoffmann, 2004), and needs to be considered when breeding strategies are developed.
Advantageous trait combinations
Genetic correlations provide insights into the evolutionary constraints of a species, and are best studied in the natural environment of the species (Lynch, 1999). However, tree improvement trials are widely established as common gardens to facilitate the testing of genotypes under comparable environmental conditions. Thus, it is possible to model the environmental effects and distinguish these from the true genetic differences between the individuals. Here, we studied phenotypic traits important for cell wall characteristics in a P. trichocarpa common garden. To identify potential advantageous traits that may be combined in single individuals, we focus the following discussion on correlations that involve trait pairs for which both genetic and phenotypic correlations were significant.
We identified two important trait clusters, each involving multiple traits: cluster I, a set of hemicellulose- and lignin-related traits; and cluster II, consisting of cellulose and density traits (Fig. 2, Tables 3, 6). Genetic correlations within each of these clusters were strong and positive (Fig. 3a), but many genetic correlations between traits in the two clusters were negative, particularly between alpha cellulose and wood density (cluster I) and lignin and hemicellulose traits (cluster II; Fig. 3b).
These correlations are important considerations when selecting for desired phenotypic combinations in breeding (e.g. for improved biofuel traits). In this case, an important target for genetic improvement is the recalcitrance of plant cell walls to deconstruction. The natural resistance of wood cell walls to biological conversion (enzymatic hydrolysis) is attributed to the heterogeneity of the biomass components and the occlusion of cellulose microfibrils by lignin and hemicelluloses, and the resulting inaccessibility of hydrolytic enzymes to the cellulose surface area (Mansfield et al., 1999; Mosier et al., 2005). Although xylan and lignin can alternatively serve as resources for biomaterials, increased bioethanol yield generally requires feedstocks with lowered lignin and xylan content, fewer or modified hemicellulose–lignin interactions and above average syringyl lignin fraction (S : G ≥ 2.0) (Studer et al., 2011; Mansfield et al., 2012). Thus, the identification of a concerted negative genetic correlation of lignin and hemicellulose content with density and alpha cellulose content (Fig. 3, Tables 3, 6) appears to be highly advantageous for breeding strategies directed at optimizing poplar as a lignocellulosic feedstock for bioethanol production. The data suggest that breeding for lower lignin or hemicellulose content could simultaneously select for higher density and alpha cellulose content, and vice versa.
We also found that lignin with a higher mol.% syringyl subunits is negatively correlated with the amount of insoluble and total lignin in the cell wall. These findings are consistent with those observed in Eucalyptus (Stackpole et al., 2011), and suggest that simultaneous breeding for high mol.% syringyl lignin and low total lignin is feasible.
The properties of cellulose are also important factors to be considered in feedstock development (Mosier et al., 2005). Extensive intra- and intermolecular hydrogen bonds promote the formation of crystalline structures. The intrinsic cellulosic ultrastructure has long been recognized as an important parameter in the efficiency of enzymatic hydrolysis (Mansfield et al., 1999; Chang & Holtzapple, 2000). We found that low cell wall crystallinity, which is advantageous for biofuel, chemical cellulose and chemical pulping applications, is correlated with lower lignin, higher mol.% syringyl lignin and higher density, suggesting that it should be possible to simultaneously select for these traits in breeding for improved feedstocks. Alternatively, MFA, which is the primary angle of orientation of cellulose in the secondary wall relative to the vertical axis of elongating cells, is an important solid wood property. We found that lower MFA, a desirable trait for solid wood applications given its association with wood stiffness and shrinkage, is correlated with a lower xylose content (Fig. 2a). Therefore, individuals selected on the basis of lower MFA would be predicted to produce offspring with higher glucose/alpha cellulose content and lower xylose content (Table 6). Two advantageous solid wood and biochemical processing traits for both pulping and bioenergy applications are wood density and alpha cellulose content. Given the strong positive genetic correlation between these two traits (Table 6), and the existence of accessions within the population sampled with high values for both traits (Fig. 4), it appears to be possible to simultaneously select for both traits. Our results provide evidence that natural selection in P. trichocarpa already acts in favor of certain desirable industrial trait combinations, and would facilitate the optimization of the feedstock in this species through selective breeding.
Our results also encourage further in-depth studies of trait co-evolution in long-lived perennials. For example, the presence of strong genetic correlations between certain wood traits emphasizes the presence of a common set of genes at the center of these correlations. The identified directionality of these trait correlations would suggest either synergistically or antagonistically pleiotropic gene effects on these traits, dependent on whether the two traits are positively or negatively correlated. Thus, it would be worthwhile to uncover such genetic pleiotropy and identify genes that are significantly correlated with both traits. Such an approach might involve the study of the genetics of genome-wide gene expression to collocate QTLs for gene expression and the quantitative traits under study (Porth et al., 2012). Key to such an undertaking is a knowledge of the extent of interrelation between/among traits of interest, and our network analysis of genetic correlations among traits provides important insights into potential genetic pleiotropy for a comprehensive set of wood traits.
Here, we demonstrate the feasibility of using informative, high-density, biallelic SNP genetic markers to assess the extent of genetic variance and covariance among important wood traits in a natural population of P. trichocarpa. We found that the use of a realized kinship matrix in the absence of a structured pedigree was advantageous, an approach that presents a new avenue for the estimation of trait variance components and genetic correlations in natural populations. Our approach to obtain a reliable insight into the network of correlated quantitative attributes in a wild population on the basis of genomic relatedness using high-density marker information represents a novel methodology to the study of trait correlations in plants. Our results provide information on the extensive variation in a comprehensive set of wood cell wall chemistry and ultrastructural characteristics, which are key traits for biofuel applications as well as traditional forest product activities. Our study showed that these industrially important wood traits exhibit significant range-wide phenotypic variation in the species. Furthermore, these findings indicate that there exists substantial opportunity for the use of breeding and selection to capture multiple advantageous trait combinations of interest from natural populations into improved progeny. The respective heritability values for the traits assessed in this study, and data on phenotypic and genetic trait correlations, should help to inform poplar breeding programs that aim to optimize Populus lignocellulosic feedstock for solid wood and fiber-based products, and for chemical processing applications, such as bioenergy.
This work was supported by the Genome British Columbia Applied Genomics Innovation Program (Project 103BIO). The authors acknowledge Shofiul Azam and Limin Lao for sample collection, and Eryang Li, Miki Fujita and Teaghan Mayers for technical support. We also acknowledge Lee Gunter and Ann Wymore, Oak Ridge National Laboratory (Oak Ridge TN, USA), for collaboration on the development of the SNP genotyping array and for performing hybridizations to the SNP genotyping array. The SNP array data were generated with support from the BioEnergy Science Center, a US Department of Energy (DOE) Bioenergy Research Center (Office of Biological and Environmental Research in the DOE Office of Science) under Contract No. DE-AC02-05CH11231.