• Open Access

A low molecular weight proteome comparison of fertile and male sterile 8 anthers of Zea mays

Authors


(Tel 650 862 1728; fax: 650 725 8221; email wangdx81@gmail.com)

Summary

During maize anther development, somatic locular cells differentiate to support meiosis in the pollen mother cells. Meiosis is an important event during anther growth and is essential for plant fertility as pollen contains the haploid sperm. A subset of maize male sterile mutants exhibit meiotic failure, including ms8 (male sterile 8) in which meiocytes arrest as dyads and the locular somatic cells exhibit multiple defects. Systematic proteomic profiles were analysed in biological triplicates plus technical triplicates comparing ms8 anthers with fertile sibling samples at both the premeiotic and meiotic stages; proteins from 3.5 to 20 kDa were fractionated by 1-D PAGE, cleaved with Lys-C and then sequenced using a LTQ Orbitrap Velos MS paradigm. Three hundred and 59proteins were identified with two or more assigned peptides in which each of those peptides were counted at least two or more times (0.4% peptide false discovery rate (FDR) and 0.2% protein FDR); 2761 proteins were identified with one or more assigned peptides (0.4% peptide FDR and 7.6% protein FDR). Stage-specific protein expression provides candidate stage markers for early anther development, and proteins specifically expressed in fertile compared to sterile anthers provide important clues about the regulation of meiosis. 49% of the proteins detected by this study are new to an independent whole anther proteome, and many small proteins missed by automated maize genome annotation were validated; these outcomes indicate the value of focusing on low molecular weight proteins. The roles of distinctive expressed proteins and methods for mass spectrometry of low molecular weight proteins are discussed.

Introduction

Maize is a major crop worldwide used for both direct human consumption and as animal feed. Hybrid seed is produced by cross-pollinating two lines, a methodology that is tedious by hand and greatly facilitated by using one male sterile parent (Laughnan and Gabay-Laughnan, 1983). Consequently, understanding male fertility has been a major research focus for maize flower development as a strategy to improve grain yield (Duvick, 2001). Maize male reproductive development requires approximately 30 days from differentiation of stamen primordia through pollen shed at anthesis; the complex developmental programme is highly regular permitting accurate staging of key developmental events based on anther length (Kelliher and Walbot, 2011). The stamen is a compound organ with a slender filament connecting the anther to the vegetative plant. At the start of meiosis, the maize anther has four lobes each containing five cell types plus a single central vascular bundle and the connective parenchyma tissue surrounding the vasculature (Figure 1a,b; Wang et al., 2010, 2011). These features facilitate the acquisition of sufficient carefully staged anthers for biochemical analysis.

Figure 1.

 Maize anther morphology and anatomy. (a) Photograph of a 1.5-mm maize anther when the PMC is ready for meiosis; (b) In transverse view (safranine and fast green staining, Wang et al., 2009), each anther has four locules, a single vascular bundle and surrounding connective parenchyma. The anther lobe somatic tissue is composed of four wall layers: epidermis, endothecium, middle layer and tapetum. PMC is located centrally. Meiosis, from the 1.5–2.5-mm anther stages, results in four microspores from each PMC. These separate from the tetrad and mature into pollen.

Transcriptome profiling has established that more than 33 000 maize genes are expressed during anther development, with many examples of stage-specific expression (Ma et al., 2008). As rapid cell proliferation ceases (anther length of 1.0 mm) and three days later at the start of meiosis (1.5 mm), more than 30 000 genes are expressed (Ma et al., 2008; Wang et al., 2010). Maize male sterile mutants with defects during this progression exhibit diverse phenotypes: excess Pollen Mother Cell (PMC) in mac1 (multiple archesporial cells 1) anthers (Sheridan et al., 1999); ms23 (male sterile 23), ms32 (male sterile 32) and ocl4 (outer cell layer 4) have excess somatic cells in one layer (Chaubal et al., 2000; Vernoud et al., 2009); ms8 (male sterile 8) has somatic cell division and expansion defects (Wang et al., 2010); in am1 (ameiotic 1), PMCs conduct mitosis instead of meiosis (Pawlowski et al., 2009); and msca1 (male sterile converted anther1) anthers have normal external morphology but contain leaf cell types (Chaubal et al., 2003). Transcriptome profiling of mac1, ms23 and msca1 (Ma et al., 2007), ms8 (Wang et al., 2010), and am1 (Nan et al., 2011) demonstrate that mutants with more severe phenotypes exhibit a greater disruption in the transcriptome and that detection of transcriptome deviation occurs before anatomical defects are visualized.

Protein evaluation has lagged behind transcriptome profiling in maize and other flowering plants, largely because near complete gene sequence information is required. Detection of protein confirms that transcripts are translated and can quantify post-translational modifications, two rationales for conducting proteomic investigations. Several recent studies have investigated rice anther protein profiles: 40 proteins were identified as consistently differentially expressed during anther development (Kerim et al., 2003); energy production–related proteins were down-regulated at the postmeiotic uninucleate stage in CMS-HL (Honglian Cytoplasmic Male Sterility) lines, which might explain male sterility in this genotype (Sun et al., 2009). Similarly, postmeiotic rice anthers from plants exposed to drought (Liu and Bennett, 2011), cold (Imin et al., 2004) or heat (Jagadish et al., 2010) treatments have been profiled, because these abiotic conditions can limit fertility. To date, there are no proteomic studies of Arabidopsis anthers, because they are extremely small and difficult to dissect from flowers. A key advantage of maize is that the tassel flowers are initially perfect, but the carpels abort early in floral development resulting in a male-only flower. Additionally, maize anthers are exceptionally large when key developmental stages are reached and are highly canalized in their development (Bedinger and Fowler, 2009); these factors facilitate accurate staging for pooling anthers in proteomic analysis. To date in maize, identification of proteins differentially expressed in ms8 compared to fertile siblings (Wang et al., 2010) and in mutator transposon–laden anthers compared to a sibling line with silenced Mu elements (Skibbe et al., 2009) has relied on identifying differentially accumulated proteins visualized by 2D-DIGE followed by LC/MS/MS identification of proteins in individual spots excised from gels. Generally, there is a strong positive correlation between novel protein detection and significant up-regulation of the corresponding transcript; however, the magnitude of differential expression is often different (Skibbe et al., 2009).

The primary aims of this study were to provide a detailed proteome profile of the low molecular weight proteins expressed premeiotically and during meiosis and to compare sibling fertile and ms8 anthers. Proteins expressed prior to and during meiosis are largely unknown in plants, with individual proteins implicated in meiosis defined by loss-of-function alleles and by antibody detection of proteins in situ. The low molecular weight proteome was selected because small secreted proteins are proposed to coordinate anther growth and cell fate setting (Yang et al., 2003; Zhao et al., 2008). Additionally, the pathogen Ustilago maydis causes anthers to switch to tumour production (Skibbe et al., 2010) by expressing many likely secreted low molecular weight proteins, a strategy that implicates small proteins as potent developmental signalling molecules.

The maize ms8 mutant was selected, because it has multiple, discrete phenotypes. It was discovered because anthers do not exert and meiosis fails (Beadle, 1931; Albertsen and Phillips, 1981). Our transcriptome profiling indicated that more than 11% of the gene expression programme is aberrant in ms8 anthers at the 1.0 mm premeiotic, 1.5-mm entry into meiosis and 2.0-mm mid-meiosis stages; a striking feature is that there is precocious expression of many transcripts indicating acceleration of some developmental programmes as well as ectopic expression of genes and the absence of some normal gene expression (Wang et al., 2010). In ms8 anthers, there are more but shorter epidermal cells, while the tapetal layer has too few but larger than normal cells. Thus, ms8 affects cell expansion in one locular somatic cell type and cell proliferation in a different cell type. At the 1.5-mm stage, ms8 PMCs are more widely separated than normal because of excess callose accumulation. Although the mutant meiocytes complete the first meiotic division to form dyads, these structures are defective and the cells collapse; concomitantly, the ms8 anthers stop growing and then senesce (Wang et al., 2010). Using two-dimensional fluorescence difference gel electrophoresis (2D-DIGE) and LC/MS/MS, 63 differentially expressed proteins were identified previously; the largest representation was in metabolism; however, proteins involved in plasmodesmatal remodelling and cell division were also differentially expressed (Wang et al., 2010). In the current study, we have greatly expanded the assessment of protein changes in ms8 compared to fertile siblings by size-fractionating total proteins by 1-D PAGE, digesting with Lys-C to favour recovery of large peptides from small proteins and sequencing peptides with an LTQ Orbitrap MS to acquire accurate mass and high-resolution measurements (Wong et al., 2009).

Results

High reproducibility: intra- and inter-sample correlation (R2) tests

To further explore the phenotypic consequences of the ms8 defect, carefully staged 1.5 mm (entry into meiosis, PMC fully mature) and 2.0 mm (mid-meiosis stage) sterile and corresponding fertile anthers in a primarily A619 inbred background were dissected for protein extraction; three biological replicates were prepared for each sample type. 25 micrograms of each protein extract was size-separated by NuPAGE (Invitrogen, http://www.invitrogen.com) 1-D gel electrophoresis; there was high reproducibility in the visible staining patterns of three technical replicates of each sample (Figure S1). Using alignment with molecular size markers, the regions corresponding to 3.5–10, 10–15 and 15–20 kDa were excised from individual gel lanes and then subjected to in-gel digestion using Lys-C (Figure S1). There were three biological replicates of each sample type. The resulting peptides were then sequenced in technical triplicates (yielding a total of nine data sets for each sample type) using an LTQ Orbitrap Velos mass spectrometer. Among biological triplicates at the 2.0-mm stage when meiosis is well underway, both fertile (0.94) and sterile (0.95) samples showed excellent inter-sample correlations (Table S1). Meiosis is synchronous in maize, and the very large PMC/meiocytes contain ∼20% of anther RNA (Nan et al., 2011) and likely a similar fraction of total protein, two factors that should contribute to exceptional similarity among samples. At the 1.5-mm stage, the correlation among biological triplicates is very high: 0.81 for fertile samples and 0.89 for sterile (Table S1). The 1.5-mm stage includes anthers in which the PMC is still in interphase between the end of mitosis and the start of meiosis, and anthers in which the meiocytes are in the early steps of prophase I; the asynchrony in cell events is the likely cause of the slightly lower correlation (Ma et al., 2008). As shown in Table S2, intra-correlation coefficients were investigated among technical triplicates of each sample type; these averaged 0.89, proving exceptionally high reproducibility; this experimental workflow has been proven reproducible in previous applications (Hack, 2004).

Proteomic profile and identification of proteins with altered abundance by Orbitrap sequencing

Filtering the combined data set of proteins from all four sample types can be done in myriad ways. Here, we report peptide probability (Nesvizhskii and Aebersold, 2005), protein probability (Nesvizhskii and Aebersold, 2005; Nesvizhskii et al., 2007), number of peptides indicative of a single protein (Bantscheff et al., 2007) and finally the number of times peptides were counted; these data are presented in Table S3 (Old et al., 2005). In using a minimum peptide identity probability of 95%, a minimum protein identity probability of 90% and a minimum of two assigned peptides, counted at least twice resulted in 359 proteins with a 0.3% peptide false discovery rate (FDR) and 0.2% protein FDR. Applying the less stringent criteria of only one assigned peptide, requiring only one count gives 2761 unique proteins with a 0.4% peptide FDR and a 7.6% protein FDR (Table S3). As expected, the criterion of two assigned peptides dramatically improves the FDR with the cost of a much smaller data set. Because we are only focusing on proteins smaller than 20 kDa, it is important to note that some very small proteins may only have a single Lys-C cleavage site and therefore generate only a single peptide amenable to a MSMS shotgun proteomics regimen. These proteins are not included in the highest confidence data set, but are undoubtedly present. To increase the robustness of the semi-quantitative measurement of spectral counting (Old et al., 2005), an additional criterion of a minimum of two spectra counted was applied with a minimum of one peptide assignment, which resulted in a total of 672 proteins (Table S3). Because the FDR for peptide assignment is 0.4%, we are confident the majority of the protein assignments based on these peptides are accurate. Molecular weight distribution analysis of the protein models demonstrated that 68% of them were <25 kDa as expected, and the remaining predicted high molecular weight proteins might be processed or truncated products; some protein fragments could be low molecular weight signalling proteins (Figure S2).

Distinctive and shared features of proteome patterns of ms8 and fertile anthers were explored with the 359 highly confident set. As shown in Figure 2a, 93 proteins were specifically expressed at the 1.5-mm stage, and 36 proteins are exclusive to the 2.0-mm stage in fertile anthers. These proteins are valuable stage markers for maize anther development. Proteins involved in translation and catalytic activity are the largest component of these stage-specific proteins; there are 27 proteins in these two categories specifically expressed in 1.5-mm fertile anthers, and 12 proteins in these categories are specific to 2.0-mm anthers (Figure 3). Proteins involved in protein binding and transport are also represented in the stage-specific proteins: seven transport and five protein-binding proteins are found only in the 1.5-mm stage fertile anthers, while nine transport proteins and seven protein-binding proteins are found only in the 2.0-mm stage fertile anthers (Figure 3). These data indicate that execution of meiosis results in obvious changes in the protein profile, especially for translation- and catalytic-activity-related proteins.

Figure 2.

 Venn diagrams of proteome content and overlap in various comparisons. (a) Comparison of fertile 1.5- and 2.0-mm stages; (b) Comparison of sterile 1.5- and 2.0-mm stages; (c) Comparison of sterile and fertile anthers at the 1.5-mm stage; (d) Comparison of sterile and fertile anthers at the 2.0-mm stage; (e) Comparison of 1.5-mm sterile anthers with 2.0-mm fertile anthers protein profile to test for precocious protein accumulation in ms8 anthers; (f) Comparison of transcriptome (transcripts with unique protein ID) and proteome data sets at the 1.5-mm stage; (g) Comparison of transcriptome (transcripts with unique protein ID) and proteome data sets at the 2.0-mm stage; (h) and (i) Comparison of a deep-sequenced whole anther proteome (unpublished data from Zhouxin Shen and S. Briggs utilizing staged W23 anther samples dissected at Stanford) and the reported proteome focused on low molecular weight proteins; (h) comparison of the whole anther proteome with 240 highly secure identified proteins in 2-mm fertile anthers addressed here; (i) comparison of whole anther proteome with 1239 proteins with at least one peptide hit identified in 2-mm fertile anthers in our study.

Figure 3.

 Gene ontology annotation. Percentage of each gene ontology category of proteins expressed in various comparisons. (a–d): stage-specific proteins; (a) compared with 2.0-mm fertile anthers, specifically expressed proteins in 1.5-mm fertile anthers; (b) compared with 2.0-mm sterile anthers, specifically expressed proteins in 1.5-mm sterile anthers; (c) compared with 1.5-mm fertile anthers, specifically expressed proteins in 2.0-mm fertile anthers; and (d) compared with 1.5-mm sterile anthers, specifically expressed proteins in 2.0-mm sterile anthers; (e–h): fertile or sterile specific proteins; (e) compared with 1.5-mm sterile anthers, specifically expressed proteins in 1.5-mm fertile anthers; (f) compared with 2-mm sterile anthers, specifically expressed proteins in 2.0-mm fertile anthers; (g) compared with 1.5-mm fertile anthers, specifically expressed proteins in 1.5-mm sterile anthers; and (h) compared with 2.0-mm fertile anthers, specifically expressed proteins in 2.0-mm sterile anthers. Categories are in alphabetical order, starting from the top of the list.

Protein profiles overlap substantially between ms8 and fertile siblings (Figure 2c,d). At the 1.5-mm stage, 92% of ms8 anther proteins were shared with fertile anthers; for the 2.0-mm stage, the overlap is 80% (Figure 2c,d). As for the 18 proteins detected in 1.5-mm sterile but not found in 1.5-mm fertile anthers, the top two protein categories are transport and nucleotide binding. For the 81 proteins found in fertile anthers but missing in ms8 anthers, catalytic activity, translation and transport are the top three categories (Figures 2c and 3). These data indicate that ms8 mutants are expressing many proteins in the same categories as fertile siblings; however, different genes are expressed within each category. For the 48 proteins found in 2.0-mm sterile samples but not detected in the fertile counterparts, catalytic activity, DNA binding and translation are predominate. Transport, protein binding and translation are the most abundant classes for the 48 proteins found in 2.0-mm fertile and missing from the sterile data set (Figures 2d and 3); these proteins highlight processes facilitating normal meiosis and should be of interest in future studies of the progression of meiotic stages in plants.

A developmental analysis was conducted for ms8: when comparing proteins identified in two stages of sterile anthers, 36 proteins were detected only at the 1.5-mm stage and 44 proteins only at 2.0 mm (Figure 2b). The Gene Ontology (GO) terms associated with the proteins are slightly different from the pattern in fertile anthers at both stages (Figure 3). Proteins involved in transport, translation and nucleotide binding are the major categories among the 1.5-mm stage-specific proteins; catalytic activity, DNA binding and protein metabolic process are the three main contributors for 2.0-mm stage-specific proteins (Figure 3).

Proteins differentially expressed between ms8 and fertile anthers at two stages were analysed by spectral count, when comparing profiles, a Fisher’s exact score (P-value, Zhang et al., 2006) was generated; the twenty top statistically significant differentially expressed proteins are shown in Figure 4a,b. Expression of many nucleic acid–binding proteins changed dramatically before and during meiosis (Figure 4a,b); although these proteins do not show an absolute on/off pattern, quantitative mis-expression is likely to contribute to the many defects in ms8 anthers.

Figure 4.

 Differentially expressed protein between ms8 and fertile at the 1.5 and 2.0-mm stages. (a) 1.5 mm differentially expressed proteins; (b) 2.0 mm differentially expressed proteins. The statistical significance was ranked by a Fisher’s exact score (P-value). The protein lists were produced from spectral count data of proteins that were identified in both sterile and wild type fractions. All data were manually validated to ensure proteins corresponding to low spectral count (<3) were true positives. The data plotted represent the log of P-value in which a negative value indicates a lower quantitative response in the sterile form and a positive value a higher quantitative response in the sterile form.

Gene Ontology analysis for all proteins identified in fertile, sterile and at both anther developmental stages was considered (Figure S3). In all four groups, proteins involved in translation, transport and catalytic activity are among the top three categories. We conclude that translational capacity and protein transport are key cell processes in both premeiotic and meiotic maize anthers.

ms8 transcriptome profiles at the 1.0, 1.5 and 2.0 mm stages were analysed using custom-designed Agilent 4 × 44 K microarrays containing 42034 probes (excluding controls) to 39612 genes. Major shifts in the transcriptome were detected between sterile and fertile anthers from sibling plants in the same family (Wang et al., 2010). One notable finding was that ms8 anthers achieve some developmental landmarks at a smaller size than fertile anthers; reflecting this precocious development, more than 41% of the 1.5-mm stage-specific ms8 transcripts are expressed at the 2.0-mm stage in fertile siblings (Wang et al., 2010). The current proteomic data substantiate the precocious phenotype first detected in the transcriptome pattern: 80% of the proteins expressed in ms8 anthers at the 1.5-mm stage are also expressed by 2.0-mm fertile anthers (Figure 2e).

Furthermore, the proteomic data set provided a significant expansion of the expressed gene set (Figure 2f,g). When comparing 1.5-mm fertile microarray data (among the 39 612 gene probes, 18 730 transcripts with unique protein ID) with the same stage proteomic data with the less strict criterion of only one or more assigned peptides, 632 proteins were identified only in the protein data set (Figure 2f). At 2.0 mm, 617 proteins were identified only in proteomic data (Figure 2g). A total of 742 small proteins were found in the proteomic analysis for which either transcripts were not detected or no probe was present on the microarray. Maize encodes many small proteins: 8.6% of the 27 500 maize full-length cDNAs encode ORFs of <100 amino acids, with nearly half of these encoding proteins <50 amino acids (Soderlund et al., 2009). Such small proteins are often missed in genome annotation (Silverstein et al., 2007; Lease and Walker, 2006), because only open reading frames encoding more than 100 amino acids are accepted in most automated genome annotations. Consequently, the present study with a focus on small proteins can be used to improve genome annotation as well as highlight the dynamic expression of small proteins during anther development.

Search for processed proteins

To screen for proteins that may have been processed to a shorter length, particularly the class of proteins with transit peptides, the small proteome data set for maize anthers was analysed further to detect peptides not predicted to arise from Lys-C cleavage, an enzyme that cleaves after lysine residues. Peptides that map to the amino terminal region of a protein model but for which the preceding amino acid is not lysine are candidates cases of signal peptide cleavage (Figures S4 and S5); signal peptides average 22.6 amino acids in eukaryotes and have defined features (http://www.cbs.dtu.dk/services/SignalP-1.1/sp_lengths.html). Several protein models match these criteria. One is a 60 S ribosomal protein encoded in the nucleus but with protein function in an organelle; two other models correspond to different isoforms of heat shock factor-binding protein 1 (Table S4). Many heat shock proteins are mitochondrial or Golgi proteins (Lindquist and Craig, 1988), and in plants, additional types are plastid-localized (Nguyen et al., 1993; Lenne and Douce, 1994; Suzuki et al., 1998; Lee et al., 2000; Wang et al., 2004), with the requirement of signal peptide cleavage for correct localization. A high complexity of heat shock proteins was detected when isolated maize microspores were subjected to thermal stress, suggesting that heat shock proteins are inducible during pollen development around the first mitosis stage (Magnard et al., 1996). This corresponds to a stage about 1 week later than the 1.5 and 2.0-mm stages examined in our study. Heat shock proteins are also constitutively expressed as part of the cellular protein folding machinery. To date, 22 maize heat shock protein isoforms have been identified, and individual heat shock binding proteins have specific interactions with them during plant development in the absence of heat shock (Fu et al., 2006). Our data set contains heat shock binding proteins present in anthers before and during meiosis, implying that these factors are critical in early anther development at optimal growth temperature conditions (∼28 °C day/20 °C night).

Comparison of the small anther proteome to a whole proteomic data set

Fertile maize anthers at the 2.0-mm stage were dissected from five plants in the W23 inbred background as a contribution to assembly of a whole protein expression atlas for maize. There are 5889 proteins identified in this global proteome data set in fertile 2.0-mm anthers; this is 24 times the diversity of the focused small proteome data set from this stage identified in a 1 h Orbitrap analysis with at least two peptide hits, and nearly five times higher compared to the small protein data set based on a single peptide hit (Figure 2h,i). Although the small protein data set had a short LTQ Orbitrap operating time and only a small portion of the sizing gel (Figure S1) was sampled, the small proteome data set contains a substantial number of unique proteins (Figure 2h,i). Among the 359 high confidence identified proteins with at least two peptide hits, there are 55 proteins that were not detected in the whole anther analysis; as for proteins with at least one peptide hit, 612 proteins were acquired only in the small protein data set (Figure 2h,i). Functional annotation of these proteins indicates a high diversity of GO representation, indicating that the small protein data set is an important addition to the anther proteome, totalling more than 10% of the combined data sets (Tables S5 and S6). Table S7provides the GO annotation of the high confidence proteins.

The small protein class is not abundant, based on the paucity of gel staining in the regions sampled (Figure S1); there could be an extraction bias against recovery of small proteins from whole anthers or unexpected migration behaviour as well as the intrinsic inability to reliably identify small proteins after trypsin digestion. Because 49% of the total small proteome found in the restricted analysis was not detected in the whole proteome, our results indicate the value of focused proteome analysis, particularly for small proteins. Validation that specific small proteins exist will be extremely useful in updating the annotation of the proposed gene models for small proteins and in identifying a class of short open reading frames as authentic protein-coding maize genes.

Discussion

As a major crop in terms of economic value and a primary food source for hundreds of millions of people, corn has been improved through intensive breeding. Increasing grain yield is the main target of selection, and the most substantial progress has come from use of high-quality hybrid seed (Duvick, 2001). Mature maize male and female flowers are located in the tassel and ear, respectively, allowing mechanical de-tasselling to permit pollination of one row only by a neighbouring pollen-shedding inbred line (Duvick, 2001). From the mid-1950s until 1970, CMS was used to circumvent detasseling of the ear parent; however, the resulting monocultures with regard to the CMS T (Texas) mitochondrial genome resulted in massive crop failures after emergence of race T of the fungal pathogen Helminthosporium maydis (Levings and Siedow, 1992). A return to male sterility to simplify and lower the cost of hybrid corn seed production will require the use of diverse germplasm to prevent genetic vulnerability in the resulting crop. There are many spontaneous and induced male sterile mutants caused by defects in maize nuclear genes. If deployed appropriately, nuclear male sterility could be as effective as CMS employed previously. Assessment of protein profiles between male sterile mutants and fertile sibling anthers before and during meiosis could clarify the mechanisms of this important progression, and eventually result in the establishment of male sterile inbred lines useful in hybrid seed production.

In this study, differential quantitative proteomic profiling was used to assess developmental changes in maize anthers. The first observation is that maize anthers staged by length have remarkably consistent proteomes, illustrating the appropriateness of length staging within an inbred line, a notable advantage for the investigation of male reproductive development. On average, the protein yield is 3.76 μg per 1.5-mm anther and 7.76 μg per 2.0-mm anther (D. Wang, unpublished data); therefore, samples as small as seven anthers suffice for one replicate in maize. In contrast, about 3000 meiotic stage anthers were needed to produce enough rice protein for analysis (Kerim et al., 2003); even at later stages, such as the uninucleate pollen stage, 40–50 individual rice plants were needed for dissection to produce one biological replicate (Sun et al., 2009). Given the effort involved in obtaining carefully staged anthers, some rice anther proteomic analyses have pooled anthers at different developmental stages (Liu and Bennett, 2011). To date, no proteomic profiles are available for Arabidopsis anthers, given the difficulty of dissecting and collecting tiny anthers. Therefore, the maize anther proteome will serve as the model for flowering plants; because there is high conservation of anther gene expression programmes (Ma et al., 2008), many discoveries in maize will be of general utility.

In maize, male gametophyte development is a tightly programmed and elaborate process central to crop productivity. Therefore, improving our understanding of protein expression in male sterile plants with defects prior to meiosis is a major research priority. We have previously used a 2-D DIGE and LC/MS/MS approach to analyse ms8 and its fertile siblings at three early developmental stages. Sixty-three differentially expressed proteins were identified, with metabolism being the most abundant GO category (Wang et al., 2010). Here, we have expanded this analysis to focus on a profile of small proteins (3.5–20 kDa) profile via LTQ Orbitrap MS analysis. In the set of 359 proteins identified with high confidence, differentially expressed proteins were identified by spectral count (Figure 4a,b). Hundreds of proteins are expressed in a stage-specific manner in both fertile and ms8 sterile anthers, providing new insights about entry into and progression through meiosis; additionally, the hundreds of proteins differentially expressed between fertile and sterile anthers provide new insights into the aberrant phenotypes of ms8 (Figure 2). In addition to the metabolism GO category, proteins involved in transport, translation and catalytic activity showed distinctive expression patterns (Figure 3). The translation category is the major category in 1.5- and 2.0-mm fertile maize anthers, with catalytic activity second (Figure S3). During the ∼3 days separating these two stages, 36 proteins appear and 93 disappear from the proteome lists. Data from a later stage of fertile rice anther development are available: at the uninucleate pollen stage in rice, 97 proteins were identified, the most abundant GO designation was catalytic activity (34%), and 27% of the rice proteins had an unknown molecular function (Sun et al., 2009).

An important contribution of this study is identification of hundreds of distinctively expressed proteins in the premeiotic and mid-meiosis stages as well as markers for male sterility in the ms8 mutant. For example, ADP ribosylation factors (Table S7) exist in functionally redundant plant gene families; they are low molecular weight GTPases required for vesicle trafficking and plasma membrane expansion in eukaryotes. In plants, ARFs play critical roles in secretion and intracellular molecular distribution and are required for the acquisition of epidermal cell polarity (Xu and Scheres, 2005), secretion of callose (Böhlenius et al., 2010) and polar auxin trafficking (Kleine-Vehn et al., 2009). Six maize ADP ribosylation factor proteins were quantified in the high confidence protein data set, and they show complex regulation. Sterile 1.5-mm anthers contain three ADP ribosylation factor proteins (GRMZM2G044368, GRMZM2G075719 and GRMZM2G061912), none of which were found in their 1.5 mm fertile counterparts. Interestingly, all three proteins are detected in 2.0-mm fertile anthers. This protein accumulation pattern confirms that ms8 anthers exhibit precocious expression of some developmental programmes, first evidenced as a substantial proportion of shared transcripts between the fertile 2.0 and ms8 1.5-mm anthers (Wang et al., 2010). Fertile anthers at both developmental stages express GRMZM2G106960, and this protein is present in ms8 1.5-mm anthers but was not detected at 2.0 mm nor were the three precociously expressed ADP ribosylation factors; consequently, at 2.0 mm, ms8 anthers are missing (or have below detectable amounts) of four ADP ribosylation factors accumulating in fertile siblings.

Fertile 1.5-mm anthers express two ADP ribosylation factor genes (GRMZM2G113995 and GRMZM2G086636) that were not detected in the sibling ms8 sterile anthers; however, they were detected in 2.0-mm sterile anthers, somewhat of a surprise because the general pattern is that ms8 mutant anthers show acceleration of numerous developmental programmes (Wang et al., 2010). Given the diverse roles of ADP ribosylation factors in differentiation, mis-regulation of individual isoforms could contribute to ms8 defects in specific cell types such as epidermal expansion, tapetal cell division and callose secretion. Future studies of individual anther cell types could clarify this hypothesis.

GRMZM2G030523 encodes proliferating cell nuclear antigen, an accessory protein for DNA polymerase (Nagar et al., 1995). It is a marker for 1.5-mm fertile anthers compared to the 2.0-mm meiotic stage. Most anther cell division is completed by 1.0 mm (Kelliher and Walbot, 2011); however, a low frequency of division continues over the following 3 days leading up to meiosis, while there is only rare somatic cell division during meiosis. The ADP ribosylation factor GRMZM2G086636 is also a 1.5-mm stage marker, because the protein was not detected in 2.0-mm fertile anthers. Several proteins involved in managing reactive oxygen species are also differentially accumulated in 1.5-mm fertile compared to 2.0-mm fertile anthers: two thioredoxins (GRMZM2G436199 and GRMZM2G300066) and one glutaredoxin (GRMZM2G148387). Glutaredoxins play stage-specific roles in anther development, because knockout mutations confer striking defects (Xing and Zachgo, 2008; Hong et al., 2012). Additionally, two glutathione S-transferases (GRMZM2G031545 and GRMZM2G146246) are stage markers characteristic of 2-mm fertile anthers and are also absent in both stages of ms8 sterile anthers.

Four differentially expressed proteins (GRMZM2G009048, GRMZM2G012479, GRMZM2G075655 and GRMZM2G157443) in the data set were annotated as antifreeze proteins. Antifreeze proteins are usually 12–35 kDa with hydrophilic interfaces that bind to ice crystals in intercellular spaces, altering crystal patterns to reduce cellular dehydration and damage (Jia and Davies, 2002; Pearce, 2001; Griffith et al., 1992; Hon et al., 1994). These proteins are expressed in cold-tolerant species in response to temperature and day-length stimuli; however, maize is not among these cold-tolerant species (Antikainen and Griffith, 1997). Antifreeze proteins are closely related to pathogen response proteins such as β-1, 3-glucanases and chitinases, and plants expressing these proteins have increased resistance to pathogen attack (Huang and Duman, 2001; Hon et al., 1995). It is likely that the proteins detected in maize anthers serve stress response or antipathogenic roles, rather than cold tolerance. The four proteins identified in this data set each have a different pattern of abundance; two of the proteins are more abundant in 1.5-mm fertile anthers compared to 2-mm fertile anthers, and the other two proteins are more abundant in 2-mm anthers compared to 1.5 mm. For all four proteins, when a difference between sterile and fertile comparisons was observed, the proteins were found to be more abundant in the sterile samples. Therefore, the antifreeze proteins show stage-specific accumulation patterns in 1.5 and 2-mm anthers, and the ms8 mutant has increased levels of these proteins. We speculate that the absence of the ms8 protein increases the stress response in sterile plants leading to an overall increase in classes of stress response proteins expressed in anthers. Finally, using the larger proteome list with 1 or more assigned peptides, a cell death protein (protein ID: GRMZM2G021710_P01, death effector) is detected only in 1.5-mm ms8 anthers, and this protein may contribute to the lethal defects of the mutant, a subject for future investigation of the mechanisms underlying male sterility in ms8 and other maize mutants.

The small molecular weight proteome analysis also provides information on an often neglected class of proteins, some of which may serve as signalling molecules during anther development. In a shotgun proteomics approach, the ideal peptide is of a molecular weight larger than 1000 Da, this corresponds to a 9–10 mer using an averaging model. Our data were generated from gel regions corresponding to 3.5–20 kDa, and Lys-C was used for protein digestion. In studying the low molecular weight proteome, particularly using an initial SDS-PAGE size fractionation, there are inherent advantages to using this enzyme. Trypsin is the standard protease used in proteomics research; it cleaves C-terminal to both lysine and arginine residues. For small proteins with multiple lysine and arginine residues, the multiple small peptides are unlikely to be assigned successfully to protein models. Therefore, we used Lys-C to increase average peptide length.

On the basis of full-length cDNA sequencing, about 10% of transcripts encode proteins <100 amino acids. Given that maize anthers at 1.5 and 2.0 mm express about 30 000 transcripts, we might expect up to 3000 small proteins. Based on the intensity of gel staining, however, the low molecular weight region of the anther proteome contains very little protein. In this study, 359 proteins were identified with high confidence, perhaps 12% of the total small proteome; at the more relaxed criterion of one peptide assignment, 2761 proteins were identified, or about 90% of the expectation. Because the overall protein complexity is so reduced compared to the whole anther proteome, a much shorter LC/MS run time should be sufficient to identify most proteins. At the chosen time of 1 h analysis per molecular weight range, the instrument was matched very well with the level of protein complexity, because the maximum duty cycle (the number of peptides sequenced for every survey scan) was rarely pushed to its limitations.

Experimental procedures

Plant material and anther dissection

ms8 and fertile sibling planting and anther dissection were performed as previously described (Wang et al., 2010); cytological study and anther sampling were carried out according to recent publications (Wang et al., 2009, 2010). For the deep whole proteome, fertile maize plants in the W23 inbred background were planted in the field during summer 2009; 300 anthers at the 2.0-mm stage were dissected from each individual and five biological replicates were used for analysis.

SDS-PAGE and in-gel digestion

Anther protein extraction was performed as reported previously (Skibbe et al., 2009). Three biological replicates of each stage were analysed to identify differentially expressed proteins between ms8 and fertile siblings. Total proteins were separated by the NuPAGE electrophoresis system (Invitrogen, http://www.invitrogen.com/site/us/en/home.html), with three technical replicates to improve reliability. Gels were stained with SimplyBlue SafeStain (Invitrogen); three zones (20–15, 15–10 and 10–3.5 kDa) were excised per lane using the Sharp Protein Standard (Invitrogen) as a guide. An example of sample reproducibility and gel cutting is shown in Figure S1. In-gel digestion was performed as previously described (Shevchenko et al., 1996, 2007), but using Lys-C as the protease. In brief, excised gel slices were reduced with DTT and free cysteines alkylated using propionamide. Each gel slice was destained using an acetonitrile/water solution and dried in a speed vacuum. The slices were reconstituted in 100 nanogram of Lys-C, 50 mm Ammonium bicarbonate, pH 8.0. Gel slices were digested overnight at 37 °C in the presence of 0.02% protease max (Promega, Madison, WI). Peptides were extracted, and each extract was taken to dryness in a speed vacuum centrifuge. Each sample was reconstituted with the same sample buffer (0.1% formic acid, 2% acetonitrile and 97.9% water) and injected onto an HPLC column, as described in the next section.

LTQ Orbitrap Velos sequencing and protein data analysis

The HPLC was an Eksigent (Dublin, CA) nano2D LC run at 600 nL/min with a 60 min linear gradient where solvent B (acetonitrile) started at 2% ramping to 35% over 45 min, followed by column washing and re-equilibration; a Michrom Advance source (Michrom Advance, Auburn, CA) was used in conjunction with a in-house packed fused silica C18 reversed phase column (150 μm inner diameter, 12 cm length). The mass spectrometer was a LTQ Orbitrap Velos, controlled using Xcalibur 2.0 software (Thermo Scientific, San Jose, CA) and run in data-dependent acquisition mode, in which the mass spectrometer was instructed to sequence the top 12 most intense peptide precursor ions. The precursor scan occurred in the Orbitrap at a 60 000 resolution, where each of the ms/ms scans occurred in the LTQ Velos ion trap; this workflow is commonly referred to as a high-low protocol.

Each Thermo (Thermo Scientific, San Jose, CA) RAW file was converted to mzXML format using an msconvert script, after which the file was submitted to database searches using Sequest on a Sorcerer platform. The database was downloaded from the NCBI as Zea mays. Variable modifications of serine, threonine, tyrosine phosphorylation, lysine methylation, demethylation and acetylation were allowed in the analysis. The precursor ions were searched with a 20 ppm mass tolerance and 2 mis-cleavages. Each search result was combined into a master scaffold file for which protein ID, statistical analysis (Fisher’s Exact score) and spectrum counts were reported. The data presented as high confidence identification represent proteins that were identified with at least two peptides; this greatly reduced the size of our data set but significantly increased the confidence in protein identification as well as relative quantification using spectral counting. Whole anther proteome analysis was performed in the laboratory of Steven P. Briggs at UCSD (http://briggs.ucsd.edu/publications.html).

Venn diagram and GO chart construction

Comparisons between microarray and proteome results were done by assigning maize protein IDs to each microarray probe using the BLASTx version and BLAST program. Protein sequences were downloaded from version 2 of MaizeGDB’s (http://www.maizegdb.org) filtered gene set. Protein assignment for the proteome spectra utilized the same MaizeGDB data set. GO terms for the filtered gene set were also downloaded from MaizeGDB.

Whole anther total protein sequencing analysis

Protein and spectra counts were extracted from the whole anther total protein mass spectrometry data set for each sample type using Scaffold 3 software (http://www.proteomesoftware.com) with the following criteria: minimum number of peptides = 1, minimum peptide per cent = 95% and minimum protein per cent = 90%. For certain charts, a stricter criterion was applied where the minimum number of peptides = 2.

Acknowledgements

We thank Dr. Guo-ling Nan for anther sample dissection for the deep, whole proteome analysis conducted by Dr. Zhouxin Shen and Dr. Steven P. Briggs of UC-SD. We thank Dr. Allis Chien at the SUMS facility and Dr. David S. Skibbe for guidance on the NuPAGE. This research was supported by the National Science Foundation (07-01880) and Award Number S10RR027425 from the National Center for Research Resources. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Center for Research Resources or the National Institutes of Health.

Ancillary