Summary
- Top of page
- Summary
- Introduction
- Results
- Gene expression profiles of dry and germinating seeds
- Upstream regions of highly expressed genes in wild-type dry seeds enriched ABREs
- ACGT-containing ABREs are less operative in the abi5 seed
- Intensity and kinetics of ABRE-mediated gene expression during germination
- Combined effects of ABRE and other motifs
- Co-regulation of chromosomal gene clusters in the seed
- Discussion
- Stored mRNA species and phase transition in the seed
- Cis-acting elements determining transcriptome of stored mRNA
- Co-expression is frequently observed in seed
- Large silent regions display characteristics of heterochromatin
- In silico analysis: a powerful tool for dissecting expression profiles
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
To reveal the transcriptomes of Arabidopsis seed, comprehensive expression analysis was performed using ATH1 GeneChips (Affymetrix, Santa Clara, CA, USA). In the dry seed, more than 12 000 stored mRNA species were detected, including all ontological categories. Statistical analysis revealed that promoters of highly expressed genes in wild-type dry seeds overrepresented abscisic acid-responsive elements (ABREs) containing the core motif ACGT. Although the coupling element and seed-specific enhancer RY motif alone were not prominently overrepresented in genes with high expression, the presence of these elements in combination with ABRE was associated with particularly high gene expression. The transcriptome of the imbibed seeds differed from that of the dry seed even at 6 h after seed imbibition. After imbibition many upregulated and downregulated genes were co-regulated in clusters of three to five genes. Genes for which expression was affected by the abi5 mutation tended to be located in clusters, suggesting that transactivation by ABI5 is not restricted to a single gene, but affects other proximal genes. Furthermore, cytosine methylation was observed not only in large silent retrotransposon clusters in centromeric regions, but also in non-centromeric silent gene clusters in the seed. These results suggest that such regions might be transcriptionally silenced by methylation or heterochromatin structures. Our analyses reveal that transcriptomes of Arabidopsis seed are characterized by multiple regulatory mechanisms: epigenetic chromatin structures, chromosomal locations (e.g. co-regulated gene clusters) and cis-acting elements.
Introduction
- Top of page
- Summary
- Introduction
- Results
- Gene expression profiles of dry and germinating seeds
- Upstream regions of highly expressed genes in wild-type dry seeds enriched ABREs
- ACGT-containing ABREs are less operative in the abi5 seed
- Intensity and kinetics of ABRE-mediated gene expression during germination
- Combined effects of ABRE and other motifs
- Co-regulation of chromosomal gene clusters in the seed
- Discussion
- Stored mRNA species and phase transition in the seed
- Cis-acting elements determining transcriptome of stored mRNA
- Co-expression is frequently observed in seed
- Large silent regions display characteristics of heterochromatin
- In silico analysis: a powerful tool for dissecting expression profiles
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
In the course of embryogenesis and seed maturation, the embryo enters a quiescent phase (Bewley, 1997). The seed remains quiescent, even if watered, until conditions such as temperature, light and nutrients become permissive for germination. Dry mature seed contains a large number of mRNA species. Stored RNA in mature dry seed was first found in cotton (Dure and Waters, 1965) and is universal in plant species (Almoguera and Jordano, 1992; Ishibashi et al., 1990; Kuligowski et al., 1991). The function of stored RNA remains unknown, however it is thought to play a role in protein synthesis during the early stages of germination.
Developmental phases during late embryogenesis and subsequent germination are characterized by spatial and temporal patterns of gene expression (Nambara et al., 2000; Parcy et al., 1997). Expression analyses have shown that the transition from late embryogenesis to germination occurs gradually and therefore characteristics of both developmental phases co-exist during the transition (Harada et al., 1988; Nambara et al., 2000). In accordance with this, sequence analysis suggests that the stored RNA comprises transcripts necessary for both late embryogenesis and seed germination (Comai et al., 1989; Hughes and Galau, 1989, 1991). Subsets of the stored mRNA in dry seed are well described in several plant species, however a genome-wide analysis of stored RNA has not been reported, and the regulation and function of the dry seed transcriptome is poorly understood.
Abscisic acid (ABA) plays a primary regulatory role in seed maturation and dormancy. ABA promotes expression of a number of genes such as Em and Late embryogenesis abundant (Lea) during seed development and maturation (Busk and Pagès, 1998; Cuming, 1999). Functional analysis of ABA-regulated gene promoters has identified the motifs mediating ABA response. The most widely known ABA-responsive element (ABRE) is a sequence resembling a G-Box with an ACGT core (Marcotte et al., 1989; Shen et al., 1993). The ABRE requires a second element to form a functional ABA response complex (ABRC) (Shen and Ho, 1995). The coupling element (CE) has been identified as one such motif, and it elicits a response to ABA when located adjacent to an ABRE (Shen and Ho, 1995). The RY/Sph motif is involved in seed-specific gene expression (Dickinson et al., 1988; Hattori et al., 1992). Similar to the CE, the RY element also interacts synergistically with ABRE (Ezcurra et al., 1999; Vasil et al., 1995). Several other cis-acting motifs have been also reported (reviewed in Thomas, 1993). These analyses mostly infer a complicated regulation system that involves multiple cis-acting motifs controlling ABA-mediated transcription during seed development and maturation.
Genetic studies have identified components involved in ABA biosynthesis and signaling in the seed. These mutants were defective in accumulation of seed storage reserves and seed dormancy (Finkelstein and Somerville, 1990; Karssen et al., 1983; Koornneef et al., 1984; Nambara et al., 1992). Three ABA-insensitive loci, ABI3, ABI4 and ABI5, have been identified, encoding B3, AP2 and bZIP-type transcription factors, respectively (Finkelstein and Lynch, 2000; Finkelstein et al., 1998; Giraudat et al., 1992; Lopez-Molina and Chua, 2000). Biochemical approaches revealed their target sequences as the RY/Sph motif for ABI3, CE1-like sequences for ABI4, and ACGT-containing ABRE for ABI5 (Carles et al., 2002; Kim et al., 2002; Mönke et al., 2004; Niu et al., 2002). These transcriptional regulators are conserved in both monocots and dicots (Hobo et al., 1999a; Niu et al., 2002; Suzuki et al., 1997). Physical interaction between Arabidopsis ABI3 and ABI5, or their rice orthologs, VP1 and TRAB1, has been demonstrated (Hobo et al., 1999a; Nakamura et al., 2001). This suggests that in response to ABA, ABI3 and ABI5 act in concert, via RY motifs and ABREs, to regulate gene expression in the maturating seed. While the genes expressed during seed development and germination have been extensively analyzed in many species (Dong et al., 2004; Girke et al., 2000; reviewed in McCarty, 1995; Ogawa et al., 2003; White et al., 2000), the in vivo effect of ABI transcription factors on global expression profiles has not yet been fully analyzed.
To explore genome-wide expression patterns as well as regulatory mechanisms in Arabidopsis seeds, we conducted mRNA profiling in dry and imbibed seeds using an oligonucleotide-based microarray. By comparing expression profiles between wild type and abi mutants, we found that the ABRE predominantly defines the transcriptome in the dry seed. We also observed a consistent trend of co-regulation of neighboring genes. Cytosine-methylation was detected in centromeric regions and in silent gene clusters. Multiple regulations by epigenetic chromatin structures, chromosomal locations (e.g. co-regulated gene clusters) and cis-acting elements within Arabidopsis seed transcriptomes are discussed.
Gene expression profiles of dry and germinating seeds
- Top of page
- Summary
- Introduction
- Results
- Gene expression profiles of dry and germinating seeds
- Upstream regions of highly expressed genes in wild-type dry seeds enriched ABREs
- ACGT-containing ABREs are less operative in the abi5 seed
- Intensity and kinetics of ABRE-mediated gene expression during germination
- Combined effects of ABRE and other motifs
- Co-regulation of chromosomal gene clusters in the seed
- Discussion
- Stored mRNA species and phase transition in the seed
- Cis-acting elements determining transcriptome of stored mRNA
- Co-expression is frequently observed in seed
- Large silent regions display characteristics of heterochromatin
- In silico analysis: a powerful tool for dissecting expression profiles
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
Stored mRNA in mature dry seeds might reflect the gene expression pattern prior to seed desiccation. To reveal the gene expression profiles in wild-type dry and imbibed seeds in Arabidopsis, we performed microarray analysis using the Affymetrix GeneChip ATH1. Among 22 476 genes represented on the microarray, 12 470 mRNA species were quantitatively detected in dry seeds. Duplicate microarray analysis using the different seed batch gave consistent results. A similar number of transcripts were detected in 24 h-imbibed seeds (14395 genes) (see Supplementary Material S1 ).
Stored mRNA species in dry seeds are thought to be required for late embryogenesis and seed germination. In order to differentiate these two classes of genes, we analyzed expression profiles obtained from seeds at 6, 12 and 24 h after imbibition. Of the 12 470 stored mRNA species, more than 10 000 mRNA species were also quantitatively detected during the course of seed imbibition. K-means clustering analysis classified the genes detected in dry seeds into seven kinetic patterns (Figure 1a). This temporal profiling of genes represented as stored mRNA species reflected the overlapping nature of both late embryogenic and germinative states. The largest gene set was group 3, which showed relatively constant expression levels for 24 h after imbibition. Other sets of genes were dramatically or moderately downregulated (groups 1 and 2, respectively), and more than a 1000 genes were upregulated gradually or dramatically after 12 h (groups 6 and 7, respectively). Genes in groups 4 and 5 showed transient increases in transcript level at 6 and 12 h, respectively. Expression patterns of several genes from each group were validated by semiquantitative RT-PCR and representative results are shown in Figure 1(a).
Figure 1(b) shows the ontological classification of the genes in each group. A number of highly expressed genes for late embryogenesis are included in group 1. Genes related to metabolism were overrepresented in groups 1, 6 and 7. This indicated that embryogenic genes were largely downregulated within 6 h of imbibition, and that induction of metabolic genes involved in germination started around 12 h after imbibition. During this shift, expression of genes involved in cell cycle, DNA processing, transcription and protein synthesis were activated, and these categories were significantly overrepresented in groups 5 and 6.
Combined effects of ABRE and other motifs
- Top of page
- Summary
- Introduction
- Results
- Gene expression profiles of dry and germinating seeds
- Upstream regions of highly expressed genes in wild-type dry seeds enriched ABREs
- ACGT-containing ABREs are less operative in the abi5 seed
- Intensity and kinetics of ABRE-mediated gene expression during germination
- Combined effects of ABRE and other motifs
- Co-regulation of chromosomal gene clusters in the seed
- Discussion
- Stored mRNA species and phase transition in the seed
- Cis-acting elements determining transcriptome of stored mRNA
- Co-expression is frequently observed in seed
- Large silent regions display characteristics of heterochromatin
- In silico analysis: a powerful tool for dissecting expression profiles
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
We next tested whether the transcriptome of dry seed reflects the characters of known cis-elements. Previous promoter analyses have demonstrated that a single ABRE provides relatively weak transcriptional activation, while multiple copies of ABRE confer a stronger ABA responsiveness (Shen and Ho, 1995). To examine whether seed transcriptome contains this character, we surveyed the effect of copy number of the selected ABRE on expression level in the dry seed. The percentage of genes with a single, double or triple or more repeat(s) of the ABRE among genes with various expression levels was compared with the respective overall rates (see Experimental procedures, Table S5). Genes with high expression were more likely to contain multiple ABREs (Figure 3a, Col). The degree of enrichment of ABRE in the abi5 seed was generally lower than in wild type, however Figure 3(a) clearly shows that the multiple copies of ABREs were functional even in abi5.
The coupling element is not regarded as a mediator of ABA signals, but is rather an enhancer of ABA-inducible expression via ABRE in the seed (Shen et al., 1996). The combined effects of ABRE and CE motifs on high-level expression were also examined in the same manner. We selected the 6-bp sequence CACCG(A/C) as representative of CE, which has been reported as the target sequences of ZmABI4 (Niu et al., 2002). The effect of CE(a) (CACCGA) alone on high-level expression was relatively small in the seed [Figure 3b, CE(a)]. Genes containing both ABRE and CE(a) showed up to sevenfold higher frequency relative to the random percentage [Figure 3b, CE(a)+ABRE]. A similar pattern was also observed when the CE(c) (CACCGC) sequence was present (data not shown). These analyses are also consistent with the previous promoter analysis (Shen et al., 1996).
The RY/Sph motif has been reported to regulate seed-specific gene expression (Bäumlein et al., 1992; Hattori et al., 1992). We surveyed the frequency of typical 7-bp RY sequences, CATGCAT [RY(t)] and CATGCAA [RY(a)], in wild-type dry seed. Analysis of RY(a) showed that the motif was moderately overrepresented in highly expressed genes in the dry seed (Figure 3b). However, analysis of upstream regions of highly expressed genes did not show significant increase in the percentage of RY(t) motifs (data not shown). The combined effects of RY(a) and ABRE showed up to 12-fold enrichment in highly expressed gene sets [Figure 3b, RY(a)+ABRE]. A similar pattern, but to a less extent than in RY(a), was observed with RY(t) (data not shown). These results suggest that the association of RY motif with ABRE facilitates strong gene induction in the seed.
We performed the same analysis on the abi5 and abi4 mutants, comparing them with wild type. As shown in Figure 3(b), there was only weak overrepresentation of ABRE in abi5. The frequency of CE alone in highly expressed genes in abi5 was similar to that in wild type, however frequency of the genes containing both CE and ABRE was apparently low. Analyses of RY(a) in the abi5 seed showed a slight less frequency in the highly expressed groups, although ABI5 does not bind directly to the RY motif. Finally, the combination of RY(a) and ABRE was reduced in genes with a high-level expression in abi5. Slight reduction of the effect of ABRE on high-level gene expression was also observed in the abi4 seed (Figure 3b). Although the effect of CE alone on expression levels in abi4 seed was similar to that in wild type and abi5, the effect of the abi4 mutation on the percentage of highly expressed genes containing both CE and ABRE was visible. The RY motif also occurred less frequently in highly expressed genes, and the frequency of RY and ABRE was largely reduced in the abi4 mutant. These results showed fluctuations in the effect of cis-acting motifs, with ABRE having the most noticeable influence in the seed.
Co-regulation of chromosomal gene clusters in the seed
- Top of page
- Summary
- Introduction
- Results
- Gene expression profiles of dry and germinating seeds
- Upstream regions of highly expressed genes in wild-type dry seeds enriched ABREs
- ACGT-containing ABREs are less operative in the abi5 seed
- Intensity and kinetics of ABRE-mediated gene expression during germination
- Combined effects of ABRE and other motifs
- Co-regulation of chromosomal gene clusters in the seed
- Discussion
- Stored mRNA species and phase transition in the seed
- Cis-acting elements determining transcriptome of stored mRNA
- Co-expression is frequently observed in seed
- Large silent regions display characteristics of heterochromatin
- In silico analysis: a powerful tool for dissecting expression profiles
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
We also found co-regulation of adjacent genes occurred in the seed. In many cases, three to five adjoining genes showed related expression patterns after seed imbibition (Figure 4a). For example, four genes from At1g01080 to At1g01110 underwent upregulation regardless of their expression level, while other sets of genes (At1g01160–At1g01180, and from At1g01230 to At1g01250) were downregulated during this period. Co-regulation of some neighboring genes was verified by semiquantitative RT-PCR (Figure 4b). The tendency was observed overall on chromosomes. The frequency of three-gene-clusters showing co-downregulation and co-upregulation after 24 h imbibition (149 and 136 clusters, respectively) in 3701 genes was significantly higher than that in the same number of genes chosen randomly [84.8 ± 9.8 (t = 100), see Experimental procedures].
Clustered expression patterns were also observed in dry seed of the abi4 and abi5 mutants. These mutants displayed some variations in expression pattern compared with wild type, and especially in the abi5 mutant. The overall pattern of expression clusters was, however, mostly similar in wild type and in both mutants as shown in Figure 5(a,b). This suggests that changes in gene expression observed in these mutants were not due to the progression of developmental stages, but rather the direct effect of abi mutations.
We then surveyed the chromosomal location of the genes whose expression was altered by abi mutations. Co-regulated gene clusters were commonly composed of three to five genes, therefore, we assessed the frequency of downregulated gene pairs located within a distance of five genes. In abi5 dry seed, 527 genes exhibited at least twofold decrease in expression level compared with wild type. Among them, 77 downregulated genes (14.6%) had another downregulated gene nearby. In contrast, in abi4 dry seed only 18 of 309 downregulated genes (5.8%) were co-located. This trend was also found among upregulated genes. Ninety-five of 716 upregulated genes (13.3%) in abi5 dry seed were co-located, compared with 25 of the 438 genes (5.7%) in abi4 dry seed. Genes whose expression was affected by the abi5 mutation co-localized more frequently than those affected by the abi4 mutation. This trend was more apparent when the distance of co-regulated genes was plotted. Co-location of abi5 downregulated genes was distance-dependent (Figure 5c), while this was not the case in abi4 downregulated genes. A similar trend was also observed for upregulated genes (Figure 5d). These results suggest that the effect of ABI5 in trans-activation of gene expression is not merely restricted to a single gene, but affects expression of other proximal genes.
Clusters of gene families were often observed on chromosomes, and some of these are targets of co-regulation by ABI5 (Table S6). The downregulated clusters of genes include Lea proteins, 2S seed storage proteins and dehydrins, while the upregulated gene clusters include two sets of NAM transcription factors and genes related to metabolism.
We also found a number of large gene clusters that were hypo-expressed in dry seeds, each composed of more than 10 genes, and these clusters are located on many different chromosomes (data not shown). In most cases, these regions contain transposon/retro-transposon-related genes, pseudogenes or tandem/inverted repeats of the gene family(-ies), such as serine/threonine protein kinases or germins. In particular, we found five silent regions which cover more than 1 Mb. Transposons and retrotransposons in plants are normally methylated, leading to the formation of heterochromatin structures and transcriptional silencing (Hirochika et al., 2000; Kato et al., 2003). To examine whether these silent regions are methylated, Southern blot analysis was performed on genomic DNA isolated from dry seed, following digestion by either one of two isoscizomers HpaII or MspI. Transposon/retrotransposon-related genes as well as pseudogenes on different chromosomes all gave larger hybridization signals in HpaII digestion compared with those in MspI digestion, indicating that these genes were methylated (Figure 6a). Larger hybridization signals than expected were often detected, even after MspI digestion, which suggests dense methylation of these regions including non-CG methylation. Figure 6(b) shows methylation status in tandem repeats of gene families. Inhibition of HpaII digestion was observed for these genes. In contrast, hybridization patterns following digestion by both enzymes were unchanged when ABI5 and LEC1 were used as probes (Figure 6c). This indicates that the non-expressed LEC1 as well as the highly expressed ABI5 were free from methylation in the seed.
Stored mRNA species and phase transition in the seed
- Top of page
- Summary
- Introduction
- Results
- Gene expression profiles of dry and germinating seeds
- Upstream regions of highly expressed genes in wild-type dry seeds enriched ABREs
- ACGT-containing ABREs are less operative in the abi5 seed
- Intensity and kinetics of ABRE-mediated gene expression during germination
- Combined effects of ABRE and other motifs
- Co-regulation of chromosomal gene clusters in the seed
- Discussion
- Stored mRNA species and phase transition in the seed
- Cis-acting elements determining transcriptome of stored mRNA
- Co-expression is frequently observed in seed
- Large silent regions display characteristics of heterochromatin
- In silico analysis: a powerful tool for dissecting expression profiles
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
Mature dry seeds contain large amounts of mRNAs that are not only reservoirs from embryogenesis and seed maturation but also a provision for germination. Previous reports have identified only a subset of highly expressed mRNA species, such as the genes for seed reserve synthesis, Lea proteins, protein synthesis and degradation (Dal Degan et al., 1994; Dietrich et al., 1989). Our microarray analysis describes mRNA species representing more than half of all genes. This number was also equivalent to the transcriptome reported in other organs (http://www.arabidopsis.org/info/expression/ATGenExpress.jsp). Transcripts from all the ontological categories were included. However, among the 2–3% genes (approximately 500 genes) with the highest expression in the dry seed, genes for metabolism as well as protein synthesis and degradation were overrepresented (Figure S1, Table S7). Ribosomal proteins, translation initiation and elongation factors were especially conspicuous. This may reflect the indispensability of protein synthesis in early seed germination, and is consistent with the observation that germination process is completely halted by cycloheximide application (Rajjou et al., 2004). Interestingly, abundant mRNA species include the ones not only for the functionally described proteases/peptidases but also for proteins related to the ubiquitin/proteasome system. This suggests involvement of selective proteolysis during early seed germination, in addition to the massive breakdown in protein bodies.
Early studies using cotton seeds have suggested that long-lived mRNA can contribute to protein synthesis during early stages in germination (Dure and Waters, 1965). In addition to the role of long-lived mRNA, our expression profiling revealed that transcripts of a number of genes (Figure 1a, group 4) exhibited a transient accumulation within 6 h after imbibition (Figure 1a). Indeed, the association of ABRE with high gene expression that was observed in the dry seed was not seen in the 6 h-imbibed seed, showing that the mRNA profile is largely altered within this short period.
Cis-acting elements determining transcriptome of stored mRNA
- Top of page
- Summary
- Introduction
- Results
- Gene expression profiles of dry and germinating seeds
- Upstream regions of highly expressed genes in wild-type dry seeds enriched ABREs
- ACGT-containing ABREs are less operative in the abi5 seed
- Intensity and kinetics of ABRE-mediated gene expression during germination
- Combined effects of ABRE and other motifs
- Co-regulation of chromosomal gene clusters in the seed
- Discussion
- Stored mRNA species and phase transition in the seed
- Cis-acting elements determining transcriptome of stored mRNA
- Co-expression is frequently observed in seed
- Large silent regions display characteristics of heterochromatin
- In silico analysis: a powerful tool for dissecting expression profiles
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
Synthesis of subsets of mRNAs that are stored in the seed is known to be positively regulated by ABA. ABA-mediated gene expression is in turn regulated through several distinct ABREs, however these ABREs are not necessarily involved in all ABA-mediated processes. Our expression profiling identified a G-box-like CACGTG sequence as the most prominent ABRE in dry seed transcriptome. The enrichment of this element is well correlated with the endogenous ABA level that dramatically decreases within 6 h, almost reaching the basal level by 12 h (Figure 2b,c). In addition, similar sequences were more frequent in downregulated genes in abi5 seeds (Table 1 and Tables S1–S4). Furthermore these sequences are overrepresented in upregulated genes in seeds imbibed in 30 μm ABA (data not shown). These sequences play an important role in ABA-induced synthesis of embryonic mRNA in the seed.
Downregulated genes in abi5 dry seed displayed overrepresented numbers of ABRE in the upstream region. However, upregulated genes in this mutant seed exhibited no significant overrepresentation of ABRE. This suggests that ABI5 contributes mostly to activation of ABRE-mediated transcription rather than repression.
Our analyses also showed that multiple copies of ABRE and combining CE or RY with ABRE results in increased expression in seed (Figure 3a,b), which is consistent with previous promoter analyses (Shen et al., 1996; Vasil et al., 1995). By monitoring the expression of 22 000 genes in the seed, these effects were confirmed to be functional in vivo. Although a contribution of CE and RY to regulation of gene expression was observed, these elements showed only weak correlation with high-level expression in wild-type seed, when analyzed individually (Figure 3b). A statistical cis-element search in downregulated genes in abi4 seed did not identify CE (data not shown). More research will be needed to clarify the role and function of the ABI4 and CEs. However, recent comprehensive expression analysis using Arabidopsis overexpressing maize VP1 showed that VP1-regulated genes were enriched in ABRE (Suzuki et al., 2003). Similarly, our expression analysis of the abi3-8 mutant seed (Nambara et al., 2002) showed an increased occurrence of ABRE, but not RY, in downregulated genes in abi3 relative to the wild type (K. Nakabayashi and E. Nambara, unpublished data). Collectively, our analyses indicate that CE and RY play a role in vivo in controlling ABRE-mediated gene expression in seed.
Co-expression is frequently observed in seed
- Top of page
- Summary
- Introduction
- Results
- Gene expression profiles of dry and germinating seeds
- Upstream regions of highly expressed genes in wild-type dry seeds enriched ABREs
- ACGT-containing ABREs are less operative in the abi5 seed
- Intensity and kinetics of ABRE-mediated gene expression during germination
- Combined effects of ABRE and other motifs
- Co-regulation of chromosomal gene clusters in the seed
- Discussion
- Stored mRNA species and phase transition in the seed
- Cis-acting elements determining transcriptome of stored mRNA
- Co-expression is frequently observed in seed
- Large silent regions display characteristics of heterochromatin
- In silico analysis: a powerful tool for dissecting expression profiles
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
In addition to regulation of individual genes, our analysis showed that genes for which mRNA is stored in the seed are more often clustered on a chromosome rather than dispersed randomly. Stored mRNA from gene clusters of three to five genes (15–20 kb) was observed in the dry seed of wild type, abi4 and abi5 mutants (Figure 5a,b). Interestingly, our kinetic expression profiling revealed that proximal genes tend to be co-regulated during seed germination (Figure 4). Gene clustering was frequently observed throughout all chromosomal regions. Collectively, this indicates that regulation of both individual genes and regions containing multiple genes determines the transcriptome of dry and imbibed seed. A number of the clustered gene families include genes for 2S albumins and Lea proteins (Table S6). The expression of these genes is coordinated during seed maturation and late embryogenesis. Therefore, gene clustering may contribute to such developmentally controlled expression.
Our analysis reveals that in abi5 dry seed, both upregulated and downregulated genes tend to be located in clusters (Figure 5c,d). This is not the case in abi4 dry seed. This suggests that ABI5-dependent transcriptional regulation may contribute at least in part, to clustered gene expression. The upregulated genes in abi5 do not enrich the ABRE, but they tend to be located proximally. Accordingly, the effect of ABI5 on regulation of gene clusters does not necessarily require ABRE binding. Some transcription factors are known to interact with nucleosome remodeling factors, therefore it is possible that ABI5 or its complex may influence chromosomal structure. On the contrary, ABI4-dependent regulation does not appear to affect expression of neighboring genes in dry seed (Figure 5c,d).
Large silent regions display characteristics of heterochromatin
- Top of page
- Summary
- Introduction
- Results
- Gene expression profiles of dry and germinating seeds
- Upstream regions of highly expressed genes in wild-type dry seeds enriched ABREs
- ACGT-containing ABREs are less operative in the abi5 seed
- Intensity and kinetics of ABRE-mediated gene expression during germination
- Combined effects of ABRE and other motifs
- Co-regulation of chromosomal gene clusters in the seed
- Discussion
- Stored mRNA species and phase transition in the seed
- Cis-acting elements determining transcriptome of stored mRNA
- Co-expression is frequently observed in seed
- Large silent regions display characteristics of heterochromatin
- In silico analysis: a powerful tool for dissecting expression profiles
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
Aside from observing cis-acting regulatory elements in gene expression, we also observed a number of silent regions throughout chromosomes. Although the size of such regions varies considerably, clusters became apparent where hypo-expressed genes (evaluated as ‘absent’ expression) were located in groups of more than 10 genes (approximately 50 kb). Most of these silent regions contain either pseudogene(s), retrotransposon(s) or clustered homologous genes. These traits are reminiscent of heterochromatin regions where a large number of proximally located genes are repressed. Methylation is often linked with heterochromatin that is associated with repetitive sequences including the centromere, transposable elements and rRNA repeats (Bender, 2004). Our results clearly illustrate that the large silent genomic regions are heavily methylated (Figure 6a). Most methylated regions contain more than 100 transposon-related genes, and in some cases almost 300 genes are sequentially linked, and map to the centromeric regions. The sizes of these large clusters are well matched to the estimated sizes of Arabidopsis centromeres (Haupt et al., 2001). Thus, these genes may be silenced transcriptionally by heterochromatin structure.
The genes in repeated non-transposon arrays were also targets of methylation (Figure 6b). In contrast to the centromeric retrotransposon elements, in which only a small number of fairly large bands were detected in Southern analysis of HpaII-digested DNA, CG methylation in the non-transposon genes or in the clusters was restricted. This restricted methylation pattern is similar to that observed in SUP and FWA genes and it is reported that hypomethylation of these genes leads to an increase in their expression (Jacobsen et al., 2000; Soppe et al., 2000). Interestingly, the silent cluster of serine/threonine protein kinases that we selected was found to be expressed in axillary buds (K. Tatematsu and E. Nambara, unpublished data). This suggests that this gene (or gene clusters) may be transcriptionally silenced in the seed as documented elsewhere (reviewed in Finnegan et al., 2000; Lohe and Chaudhury, 2002).
Our analyses suggest that expression profiles in Arabidopsis seeds during different phases of development are defined by multiple regulatory mechanisms: epigenetic heterochromatin structures and conditional silencing by methylation, unknown factor(s) controlling expression of clustered genes, and individual cis-acting elements.
In silico analysis: a powerful tool for dissecting expression profiles
- Top of page
- Summary
- Introduction
- Results
- Gene expression profiles of dry and germinating seeds
- Upstream regions of highly expressed genes in wild-type dry seeds enriched ABREs
- ACGT-containing ABREs are less operative in the abi5 seed
- Intensity and kinetics of ABRE-mediated gene expression during germination
- Combined effects of ABRE and other motifs
- Co-regulation of chromosomal gene clusters in the seed
- Discussion
- Stored mRNA species and phase transition in the seed
- Cis-acting elements determining transcriptome of stored mRNA
- Co-expression is frequently observed in seed
- Large silent regions display characteristics of heterochromatin
- In silico analysis: a powerful tool for dissecting expression profiles
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
The accumulation of microarray data has provided us with a large amount of information on gene expression. In many cases, however, microarray data have been used to show a change in the expression level of specific genes with a particular treatment or a difference between wild-type and mutant plants. One of our approaches to fully utilize expression data from GeneChip analyses was to display the distribution of expression levels of 22 000 Arabidopsis genes without comparison to other GeneChip data as shown in Figures 2 and 3. Based on the distribution of these expression levels, particular ontological categories harboring selected single or multiple cis-acting elements were shown to be disproportionately represented in subsets with particular expression levels.
In addition, microarray analysis of 22 000 Arabidopsis genes enabled us to examine gene expression as influenced by chromosomal location. It is generally assumed that specific cell types will have specific patterns of gene clustering as in Arabidopsis root cells (Birnbaum et al., 2003). However, in this study, we could see both genetic and epigenetic regulation within the transcriptome of stored RNA in dry and imbibed seeds by analyzing the whole seed (without any separation of particular cell types). This may be due to the simplicity of the seed structure. In addition, there may be few differences in the composition of cell types in dry and 24 h-imbibed seeds, as radicle protrusion was initiated only after 30–36 h of imbibition. Our expression profiling demonstrates that seed is an excellent system in which to study epigenetic regulation using microarray-based expression profiles.
A number of detailed studies of ABA-inducible promoters have identified ABRCs that are necessary and sufficient for ABA-induced transcription. These previous in vitro and transient expression studies provided the basis that enabled us to use the ABI5/ABRE system as a model to evaluate our expression profiling. With validated data sets and analytical conditions, our results describe the transcriptional profile of the seed, and verify previous results on in vitro expression patterns by showing that these also occur in in vivo situations. The next challenge is to identify new cis-acting elements in novel mutants using our profiling system.