Summary
- Top of page
- Summary
- Introduction
- Results
- Discussion
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
To overcome the detection limits inherent to DNA array-based methods of transcriptome analysis, we developed a real-time reverse transcription (RT)-PCR-based resource for quantitative measurement of transcripts for 1465 Arabidopsis transcription factors (TFs). Using closely spaced gene-specific primer pairs and SYBR® Green to monitor amplification of double-stranded DNA (dsDNA), transcript levels of 83% of all target genes could be measured in roots or shoots of young Arabidopsis wild-type plants. Only 4% of reactions produced non-specific PCR products. The amplification efficiency of each PCR was determined from the log slope of SYBR® Green fluorescence versus cycle number in the exponential phase, and was used to correct the readout for each primer pair and run. Measurements of transcript abundance were quantitative over six orders of magnitude, with a detection limit equivalent to one transcript molecule in 1000 cells. Transcript levels for different TF genes ranged between 0.001 and 100 copies per cell. Only 13% of TF transcripts were undetectable in these organs. For comparison, 22K Arabidopsis Affymetrix chips detected less than 55% of TF transcripts in the same samples, the range of transcript levels was compressed by a factor more than 100, and the data were less accurate especially in the lower part of the response range. Real-time RT-PCR revealed 35 root-specific and 52 shoot-specific TF genes, most of which have not been identified as organ-specific previously. Finally, many of the TF transcripts detected by RT-PCR are not represented in Arabidopsis EST (expressed sequence tag) or Massively Parallel Signature Sequencing (MPSS) databases. These genes can now be annotated as expressed.
Introduction
- Top of page
- Summary
- Introduction
- Results
- Discussion
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
Transcription factors (TFs) are master-control proteins in all living cells. They often exhibit sequence-specific DNA binding and are capable of activating or repressing transcription of multiple target genes. In this way, they control or influence many biological processes, including cell cycle progression, metabolism, growth and development, and responses to the environment. As many TFs are themselves regulated at the level of transcription (Chen et al., 2002), knowing where and when TF genes are transcribed, and how such transcription is affected by internal or external cues can be valuable in elucidating the specific biological roles of the cognate proteins.
With the completion of the Arabidopsis thaliana genome sequence, it became possible for the first time to carry out a census of putative TFs in a higher plant. The Arabidopsis genome contains about 30 000 annotated loci (http://www.arabidopsis.org), approximately 5–6% of which code for putative TFs (Davuluri et al., 2003; Ratcliffe and Riechmann, 2002; Riechmann et al., 2000). Less than 10% of these have been characterized genetically. Given that a large proportion (approximately 40%) of Arabidopsis genes remain to be annotated with regard to function (AGI, 2000), it is likely that the number of TF genes will increase; in fact novel classes of TFs are still being discovered (Riechmann, 2002). TF genes are generally expressed at low levels in plants, frequently in a cell-type or tissue-specific manner, and often only transiently during development (e.g. LEAFY (Weigel et al., 1992); SHATTERPROOF1 and 2 (Liljegren et al., 2000); WUSCHEL (Mayer et al., 1998) or MONOPTEROS (Hardtke and Berleth, 1998). Although DNA and oligonucleotide arrays, such as Affymetrix chips, that contain most of the predicted genes of Arabidopsis are now available for transcriptome analysis, it is likely that the transcripts of many TF genes will be difficult to detect and quantify with DNA array technologies. Reverse transcription (RT)-PCR is estimated to be at least 100-fold more sensitive than DNA arrays in detecting transcripts (Horak and Snyder, 2002). In yeast, for instance, kinetic or real-time RT-PCR was able to detect transcripts of virtually all TF genes, which varied in abundance by over four orders of magnitude. In contrast, DNA arrays were unable to detect most yeast TF transcripts in a reliable manner (Holland, 2002). The limitations of DNA arrays for TF transcript detection are likely to be even greater in Arabidopsis, which contains a large number of different cell types, only a fraction of which will express a particular TF, e.g. WUS (Mayer et al., 1998). For this reason, we have developed a library of more than 1400 PCR primer (oligonucleotide) pairs that can be used to quantify transcripts of the majority of TF genes in Arabidopsis by real-time RT-PCR. Using these primers, together with SYBR® Green and an ABI PRISM® 7900HT 384-well-plate PCR system, we are able to measure the abundance of virtually all Arabidopsis TF transcripts (via cDNA) in the same sample in a single day.
Here, we present the first results obtained with this new resource. Besides providing the first comprehensive estimate of the range of TF transcript levels in Arabidopsis, we identify 36 putative root-specific and 52 putative shoot-specific TF genes in Arabidopsis, which may play important roles in the development or function of these distinct organs. In addition, a comparison between real-time RT-PCR and Affymetrix chip technology for measuring gene transcript levels is made, which highlights the value of this new resource with respect to its sensitivity and its ability to provide quantitative data.
Discussion
- Top of page
- Summary
- Introduction
- Results
- Discussion
- Experimental procedures
- Acknowledgements
- Supplementary Material
- References
- Supporting Information
We have developed a unique public resource for studying the expression of transcription factor genes in Arabidopsis. This resource, which is based upon highly multiplexed real-time RT-PCR with gene-specific primers, enabled us to measure transcript levels in roots or shoots of Arabidopsis seedlings for 1247 TF genes with high specificity and precision. Single PCR products of the expected size were obtained following RT-PCR for all of these genes, and sequencing of a subset of them confirmed the specificity of each PCR. Four per cent of the 1465 different TF RT-PCRs yielded more than the single expected product. Synthesis of new primer pairs should enable specific measurements to be made on the transcripts of these genes in the future.
Approximately 13% of TF gene transcripts were not detected in samples of roots or shoots of vegetative plants grown under the conditions used in these experiments. Of these, about a quarter of the genes have primers that do not span exon–exon junctions. All primer pairs tested from this subset yielded unique PCR products of the expected size from genomic DNA as template, showing that the primers have been correctly designed and do function. This indicates that these genes are expressed at extremely low levels or not at all under these conditions. Transcripts of another third of these genes have meanwhile been detected in Arabidopsis siliques or in seedlings exposed to various nutrient stresses (A. Blacha, T. Czechowski, W.-R. Scheible and M. Udvardi; unpublished results).
The sensitivity and robustness of TF transcript quantification by real-time RT-PCR were outstanding. As few as two copies of a target DNA could be detected in a complex mixture of 109 cDNA molecules (Figure 2a). This corresponds to a detection limit of about one transcript per 1000 cells, or 0.001 transcripts per cell which is similar to values obtained for yeast (Holland, 2002). In contrast, detection limits of DNA arrays are three orders of magnitude higher, at one transcript per cell (Holland, 2002; Horak and Snyder, 2002). Robustness of cDNA quantification was demonstrated in a second way: a linear relationship between output signal (
) and target cDNA amount was maintained over a wide range of mixtures of root and shoot cDNA (Figure 2b). Such robustness has never been shown for DNA arrays, to our knowledge. Precise quantification of transcripts by real-time RT-PCR depends upon having uniformly high amplification efficiency, or having a method to determine the amplification efficiency for each individual PCR. The latter was achieved using the method described by Ramakers et al. (2003). This allows the amplification efficiency to be determined for each technical and biological replicate, and the relative transcript abundance to be calculated accordingly. The technical precision of real-time RT-PCR measurements of TF transcript levels was high. Very low intra-assay variation was observed in duplicate measurements of the same pool of cDNA, made in separate runs on the PCR machine (Figure 3a). Interassay variation was estimated by measuring cDNA produced from two separate RT reactions that began from the same sample of RNA. As expected, interassay technical variation was slightly higher than intra-assay variation (Figure 3b). Interassay variability of Affymetrix chips was greater than that of real-time RT-PCR (Figure 3c,d), especially for genes expressed at low levels. The signal to noise ratio for hybridization-based methods of transcript detection is known to decrease exponentially with decreasing amounts of transcript (Holland, 2002; Figure 3c,d). This was not the case for real-time RT-PCR measurements, although variability in duplicate measurements increased slightly as TF transcript levels decreased in our experiments (Figure 3a,b).
Real-time RT-PCR indicated that TF transcript levels in Arabidopsis range over five orders of magnitude (for example, see Figure 5). Such a range in TF gene expression levels has never before been reported for plants. Presumably, this great range reflects not only differences in the expression level of different TF genes within any one cell-type, but also differences between cells of different tissues and organs. Given their role(s) as regulators of gene expression, it is to be expected that many TF genes will be expressed in a precise spatial and temporal manner in response to developmental and/or environmental cues. TF genes that orchestrate developmental transitions are known to be amongst the lowest expressed of all genes, and transcripts of these genes are often only detectable by RT-PCR or RNA in situ hybridization (Long et al., 1996; Mayer et al., 1998; Putterill et al., 1995; Siegfried et al., 1999). The most-highly expressed TF genes are presumably transcribed constitutively throughout the plant. Some of these may bind non-specifically to DNA. We are aware that not all of the genes that we have targeted are necessarily TF genes. These genes were selected because they encode DNA-binding and other domains that are shared by TF proteins, which does not necessarily mean that they are transcription factor genes. Nonetheless, it is interesting to compare the range of transcript levels that we measured for TFs in Arabidopsis with that measured using the same technique in yeast. Levels of TF transcripts in the single-celled yeast Saccharomyces cerevisiae varied over four orders of magnitude (Holland, 2002), which is one order of magnitude less than that observed by us in the more complex, multicellular plant.
It is also interesting to compare the data on TF transcript abundance obtained by real-time RT-PCR with those obtained for the same RNA samples using Affymetrix chips (Figures 3 and 4). The range of values obtained with real-time RT-PCR was two orders of magnitude greater than that obtained with Affymetrix chips (105 versus 103). As shown above, real-time RT-PCR yields a constant ΔCT for each X-fold change in initial DNA concentration over the whole range of detectable DNA concentrations (Figure 2a). This is not true for DNA array-based methods, which suffer from an exponential decrease in signal intensity as transcript levels fall, because of second order kinetics of hybridization (Holland, 2002). This could account for the narrower range of values obtained with Affymetrix chips compared to real-time RT-PCR (Figure 4).
Although real-time RT-PCR exhibited greater precision in replicate measurements than Affymetrix chips, this does not necessarily imply greater accuracy. To address the issue of accuracy directly, we used both methods to identify TF genes with extreme shoot to root expression ratios and compared these data with that available in an Arabidopsis MPSS database. MPSS represents an alternative means by which to estimate the relative abundance of gene transcripts in a particular organ. Like serial analysis of gene expression (SAGE; Velculescu et al., 1995), MPSS (Brenner et al., 2000a,b) generates short sequence tags produced from a defined position within an mRNA, and the relative abundance of these tags in a given library represents a quantitative estimate of expression of that gene. The Arabidopsis MPSS data set contained 3 645 414 tags from a root cDNA library and 2 885 229 tags from shoots. As described above, there was good qualitative agreement between real-time RT-PCR and the MPSS data (Table 1).
We also compared the quantitative accuracy of real-time RT-PCR and Affymetrix chips. A plot of the absolute signals given by the two methods revealed a rather weak correlation in the range corresponding to highly expressed genes and no correlation for genes expressed at lower levels (Figure 4). Unlike quantitative RT-PCR, hybridization-based technologies like Affymetrix chips are qualitative, and there is no strict linear relationship between signal strength and transcript amount for different genes. Thus, it is not possible to conclude with confidence that transcripts of one gene are more abundant than transcripts of another gene, simply based on greater signal strength in the former case on an Affymetrix chip. It is generally assumed that it will not affect the reliability of conclusions drawn from the changes in the Affymetrix signal for a given gene across different chips, i.e. the Affymetrix chips do provide reliable information about the relative levels of a transcript in different tissues or conditions. To check this, we compared the S/R ratios for all of the TFs that we measured, calculated from real-time RT-PCR data and from Affymetrix arrays (Figure 6a). Indeed, the agreement was good, provided abundantly expressed transcripts were compared (Figure 6b,c). This confirms the accuracy and reliability of both methods. For about half of the TFs, however, the signal obtained by the Affymetrix technology was in a range where accurate results could not be obtained (Figure 6b,c). As already indicated, these discrepancies were most widespread for genes that show a low signal on the arrays.
This problem may not be unique to TFs. In fact, for any given sample, the fraction of Arabidopsis genes that are labelled absent by Affymetrix software typically is 30–40%, and 40–45% of the genes typically display normalized expression signals <32 at a target normalization value of 100 (Figure 7). As observed for the TF genes, additional families of genes may contain a considerably higher than normal fraction of low-expressed members, or members with cell-type specific expression patterns. For example, inspection of our Affymetrix data sets indicated that 56% of the approximately 600 annotated receptor kinases (Shiu and Bleecker, 2001) yielded Affymetrix signals <32 (see Figure 7) when probed with cDNA from shoots and roots. As observed for TFs, the receptor kinases are under-represented amongst the highly expressed genes and over-represented amongst the more lowly expressed genes (Figure 7). A similar picture emerged for the large family of cytochrome P450 genes (not shown). Therefore, dedicated analyses of these and other gene families may benefit from a real-time RT-PCR approach similar to the one that we have taken for TFs.
TFs control many aspects of plant growth and development by regulating the expression of sets of target genes. Many TF genes are also regulated, in time and space, by internal and/or external cues. Thus, it should be possible to identify TF genes involved in important plant processes through ‘Guilt by Association’. To identify TF genes that may play roles in root- or shoot-specific processes, we compared transcript levels of 1214 TF genes in these organs (Figure 5). Approximately 7% (87) of the TF genes repeatedly exhibited greater than 20-fold differences in expression in shoots compared to roots (Table 1). Seventy-three of these were represented in the Arabidopsis MPSS data, and as mentioned above, almost all of these were confirmed as essentially root or shoot specific.
There is no published information on the majority of the 87 shoot- or root-specific genes that we identified by real-time RT-PCR (Table 1 and Table 2). Only 14 of the 52 shoot-specific genes have been characterized to some extent in the past. Eight of these were found to be expressed predominantly or exclusively in shoots. These include AGL2/SEP1 (AT5G15800), YAB3 (AT4G00180), YAB1/FIL (AT2G45190), ATH1 (AT4G32980), WUS (AT2G17950), SPL3 (AT2G33810), SPL4 (AT1G53160) and SPL5 (AT3G15270). Most of these genes have been implicated in plant development. AGL2/SEP1 is expressed in floral meristems, floral primordia and ovules, and plays a central role in controlling organ identity, such as the development of petals, stamens and carpels (Pelaz et al., 2000). YAB3 is expressed in all above-ground organs but not in roots, and specifies abaxial tissue development in lateral organs (Siegfried et al., 1999). YAB1/FIL is expressed in above-ground vegetative and reproductive meristems and is required for the growth and maintenance of inflorescence and floral meristems (Sawa et al., 1999). SPL3, SPL4 and SPL5 are expressed in aerial organs, especially in the inflorescence, and control flowering and other aspects of plant development (Cardon et al., 1997, 1999). Other shoot-specific genes from Table 1 that have been described in the literature are: ATH1, which is involved in photomorphogenesis (Quaedvlieg et al., 1995); and two genes involved in phytochrome B signalling, PIF4 (Huq and Quail, 2002) and PIL6 (Yamashino et al., 2003). Shoot-specific expression of the latter two genes has not been reported previously.
Three genes that we identified as shoot specific encode well-known stress-response regulators: CBF1/DREB1B (AT4G25490), CBF2/DREB1C (AT4G25470) and ERF2 (AT5G47220). Expression of the two CBF genes, which regulate adaptive responses to cold stress, is induced dramatically by chilling (Medina et al., 1999; Shinwari et al., 1998). However, under non-stress conditions, CBF1 and CBF2 transcripts were barely detectable in shoots or roots (Medina et al., 1999; Shinwari et al., 1998). Our results indicate that the basal or non-induced level of expression of these genes is significantly greater in shoots than in roots, which makes biological sense because the shoot is exposed to more rapid changes in temperature than the root is. ERF2 is involved in signal reception of ethylene-mediated signalling pathways and also shows modest induction by cold stress (Fujimoto et al., 2000). The WUS homeodomain TF gene is expressed in very few cells of the shoot apical meristem during embryogenesis, vegetative growth and flower development, and determines the fate of meristem stem cells (Mayer et al., 1998).
Of the 35 root-specific genes we identified (Table 2), only two have been characterized in the past, namely AGL21 (AT4G37940) and AGL17 (AT2G22630). Based on their root-specific expression patterns, roles in root development have been proposed for those two AGL genes (Burgeff et al., 2002; Rounsley et al., 1995). Other AGL genes have also been characterized as root specific (Alvarez-Buylla et al., 2000; Burgeff et al., 2002; Rounsley et al., 1995), including AGL14 (AT4G11880) and AGL19 (AT4G22950). We found transcript levels of both to be approximately 10 times higher in roots than that in shoots (Supplementary Material, Table S1).
Many of the reported genes that we identified as shoot specific appear to be involved in developmental processes. This may simply reflect the way in which most TF genes have been isolated to date, namely via genetic screens for aberrant growth and development. Defects in TF genes involved other plant processes, such as metabolism, may produce more subtle phenotypes, which are difficult to identify. Thus, many of the novel root- and shoot-specific genes that we have identified may eventually be implicated in processes other than development. Obviously, reverse genetics will play a central role in identifying functions for these genes.
Recently, an expression profile matrix for 400 Arabidopsis TF genes, derived from a series of Affymetrix chip experiments, was used to identify TF genes that may play roles in responses to different environmental stresses (Chen et al., 2002). Transcripts of about 10% of the genes were not detected under any of the conditions used in that study. Importantly, we detected expression of several of these genes in roots and/or shoots, using real-time RT-PCR (AT4G13480; AT1G73410; AT4G01500 and AT3G12820), which highlights the greater sensitivity of this technique. An interesting anomaly discussed in the paper by Chen et al. (2002) was the expression pattern of the TINY gene (AT5G25810), which was found by Affymetrix chip analysis to be expressed at high levels in roots but not at all in other organs. TINY is required for both vegetative and floral organogenesis (Wilson et al., 1996), which indicates that it is expressed in aerial parts of the plant. We were able to detect transcripts of this gene in both roots and shoots (sevenfold higher in roots than in shoots) using RT-PCR.
To summarize, we have created a resource for real-time RT-PCR profiling of almost 1500 Arabidopsis TF genes that, compared to existing technologies such as Affymetrix chips, increases significantly the sensitivity, precision and accuracy with which transcripts of these genes can be measured. This resource is also more flexible than other systems: we can add, remove or replace primer pairs at any time. For instance, we will re-design and replace primer pairs for those PCRs that yielded efficiencies lower than 0.5. On the other hand, we are also aware of a significant number of additional Arabidopsis genes that have been attributed a potential role as transcriptional regulators (http://www.arabidopsis.org; http://arabidopsis.med.ohio-state.edu; Davuluri et al., 2003; http://genetics.mgh.harvard.edu/sheenweb/AraTRs.html), and we plan to add primers to these genes to the existing set of TF primers in the near future.
We used real-time RT-PCR in this study to identify a considerable number of novel root- and shoot-specific TF genes, which may play important roles in development or other organ-specific processes. This information will be a valuable starting point for further research on these genes. In addition, we provide the first experimental evidence that the vast majority of the putative TF genes annotated in Arabidopsis are indeed expressed. This new resource will help to identify TF genes involved in numerous plant processes, including abiotic stress responses, an area that we are particularly interested in.