Automated Quantification of the Subcellular Localization of Multicompartment Proteins via Q-SCAn

Authors

  • Nicholas C. Bauer,

    1. Department of Biochemistry, Emory University School of Medicine, Atlanta, GA, USA
    2. Graduate Program in Biochemistry, Cell, and Developmental Biology, Emory University School of Medicine, Atlanta, GA, USA
    Search for more papers by this author
  • Anita H. Corbett,

    Corresponding author
    1. Department of Biochemistry, Emory University School of Medicine, Atlanta, GA, USA
    2. Winship Cancer Institute, Emory University School of Medicine, Atlanta, GA, USA
    Search for more papers by this author
  • Paul W. Doetsch

    Corresponding author
    1. Department of Biochemistry, Emory University School of Medicine, Atlanta, GA, USA
    2. Winship Cancer Institute, Emory University School of Medicine, Atlanta, GA, USA
    3. Department of Radiation Oncology, Emory University School of Medicine, Atlanta, GA, USA
    4. Department of Hematology and Medical Oncology, Emory University School of Medicine, Atlanta, GA, USA
    Search for more papers by this author

Abstract

In eukaryotic cells, proteins can occupy multiple intracellular compartments and even move between compartments to fulfill critical biological functions or respond to cellular signals. Examples include transcription factors that reside in the cytoplasm but are mobilized to the nucleus as well as dual-purpose DNA repair proteins that are charged with simultaneously maintaining the integrity of both the nuclear and mitochondrial genomes. While numerous methods exist to study protein localization and dynamics, automated methods to quantify the relative amounts of proteins that occupy multiple subcellular compartments have not been extensively developed. To address this need, we present a rapid, automated method termed quantitative subcellular compartmentalization analysis (Q-SCAn). To develop this method, we exploited the facile molecular biology of the budding yeast, Saccharomyces cerevisiae. Individual subcellular compartments are defined by a fluorescent marker protein and the intensity of a target GFP-tagged protein is then quantified within each compartment. To validate Q-SCAn, we analyzed relocalization of the transcription factor Yap1 following oxidative stress and then extended the approach to multicompartment localization by examining two DNA repair proteins critical for the base excision repair pathway, Ntg1 and Ung1. Our findings demonstrate the utility of Q-SCAn for quantitative analysis of the subcellular distribution of multicompartment proteins.

image

In eukaryotic cells, subcellular compartments are functionally defined by the proteins present within them. While many proteins are localized to a single compartment, some proteins are localized to multiple compartments either constitutively or in response to cellular signals. For example, many transcription factors are localized to the cytoplasm until a signal triggers nuclear import [1, 2]. Such mobilization of proteins is an efficient strategy to alter local function in response to stimuli because transport of a preexisting pool of protein is more rapid than the de novo synthesis and localization of a comparable amount of protein. Other proteins simultaneously play roles in multiple compartments such as some DNA repair proteins that are localized to both the nucleus and mitochondria to maintain the integrity of the genomes in each of these cellular compartments [3].

Despite the biological significance of localizing proteins to multiple subcellular compartments, tools for quantifying the relative subcellular distribution of multicompartment proteins have not been extensively developed. Many protein localization studies employ manual scoring from microscopy data, relying on the heterogeneity of the cell population and human visual detection to provide a useful threshold [4-7]. However, these implicit thresholds are subjective and the process can be very labor-intensive. In addition, manual methods are only semi-quantitative as they are based on qualitative data. True quantification can be achieved by manually tracing the boundaries of the compartments of interest and then quantifying pixels within each compartment, but the laborious nature of this type of analysis means the number of cells that can be analyzed is effectively limited. Colocalization analysis [8], which has advanced greatly over the last decade and is widely available in image analysis software, is more suited to addressing questions about whether proteins and markers are spatially linked rather than about the distribution of a protein among distinct compartments. Photobleaching [9] and photoactivation techniques can be employed to examine dynamics [10]; however, these techniques require highly specialized experimental setups and are also limited to larger cells amenable to such techniques. Biochemical fractionation techniques can also provide quantifiable compartmentalization information on a population of cells [4, 11, 12], but microscopy-based techniques are superior to fractionation because micrographs preserve the spatial relationships and yield information on the single-cell level, not just the population average.

The limitations of the above techniques form a critical impediment to analyzing the steady-state distribution of proteins localized to multiple compartments. Development of advanced, automatable techniques that provide unbiased quantification of protein localization on a per-cell basis is becoming an active area of research. We have developed an approach to quantifying protein distribution among multiple compartments, which we term quantitative subcellular compartmentalization analysis (Q-SCAn). This microscopy-based method uses brightfield DIC images to identify cells, relies on a set of fluorescent markers to define subcellular compartments, and provides information about the amount of a protein of interest, marked by a third fluorescent reporter, within the identified compartments. By comparing the fluorescence intensities for each compartment, a localization index is calculated for each cell, yielding a quantitative measure of protein localization. Furthermore, the distribution of these localization indices can be compared between different cell types, conditions and time points to address the regulation of protein localization.

Here we describe the development of Q-SCAn in Saccharomyces cerevisiae and demonstrate its utility in measuring the single-cell localization of proteins by following the oxidative stress-induced relocalization of the transcription factor Yap1 [13]. Next, we extend the approach to multicompartment localization by examining the nucleomitochondrial base excision repair (BER) protein Ntg1 [14]. Finally, we apply the method to evaluate the localization of another nucleomitochondrial BER protein, Ung1 [15], which has not been previously analyzed in any quantitative manner. Our analysis of Ung1 provides new biological information about mechanisms of localization of Ung1 and thus insight into regulation of the BER pathway, demonstrating the utility of Q-SCAn for such studies. This work presents a novel method for quantifying the subcellular distribution of multicompartment proteins which can be immediately put to use and extended without specialized equipment or programming experience.

Results

Automated quantification of subcellular protein localization: Q-SCAn

To address a critical gap in the techniques available to rapidly and reproducibly provide quantitative information about protein distribution among multiple cellular compartments, we have developed a method termed Q-SCAn. To develop this system, we exploited the budding yeast S. cerevisiae and focused on developing a system that could be used to quantify protein levels in the nucleus and mitochondria.

We designed a dual reporter for S. cerevisiae that could be integrated into the genome to create a reporter yeast strain with constitutively labeled nuclei and mitochondria. The reporter encodes spectrally distinct fluorescent proteins targeted to the nucleus and mitochondria via well-characterized targeting signals (Figure 1A). The nuclear reporter protein comprises the strong, artificial SV40 bipartite nuclear localization signal (NLS) [16] fused to the tandem dimer red fluorescent protein tdTomato [17]. This reporter protein is expressed from the low-level CYC1 constitutive promoter [18] and terminated by the NUF2 terminator [19]. The mitochondrial reporter protein is composed of the highly efficient Neurospora crassa Su9 mitochondrial matrix targeting signal (MTS) [20] fused to the cyan fluorescent protein mCerulean [21]. This reporter protein is expressed from the high-level TEF1 constitutive promoter [18] due to the relative dimness of mCerulean as compared to tdTomato, and is also terminated by the NUF2 terminator [19] (Figure 1A). These reporters were integrated into the S. cerevisiae LEU2 locus to create a constitutively labeled yeast reporter strain.

Figure 1.

Quantitative subcellular compartmentalization analysis (Q-SCAn). A) Schematic of the tandem nuclear and mitochondrial marker protein construct, which is integrated into the Saccharomyces cerevisiae LEU2 locus as described in Results. B) The Q-SCAn marker proteins specifically and uniformly localize to their respective compartments. The nuclear and mitochondrial fluorescent marker proteins (NLS–tdTomato, MTS–mCerulean) were expressed independently in S. cerevisiae cells and co-stained with DAPI (dye, upper panel) or MitoTracker (dye, lower panel), respectively, and subjected to colocalization analysis to generate a colocalization coefficient (Coloc. Coefficient: mean ± SD over five images). C) Example merged image of cells co-expressing both fluorescent marker proteins. Scale bar is 5 µm. D) Diagram of the flow of information in Q-SCAn. E) Result of the compartmentalization of the sample image in (C). Identified cells are outlined in white, identified nuclei are marked in orange, and identified mitochondria in cyan. Overlapping portions of compartments are excluded, and cells that do not have both markers expressed and visible in the optical slice are excluded. Scale bar is 5 µm. F) Schematic of the mitonuclear localization index. The index is generated by dividing the mean nuclear GFP intensity by the sum of the mean nuclear and mean mitochondrial GFP intensities. A value of 0 indicates that all of the signal is in the mitochondria, while a value of 1 indicates that all of the signal is in the nucleus, as depicted by the cell diagrams (nucleus outlined in orange, mitochondria outlined in blue).

To validate the localization of these fluorescent reporter proteins to the targeted compartments, we co-stained cells expressing either NLS-tdTomato or MTS-mCerulean with DAPI to label chromatin and mitochondrial nucleoids or MitoTracker Red to label mitochondrial proteins, respectively. As shown in the micrographs in Figure 1B and validated by colocalization analysis, the marker proteins localized to their intended subcellular compartments consistently and with high specificity, a necessary requirement for computer-assisted image analysis, whereas these standard dyes comparatively displayed highly variable and non-specific staining.

To carry out Q-SCAn, fluorescent micrographs of cells expressing the marker proteins were obtained (Figure 1C). The Q-SCAn method, which is implemented in the open source CellProfiler software package (the CellProfiler pipelines necessary to run Q-SCAn are available for download at http://www.biochem.emory.edu/doetsch/qscan.html), was then applied to the images collected as depicted in Figure 1D to: (i) identify the location of cells from the DIC image utilizing information from the marker channels; (ii) filter out cells that have poor marker fluorescence that would prevent robust identification of compartments; (iii) identify, within each cell, the location of nuclei and mitochondria using information from the tdTomato and mCerulean channels, respectively; (iv) subtract the nuclei and mitochondria from the cell to define the cytoplasm; (v) remove overlapping parts of nuclei and mitochondria; and (vi) quantify mean GFP intensity within each cell compartment (Figure 1D). The result of the compartmentalization is shown in Figure 1E. A measure indicating the relative distribution of the protein, a localization index, is then constructed, where the mean nuclear GFP intensity is divided by the sum of the mean nuclear and mean mitochondrial GFP intensities. This analysis results in a measure ranging from 0 (100% mitochondrial) to 1 (100% nuclear) (Figure 1F). (See Appendix S1 and Figures S1–S5 for technical details of the analysis, Supporting Information.)

To calibrate the system, we constructed and expressed localization sequence-fused GFP proteins targeted to the nucleus, NLS-GFP2 (NucGFP) [16] and mitochondria, MTS-GFP2 (MitoGFP), as well as one designed to remain in the cytoplasm, NES-GFP2 (CytoGFP) (Figure 2A). The thresholds and corrections used in the algorithm were tuned such that the nuclear and mitochondrial GFP protein-expressing cells resulted in an exclusively nuclear or mitochondrial localization index, respectively (Figure 2B), though mitochondrial localization is somewhat less sharply defined than nuclear localization. A protein which does not localize to either compartment, cytoplasmic GFP, has a neutral and diffuse localization index distribution. These data demonstrate that the nucleomitochondrial localization index is valid for quantifying proteins with distributions in between these two extremes.

Figure 2.

Q-SCAn calibrated to detect distinct nuclear and mitochondrial localization. Saccharomyces cerevisiae cells co-expressing integrated NLS–tdTomato and MTS–mCerulean, and nuclear, mitochondrial, or cytoplasmic GFP (NucGFP, MitoGFP, CytoGFP) were imaged. Greater than 200 cells were analyzed per experiment. A) Representative images of cells expressing a localized GFP. Scale bar is 5 µm. B) Quantification of the distribution of localization controls presented as mitonuclear localization index (0 = mitochondrial to 1 = nuclear). Vertical black line indicates the sample median, white box the interquartile range, and horizontal black line the adjacent range.

Q-SCAn used to analyze nuclear localization

To assess the practical utility of Q-SCAn to examine a classical signaling paradigm for multicompartment protein localization, the movement of a transcription factor from the cytoplasm to the nucleus [1, 2], we analyzed the localization of the S. cerevisiae transcription factor, Yap1 [13]. Under normal cellular conditions, Yap1 shuttles between the nucleus and the cytoplasm but shows diffuse, steady-state localization primarily to the cytoplasm, as the rate of nuclear export exceeds the rate of nuclear import [22]. However, upon exposure to oxidative stress, although steady-state levels of Yap1 do not change significantly [23], Yap1 nuclear export is blocked, resulting in rapid nuclear accumulation [7, 22].

We expressed Yap1-GFP in S. cerevisiae expressing the nuclear marker protein. As expected, in the absence of any oxidative stress (Figure 3A, no treatment), Yap1-GFP was diffusely localized throughout each cell [23]. Cultures were then treated with a mild dose of hydrogen peroxide and samples were imaged within three windows of time following exposure (Figure 3A). On applying Q-SCAn using a nucleocytoplasmic localization index (cytoplasmic = 0, nuclear = 1), the rapid mobilization of Yap1 into the nucleus was readily apparent (Figure 3B). The time-course and extent of relocalization corresponds well with previously published work [7] as well as with manual scoring of the same data (Figure 3C). Q-SCAn also allowed us to obtain new biological information about the heterogeneity of the distribution of Yap1 in the population. Initially the variance in Yap1 localization is very wide, but the distribution quickly tightens during oxidative stress as nuclear localization increases. The comparison with manual scoring also allowed direct comparison of the efficiency benefits of using Q-SCAn. Analyzing the same dataset (from end of image collection to start of data analysis) of over 200 images manually required approximately 6 h of working time while setting up and beginning the Q-SCAn analysis for the same number of cells required less than 30 min, with minimal interaction over the course of the analysis. With a small program we developed to fully automate the process, even this minimal interaction is eliminated (Appendix S1).

Figure 3.

Yap1GFP redistributes to the nucleus under oxidative stress, as revealed by Q-SCAn. Saccharomyces cerevisiae cells co-expressing integrated NLS–tdTomato and MTS–mCerulean and Yap1–GFP were exposed to 5 mm hydrogen peroxide and imaged within three windows of time after exposure, 1–10 min, 30–45 min and 60–80 min. Across three independent replicates, >200 cells were analyzed per time window. A) Representative images of cells expressing Yap1-GFP before treatment (no treatment: NT) and within each time window following exposure. Scale bar is 5 µm. B) Quantification of the distribution of Yap1-GFP (0 = cytoplasmic to 1 = nuclear) within each time window following exposure. Horizontal black line indicates the sample median, white box the interquartile range, and vertical black line the adjacent range. C) Percentage of cells visually scored for cytoplasmic (C, white), nuclear and cytoplasmic (N/C, light gray), or nuclear (N, dark gray) Yap1-GFP localization within each time window following exposure. Boundaries are mean ± SEM.

Q-SCAn used to analyze the dual-localized DNA repair protein Ntg1

To extend the utility of Q-SCAn beyond localization to a single compartment, we next analyzed a system where a protein is localized to two specific compartments. For this purpose, we examined a DNA repair protein critical for the BER pathway, the yeast endonuclease III-like DNA N-glycosylase/AP lyase, Ntg1, which localizes to both the nucleus and mitochondria [14] to differing extents depending upon the DNA damage status of each genome [4].

To assess the utility of Q-SCAn for detecting differences in the distribution of a protein between the nucleus and mitochondria, we exploited previously characterized mutants of Ntg1 with altered targeting to each compartment [5] as indicated in Figure 4A and compared their localization with wild type Ntg1. We expressed these Ntg1 localization sequence mutants (Figure 4A) in S. cerevisiae containing the Q-SCAn nuclear and mitochondrial reporter proteins. These cells were imaged and analyzed via Q-SCAn. Micrographs of representative fields of cells are shown in Figure 4B. Q-SCAn revealed that wild type Ntg1 is relatively evenly distributed between the nucleus and mitochondria, with most cells containing slightly more nuclear Ntg1 relative to mitochondrial Ntg1. Disrupting the MTS resulted in nearly complete loss of mitochondrial localization (Figure 4C). Altering both segments of the bipartite NLS within Ntg1 (NLS1 and NLS2) resulted in loss of nearly all nuclear localization of Ntg1 (Figure 4C). Consistent with a previous study that defined the contributions of NLS1 and NLS2 to Ntg1 localization [5], disruption of NLS2 also resulted in significant loss of nuclear localization, but, notably, Q-SCAn revealed that this Ntg1 variant retained some nuclear localization as compared to the NLS1/2 double mutant. These data demonstrate that Q-SCAn can be readily used to experimentally quantify the localization of nucleomitochondrial proteins with different distributions between cellular compartments, revealing even modest changes such as the difference between altering both NLS1 and NLS2 vs. NLS2 alone.

Figure 4.

Localization of Ntg1 analyzed by Q-SCAn. Saccharomyces cerevisiae cells co-expressing integrated NLS-tdTomato and MTS-mCerulean, and an Ntg1-GFP variant were imaged. A) Schematic of Ntg1 showing nuclear localization motifs (NLS, orange), mitochondrial matrix targeting signal (MTS, blue) and catalytic site (yellow), indicating the amino acid changes introduced within each targeting signal. B) Representative images of cells expressing Ntg1-GFP variants. Scale bar is 5 µm. C) Quantification of the distribution of Ntg1-GFP fusion proteins (0 = mitochondria to 1 = nucleus). Vertical black line indicates the sample median, white box the interquartile range, and horizontal black line the adjacent range. Asterisk denotes distributions determined to be significantly different from wild type Ntg1 (p < 0.05). Across three independent replicates, >350 cells were analyzed per variant analyzed.

Application of Q-SCAn to define targeting signals within the DNA repair protein Ung1

Finally, to assess the utility of Q-SCAn to obtain novel biological insights, we analyzed another DNA repair protein, the S. cerevisiae uracil DNA-glycosylase, Ung1. Similar to Ntg1, Ung1 localizes to both the nucleus and mitochondria [24]; however, neither the distribution of the protein between the two compartments has been analyzed, nor have the intracellular targeting signals been experimentally defined. The predicted nuclear and mitochondrial localization signal sequences within Ung1 were identified via the NUCDISC component of PSORT II [25] and iPSORT [26], respectively. As shown in Figure 5A, these predicted targeting motifs within Ung1 are arrayed in an overlapping manner similar to the organization of these sequences in Ntg1 (see Figure 4A).

Figure 5.

Functional analysis of Ung1 targeting signals revealed by Q-SCAn. Saccharomyces cerevisiae cells co-expressing integrated NLS-tdTomato and MTS-mCerulean, and an Ung1-GFP variant on a 2μ plasmid were imaged. A) Schematic of Ung1 showing its predicted nuclear localization signals (NLS, orange, identified via PSORT II [25]), mitochondrial matrix targeting signal (MTS, blue, identified via iPSORT [26]), and catalytic site (yellow), indicating the amino acid changes introduced within each putative targeting signal. B) Representative images of cells expressing Ung1-GFP variants. Scale bar is 5 µm. C) Quantification of the distribution of Ung1-GFP fusion proteins (0 = mitochondria to 1 = nucleus). Vertical black line indicates the sample median, white box the interquartile range, and horizontal black line the adjacent range. Asterisk denotes distributions determined to be significantly different from wild type Ung1 (p < 0.05). Across three independent replicates, >250 cells were analyzed per variant analyzed.

Localization of wild type Ung1 reveals that this BER protein has a distinct intracellular distribution compared to Ntg1 (Figure 5B,C) as Ung1 displays greater nuclear concentration than Ntg1. This assessment can be made by comparing the distribution of localization indices for the two proteins (median, Ung1: 1.0 vs. Ntg1: 0.68). To extend the analysis and gain insight into the functional targeting signals in Ung1, we disrupted the three predicted localization sequences (MTS, NLS1 and NLS2) within Ung1 by site-directed mutagenesis (Figure 5A). We compared the localization of these Ung1 variants to wild type Ung1 using Q-SCAn. Micrographs of representative fields are shown in Figure 5B. These results reveal that NLS1 is necessary and sufficient for nuclear localization of Ung1 (Figure 5C), defining the functional NLS within Ung1. NLS2 may have a minor contribution to nuclear localization, but the distributions of the NLS1 mutant and NLS1/2 double mutant are statistically identical. The Ung1 variant with the altered MTS lost all mitochondrial localization as would be predicted if this motif is a functional targeting signal (Figure 5C). These data demonstrate that Q-SCAn can provide novel information about the extent of intermediate shifts in protein localization that may not be readily apparent by visual inspection.

Discussion

The goal of the work presented here was to develop a rapid and quantitative method that could be used to assess the relative amount of a protein present in multiple compartments. Our results demonstrate the utility of Q-SCAn for such studies. Given the automated and rapid data analysis feasible with Q-SCAn, biological questions that examine changes in the localization of a dual-compartment protein can be readily addressed quantitatively. For example, a change in compartment-specific localization in response to a cellular signal over time could be analyzed as presented here for Yap1. In addition, quantitative information regarding the contribution of intracellular targeting signals to specific cellular compartments, such as described here for the DNA repair proteins Ntg1 and Ung1, can be readily collected and analyzed. Thus, Q-SCAn expands the tools available to address questions of protein distribution in a quantitative manner. Furthermore, Q-SCAn has been implemented in such a way as to ensure that the method can be readily employed by those that wish to analyze any protein of interest.

This study also demonstrates how Q-SCAn can be employed to gather new biological information about a protein of interest. While accumulation of Yap1 within the nucleus upon oxidative stress is a well-established biological response [7, 13, 22], our analysis reveals information about the extent of nuclear localization with time. Furthermore, we employed Q-SCAn to define the functional localization sequences of the DNA repair protein Ung1, which is localized both to the nucleus and mitochondria [24]. Our analysis reveals that the NLS1 sequence constitutes the functional classical NLS in Ung1. However, altering both predicted classical NLS motifs did not eliminate all nuclear localization, indicating that Ung1 likely exploits a non-classical nuclear import pathway to ensure access to the nucleus. Such a mechanism using dual pathways to access the nucleus has also been observed for Ntg1 [5]. Thus, in addition to facilitating experiments through rapid and automated data analysis, the quantitative results obtained using Q-SCAn provide novel information that is not revealed through conventional qualitative scoring methods.

A major strength of Q-SCAn is that the approach has been developed so that it can be easily applied by a user without any need for highly specialized equipment or software. Q-SCAn employs the open-source CellProfiler software package [27]. This package is user-friendly and highly modular, allowing for any aspect of the Q-SCAn algorithm to be modified for different circumstances or to couple with other analyses. The CellProfiler package is also under active development and improvement at the Broad Institute, and technical assistance is readily available if required (www.cellprofiler.org).

There are a number of points that must be taken into consideration when implementing Q-SCAn. First, a marker protein for the compartment or compartments of interest will need to be designed. The fluorescence spectrum of fluorescent markers should not overlap with the fluorescence spectrum of the analyzed protein. Here we selected tdTomato and mCerulean coupled with GFP based on the spectral properties of these fluorescent proteins [17]. Another consideration is that, as described here, the localization examined is not that of the endogenous protein, but rather that of a fluorescently tagged protein. Such tagged proteins are commonly used to obtain information about protein localization [28, 29] but both the presence of the tag and the level of the tagged-protein relative to endogenous protein need to be taken into account when interpreting results. Fluorescence noise should also be monitored. Although several noise-removal methods are implemented in Q-SCAn, some noise still remains. As a result, as the target protein level approaches the noise level, the localization index will approach equality. Finally, care must be taken to ensure that cells in the imaged fields are well-separated. While the cell-finding algorithm can readily separate cells that touch, there can be difficulty in separating cell clusters, which could inadvertently be treated as a single cell.

Development of methods to analyze protein distribution is an active area of research. A recent publication employed a dye to mark the S. cerevisiae plasma membrane to quantify recruitment of YFP-Ste5 to the plasma membrane upon pheromone signaling [30]. Using this approach, the authors were able to calculate both the pheromone dose–response for Ste5 recruitment to the membrane and the dissociation constant for Ste5 from the membrane. While elements of this quantitative approach are similar to Q-SCAn, there are some significant differences. These authors employed a dye to mark the target compartment, but such dyes are not readily available to specifically mark all cellular compartments. In addition, the method was implemented using a special-purpose software, Cell-ID [31]. While this program, like CellProfiler, is open-source, significant expertise would be required to use or modify the program. Another key distinction from Q-SCAn is that the authors employed defocused brightfield images to identify cells. While this approach facilitates cell-finding, obtaining these images on a non-automated microscope would be laborious and employing such images could result in a loss of information about internal cellular morphology and cell fitness. Given the different biological questions addressed by the two approaches, recruitment of a protein to the membrane [30] and movement of proteins between intracellular compartments (this report), these two methods complement one another to obtain quantitative biological information.

As a general method, there are numerous ways that Q-SCAn could be developed and extended, which is facilitated by implementation in CellProfiler. For example, the cell-finding algorithm could be replaced to identify mammalian cells, and the compartment-finding algorithms could be adapted to different types of markers including immunofluorescence. A third fluorescent protein from the deep red spectral range such as mPlum [32] could be introduced to mark a third compartment of interest. More than three fluorescent protein compartment markers could be employed if spectral imaging/linear unmixing is used to separate the signals. The localization distribution information could also be coupled to other information, such as cell size, compartment morphology or measures of the protein distribution within a compartment. With modifications, Q-SCAn could also be adapted to analyze localization within three-dimensional datasets [33]. In addition to analyzing single images, Q-SCAn could further be employed to extract localization data from frames of a video, allowing quantitative analysis of the dynamics of protein relocalization within single cells. Q-SCAn could also have applications in diagnostics and therapeutics, as mislocalization of proteins is associated with disease processes including cancer, autoimmune disorders and degenerative disorders [34].

This study reports the development and utilization of Q-SCAn as a facile, quantitative analytical tool for providing broader and more detailed analysis of the localization of multicompartment proteins when compared to the current approaches available.

Materials and Methods

Yeast strains, media and growth conditions

Saccharomyces cerevisiae strains used in this study are listed in Table 1. S. cerevisiae cells were cultured at 30°C in rich YPD medium (1% yeast extract, 2% peptone, 2% dextrose and 2% agar for plates) or synthetic defined drop-out media for selection (uracil/leucine) and/or transcription induction (methionine). Plasmids or integrated constructs were transformed into cells by a modified lithium acetate method [35].

Table 1. Plasmids and strains used in this study
NameGenotypeRef
FY86 (ACY193)MATa ura3-52 leu2Δ1 his3Δ200[36]
Q-SCAn (DSC569)MATa ura3-52 his3Δ200 NLS-tdTom MTS-mCerThis study
pRS305LEU2, AmpR[37]
pRS426, URA3, AmpR[37]
pNLS-tdTom (pD0464)pCYC1-BPSV40NLS-tdTomato, LEU2, AmpRThis study
pMTS-mCer (pD0465)pTEF1-Su9MTS-mCerulean, LEU2, AmpRThis study
pQ-SCAn (pD0466)pCYC1-BPSV40NLS-tdTomato,This study
pTEF1-Su9MTS-mCerulean, LEU2, AmpR
pNLS-GFP2pMET17-BPSV40NLS-GFP-GFP, CEN, URA3, AmpR[16]
pMTS-GFP2 (pD0467)pMET17-Su9MTS-GFP-GFP, CEN, URA3, AmpRThis study
pNES-GFP2 (pD0468)pMET17-NES-GFP-GFP, CEN, URA3, AmpRThis study
pYap1-GFPYAP1-GFP, CEN, URA3, AmpR[7]
pNtg1-GFPNTG1-GFP, , URA3, AmpR[4]
pNtg1NLS2-GFPntg1NLS2-GFP, , URA3, AmpR[5]
pNtg1NLS1/2-GFPntg1NLS1/2-GFP, , URA3, AmpR[5]
pNtg1MTS-GFPntg1MTS-GFP, , URA3, AmpR[5]
pUng1-GFP (pD0419)UNG1-GFP, , URA3, AmpRThis study
pUng1NLS1-GFP (pD0469)ung1NLS1-GFP, , URA3, AmpRThis study
pUng1NLS2-GFP (pD0470)ung1NLS2-GFP, , URA3, AmpRThis study
pUng1NLS1/2-GFP (pD0471)ung1NLS1/2-GFP, , URA3, AmpRThis study
pUng1MTS-GFP (pD0472)ung1MTS-GFP, , URA3, AmpRThis study

Plasmid construction

Plasmids used in this study are listed in Table 1. Both a nuclear marker gene (NLS-tdTom) and a mitochondrial marker gene (MTS-mCer) were constructed and inserted into an S. cerevisiae integration plasmid (pRS305) [37]. Each marker gene was expressed from a constitutive promoter (low-level CYC1 [18] and high-level TEF1 [18], respectively, chosen to equalize the brightness of the marker protein) and terminated by a generic transcription terminator from the NUF2 gene [19]. The marker genes themselves are composed of a localization sequence fused to a fluorescent protein (NLS-tdTom: bipartite SV40 nuclear localization signal (NLS) [16] and tdTomato [17]; MTS-mCer: Neurospora crassa Su9 mitochondrial matrix targeting sequence (MTS) [20] and mCerulean [21]). The pRS305 LEU2 gene contains a unique EcoRV site to linearize the plasmid and thus enhance integration efficiency. Localization of these reporters in S. cerevisiae was confirmed by co-staining with DAPI or MitoTracker Red CMXRos (Invitrogen) and performing colocalization analysis on the fluorescence micrographs [8, 38]. Mitochondrial and cytoplasmic GFP localization control plasmids were constructed by replacing the NLS of pNLS-GFP [16] with the Su9 MTS [20] or the nuclear export sequence (NES) of mammalian PKI [39], respectively. The Ung1-GFP plasmid was constructed by inserting the UNG1 gene (ORF + 914 bp upstream) amplified by PCR from wild type S. cerevisiae (FY86), an in-frame C-terminal GFP [40], and NUF2 terminator into pRS426 [41]. The predicted classical NLS [25] and MTS [26] of Ung1 were mutated (NLS1: K18A, R19A; NLS2: K40A K41A; MTS: R5A, R6A) using the QuikChange II Site-Directed Mutagenesis Kit (Agilent). All constructs were verified by sequencing.

Confocal microscopy

All fluorescence micrographs were obtained using a Zeiss LSM 510 confocal fluorescence microscope using a Plan-Apochromat 100×/1.4 NA Oil DIC objective. Each fluorescence channel was imaged sequentially, with the transmission brightfield DIC collected simultaneously with mCerulean. See Table 2 for parameters used. Fluorescence intensities from a 90.0 µm square area were encoded into a 1024 × 1024 pixel, 12-bit image file for near-optimal Nyquist sampling [42] and enhanced discrimination of fluorescence intensities, respectively. The laser dwell time was 1.60 µs/pixel. All pinholes were set to 168.00 µm (1 Airy unit for 543-nm laser line), resulting in an optical slice of ˜0.8 µm. Expression of each fluorescent protein alone was used to control for crosstalk between the fluorescence channels. Greater than 100 cells were imaged for each condition. The median projections of 10 uniform brightfield images and 10 dark images per channel were used to correct images for shading and dark current, respectively.

Table 2. Confocal microscopy parameters
 tdTomatoGFPmCeruleanDIC
  1. aSometimes had to be adjusted slightly to maintain an image solidly within the dynamic range.
Excitation laser line (nm)543 nm488 nm405 nm405 nm
Laser power (mW)1.0 mW15.0 mW25.0 mW25.0 mW
Laser transmission (%)10%30%65%65%
Detected λ range (nm)>560 nm505–545 nm420–490 nm
Detector gain (V)750 V725 V750 V260 Va
Amplifier offset−0.01−0.01−0.030.00

Yap1-GFP hydrogen peroxide treatment

Overnight late-log phase (˜1 × 108 cells/mL) cultures of S. cerevisiae cells expressing the Q-SCAn reporter and Yap1-GFP were washed twice with H2O, resuspended in 1 mL H2O, counted via hemocytometer, and adjusted to a density of 2–4 × 107 cells/mL. Cells were treated with 5 mm H2O2 and imaged by confocal microscopy both before treatment and within three time windows following the start of treatment: 1–10 min, 30–45 min or 60–80 min of start of treatment. Cells were visually scored for nuclear only, cytoplasmic only, or nuclear and cytoplasmic Yap1-GFP localization.

Data analysis

Colocalization analysis of the marker proteins was conducted using the commercial software Volocity 5.6.2 (PerkinElmer). ImageJ 1.46r was used to export channels from the raw images and to compute the median shading and dark current correction images [43]. CellProfiler 2.0 (Developer build) with custom plugins was used to process the images, identify S. cerevisiae cells and compartments, and measure fluorescence intensities [27]. CellProfiler Analyst 2.0 (r11710) was used to develop filtering thresholds for the algorithm [44]. Statistical analysis was performed with Stata 11.2. The localization index for a single cell is the mean nuclear GFP intensity divided by the sum of the mean nuclear and mean mitochondrial GFP intensities, producing a value in the range of 0 (mitochondrial) to 1 (nuclear). Cells whose sums of intensities were below the noise threshold were excluded. Shifts in localization were assessed by the Kruskal–Wallis test and the post-hoc multiple comparisons test with correction for ties [45], with α = 0.05.

Acknowledgments

We would like to thank the Emory Custom Cloning Core Facility and the Emory University Integrated Cellular Imaging Core of the Winship Cancer Institute (National Cancer Institute Cancer Center Support Grant P30CA138292), especially Debbie Martinson and Dr. Adam Marcus. We would also like to thank the members of the Doetsch and Corbett laboratories, especially Drs. Dan Swartzlander and Natalya Degtyareva, for their constructive feedback in designing these experiments and in the preparation of this manuscript. Finally, we thank the Imaging Platform at the Broad Institute for help with CellProfiler. This work was supported by the National Institutes of Health [grant numbers P01ES011163 to P. W. D., F31CA168272 to N. C. B.]. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Ancillary