Abstract
- Top of page
- Abstract
- Introduction
- Materials and Methods
- Results
- Discussion
- Acknowledgements
- References
- Supporting Information
Engineering robust protein production and purification of correctly folded biotherapeutic proteins in cell-based systems is often challenging due to the requirements for maintaining complex cellular networks for cell viability and the need to develop associated downstream processes that reproducibly yield biopharmaceutical products with high product quality. Here, we present an alternative Escherichia coli-based open cell-free synthesis (OCFS) system that is optimized for predictable high-yield protein synthesis and folding at any scale with straightforward downstream purification processes. We describe how the linear scalability of OCFS allows rapid process optimization of parameters affecting extract activation, gene sequence optimization, and redox folding conditions for disulfide bond formation at microliter scales. Efficient and predictable high-level protein production can then be achieved using batch processes in standard bioreactors. We show how a fully bioactive protein produced by OCFS from optimized frozen extract can be purified directly using a streamlined purification process that yields a biologically active cytokine, human granulocyte-macrophage colony-stimulating factor, produced at titers of 700 mg/L in 10 h. These results represent a milestone for in vitro protein synthesis, with potential for the cGMP production of disulfide-bonded biotherapeutic proteins. Biotechnol. Bioeng. 2011; 108:1570–1578. © 2011 Wiley Periodicals, Inc.
Introduction
- Top of page
- Abstract
- Introduction
- Materials and Methods
- Results
- Discussion
- Acknowledgements
- References
- Supporting Information
Cell-free protein synthesis systems have distinct advantages over traditional in vivo methods for protein production (Endo and Sawasaki, 2006; Jermutus et al., 1998). The absence of a requirement to maintain cell viability allows optimization of the protein synthetic capacity of the cell-free extract to produce a single protein. The absence of a cell wall allows for addition of non-natural factors to the open system in order to manipulate transcription, translation, and folding to provide precise modulation of the protein expression process (Jewett and Swartz, 2004b). Cell-free systems derived from Escherichia coli have a long history as experimental tools for exploring the molecular biology of transcription and translation (Jewett et al., 2002; Noren et al., 1989; Spirin et al., 1988; Zubay, 1973) and have been utilized as a basis for developing in vitro protein production systems at small scales.
Cell-free protein production at the multigram and kilogram scale, an essential starting requirement for biotherapeutic production processes, has been hampered by the lack of scalable systems amenable to standard bioreactor configurations at large scale. Voloshin and Swartz (2008) reported production of 385 mg/L of soluble protein corresponding to a 3-disulfide bonded protein IGF-1 to 1-L scale. However, the fraction of correctly folded and functional protein was not reported, and different yields were reported using different reaction vessel designs. Consistent and robust performance at a wide range of scales is important for using a system as a general production technology and the time and cost of biologic drug discovery and production process development can be substantially reduced with systems that scale in a predictable and efficient manner from high-throughput screening to manufacturing.
The industrial scale application of cell-free protein synthesis for the production of recombinant proteins requires efficient and reproducible preparation of large volumes of active cell extracts (Liu et al., 2005), stable and cost-effective energy generating systems (Jewett and Swartz, 2004a), and the maintenance of stable pools of precursor molecules required for high-level protein synthesis. For example, the overall protein synthetic capacity of E. coli extracts has been increased by genetically engineering pathways responsible for recycling key amino acid precursor molecules, leading to the development of the E. coli strain KGK10 (Calhoun and Swartz, 2006; Knapp et al., 2007; Michel-Reydellet et al., 2004).
Here, we extend the E. coli-based system developed by Swartz and coworkers to show how scalable open cell-free synthesis (OCFS) can be optimized for high-level production of a multi-disulfide-bonded protein, granulocyte-macrophage colony-stimulating factor (rhGM-CSF), through detailed analysis and refinement of several process elements including: extract preparation and storage, optimization of synthetic gene sequences and redox parameters, process scale-up, and downstream product purification steps. We show that after full optimization, the OCFS system can produce active rhGM-CSF, a four-helix bundle human cytokine with two disulfide bonds required for functional activity. The system is linearly scalable and can achieve yields of rhGM-CSF of 700 mg/L in 10 h over ∼106 range in volumes to 100 L. This linear scalability allows for rapid analysis of multiple parameters affecting transcription, protein synthesis, and folding in 24- or 96-well plates for high-throughput optimization of product and process parameters. Finally, the absence of a cell-lysis step after protein production, allows rapid and consistent protein purification optimization.
Discussion
- Top of page
- Abstract
- Introduction
- Materials and Methods
- Results
- Discussion
- Acknowledgements
- References
- Supporting Information
Several cell-based secretion systems produce biologically active GM-CSF in E. coli (Berges et al., 1996; Libby et al., 1987), yeast [50–60 mg/L; (Price et al., 1987)], Aspergillus niger [0.64 µg/L;(Kim et al. 2000)], Pichia pastoris (200–250 mg/L; (Sainathan et al., 2005) and 130 mg/L; [Bhatacharya et al., 2007)], and insect cells (11–45 mg/L;[Chiou and Wu, 1990)]. In general, substantial losses of product due to refolding, filtration, and extensive column chromatography steps are required to isolate homogenously pure product in these systems. Cell-free protein synthesis of GM-CSF as well as other small, disulfide-bonded proteins has been described previously at small scale (Goerke and Swartz, 2008; Kawasaki et al., 2003; Ryabova et al., 1997; Son et al., 2006; Yang et al., 2004).
Our approach represents an advance towards a robust (>95% correctly folded multi-disulfide-bonded protein), high yield (700 mg/L protein over 10 hr), linearly scalable (∼106 fold), and high-recovery (>65%) system for industrial production of disulfide-bonded proteins such as GM-CSF. Using the open nature of cell-free synthesis we optimized several factors affecting transcription, translation, and folding to predictably identify conditions for scalable production of a disulfide-bonded protein to the 100 L scale. In this in vitro synthetic biology-based system, experimental data built from an understanding of the energy source as well as transcription, translational, and protein folding components are sufficient to guide design and testing of protein expression conditions in a microtiter plate format with predictable properties at any scale. More complex secondary effects associated with in vivo gene expression and regulatory networks are avoided. For example, in T7-based systems in vivo, transcription, translation, and folding are uncoupled. The uncoupling mediated by T7 RNAP may in turn mediate protein aggregation. However, after full optimization of plasmid, T7 RNAP, and redox conditions, >95% of the protein produced is soluble, there is no evidence of significant misfolded aggregates formed as measured by non-reducing SDS–PAGE autoradiography (Fig. 5b), or detected in the final product (Fig. 6c).
Previous optimization strategies for cell-free protein expression have sampled parameters sparsely and thus cannot be generalized to the more complex, finely balanced networks of protein synthesis and folding described here. Instead, we found that one must first assemble and experimentally characterize interacting networks of myriad system components at small scale to create generalizable response surfaces such as those in Figures 1 and 3 with quantitative predictive capabilities for process scale-up.
The OCFS system uses a K-12-derived production strain as the basis for a cell-line banked under cGMP conditions. Multiple protein products can now be produced on an as needed basis from the same stored frozen extract produced from this strain. For example, similar scalable results to 4 L have been obtained for a single-chain antibody fragment (scFv), anti-IL-23 scFv (1 g/L in 10 h) and an anti-IL13Rα1 Fab antibody with five disulfide bonds (300 mg/L in 9 h; manuscript in preparation). Use of a single cell-line coupled with the speed of process optimization inherent in the linearly scalable OCFS protein production system should allow for decreases in the timelines for biopharmaceutical protein production and process development.
We have developed the OCFS production process using standard microbial fermentation and process equipment known to scale to thousands of liters under cGMP standards. Additional efforts to improve this production process toward cGMP compliance include (a) developing methods for sterile filtration of the extract, (b) establishing extract process production reproducibility, (c) establishing the translational fidelity of OCFS produced proteins (Heibeck et al., manuscript in preparation), and (d) accurate cost-modeling of the reagents and other process variables. Finally, we aim to develop a molecular understanding of the limiting factors accounting for the time and temperature dependence of extract activation shown in Figure 1, by quantitative LC-MS-based profiling of proteins and metabolites (Roy and Becker, 2007), in order to improve protein production yields further.
The OCFS system is limited to producing proteins with only a few post-translational modifications such as disulfide bonds and proteolytic processing (Son et al., 2006); other therapeutic protein classes such as glycosylated proteins are currently outside the scope of OCFS at large scale. However, OCFS has the potential for synthesis of proteins containing site-specifically incorporated non-natural amino acids (Noren et al., 1989) without the requirement for orthogonal tRNA synthetase-tRNA pairs or limitations to cellular uptake of non-natural amino acids (Antonczak et al., 2009). This will allow the engineering of novel post-translationally modified proteins using the scalable OCFS system.
Acknowledgements
- Top of page
- Abstract
- Introduction
- Materials and Methods
- Results
- Discussion
- Acknowledgements
- References
- Supporting Information
Portions of this work were supported by STTR grants NSF0420208 & NSF0522337 from National Science Foundation. We thank Soheyl Tadjiki (Postnova Analytics, Salt lake City, Utah) for Asymmetrical Flow Field-Flow Fractionation data, and Tom Zupancic (www.Virtual3C.com) for helpful comments and criticisms. The authors are grateful for the efforts of all the people who contributed to the work presented here including Albert Boot, John Garbutt, Eudean Garces, Tyler Heibeck, James Krocak, Seung Lee, Patricio Riquelme, James Rozzelle, Alexei Voloshin, and Juan Zhang.