Article
Exploiting genomic patterns to discover new supramolecular protein assemblies
Article first published online: 2 DEC 2008
DOI: 10.1002/pro.1
Copyright © 2008 The Protein Society
Additional Information
How to Cite
Beeby, M., Bobik, T. A. and Yeates, T. O. (2009), Exploiting genomic patterns to discover new supramolecular protein assemblies. Protein Science, 18: 69–79. doi: 10.1002/pro.1
Publication History
- Issue published online: 16 DEC 2008
- Article first published online: 2 DEC 2008
- Manuscript Accepted: 22 SEP 2008
- Manuscript Revised: 19 SEP 2008
- Manuscript Received: 30 JUL 2008
Funded by
- BER Program, U.S. Department of Energy Office of Science
Keywords:
- supramolecular assembly;
- bacterial ultrastructure;
- paralog;
- homolog;
- self assembly;
- carboxysome;
- bacterial microcompartment
Abstract
Bacterial microcompartments are supramolecular protein assemblies that function as bacterial organelles by compartmentalizing particular enzymes and metabolic intermediates. The outer shells of these microcompartments are assembled from multiple paralogous structural proteins. Because the paralogs are required to assemble together, their genes are often transcribed together from the same operon, giving rise to a distinctive genomic pattern: multiple, typically small, paralogous proteins encoded in close proximity on the bacterial chromosome. To investigate the generality of this pattern in supramolecular assemblies, we employed a comparative genomics approach to search for protein families that show the same kind of genomic pattern as that exhibited by bacterial microcompartments. The results indicate that a variety of large supramolecular assemblies fit the pattern, including bacterial gas vesicles, bacterial pili, and small heat-shock protein complexes. The search also retrieved several widely distributed protein families of presently unknown function. The proteins from one of these families were characterized experimentally and found to show a behavior indicative of supramolecular assembly. We conclude that cotranscribed paralogs are a common feature of diverse supramolecular assemblies, and a useful genomic signature for discovering new kinds of large protein assemblies from genomic data.

1469-896X/asset/olbannerleft.gif?v=1&s=d218899ae53b2862ab119790ed504b8d72122fb3)
1469-896X/asset/olbannerright.gif?v=1&s=59470eb9a1d9b7b13b1be75e9445e6c46ee2214f)
