The Institute for Genomic Research, 9712 Medical Centre Drive, Rockville, Maryland 20850, USA.
Organizational characteristics and information content of an archaeal genome: 156kb of sequence from Sulfolobus solfataricus P2
Article first published online: 6 OCT 2006
DOI: 10.1111/j.1365-2958.1996.tb02666.x
Additional Information
How to Cite
Sensen, C. W., Klenk, H.-P., Singh, R. K., Allard, G., Chan, C. C.-Y., Liu, Q. Y., Penny, S. L., Young, F., Schenk, M. E., Gaasterland, T., Doolittle, W. F., Ragan, M. A. and Charlebois, R. L. (1996), Organizational characteristics and information content of an archaeal genome: 156kb of sequence from Sulfolobus solfataricus P2. Molecular Microbiology, 22: 175–191. doi: 10.1111/j.1365-2958.1996.tb02666.x
Publication History
- Issue published online: 6 OCT 2006
- Article first published online: 6 OCT 2006
- Received 16 November, 1995; revised 16 July, 1996; accepted 22 July, 1996
- Abstract
- References
- Cited By
Summary
We have initiated a project to sequence the 3Mbp genome of the thermoacidophilic archaebacterium Sulfolobus solfataricus P2. Cosmids were selected from a provisional set of minimally overlapping clones, subcloned in pUC18, and sequenced using a hybrid (random plus directed) strategy to give two blocks of contiguous unique sequence, respectively, 100389 and 56105bp. These two contigs contain a total of 163 open reading frames (ORFs) in 26–29 putative operons; 56 ORFs could be identified with reasonable certainty. Clusters of ORFs potentially encode proteins of glycogen biosynthesis, oxidative decarboxylation of pyruvate, ATP-dependent transport across membranes, isoprenoid biosynthesis, protein synthesis, and ribosomes. Putative promoters occur upstream of most ORFs. Thirty per cent of the predicted strong and medium-strength promoters can initiate transcription at the start codon or within 10 nucleotides upstream, indicating a process of initial mRNA-ribosome contact unlike that of most eubacterial genes. A novel termination motif is proposed to account for 15 additional terminations. The two contigs differ in densities of ORFs, insertion elements and repeated sequences; together they contain two copies of the previously reported insertion sequence ISC 1217, five additional IS elements representing four novel types, four classes of long non-IS repeated sequences, and numerous short, perfect repeats.

1365-2958/asset/olbannerleft.gif?v=1&s=5badf94c134fc8798f790efd4821c55ff95fc4cb)
1365-2958/asset/cover.gif?v=1&s=75254d9e3ae6a0f08085f34cc6e5bd65d8e3d52e)