- Short genome report
- Open Access
Complete genome sequence of the Antarctic Halorubrum lacusprofundi type strain ACAM 34
Standards in Genomic Sciencesvolume 11, Article number: 70 (2016)
Halorubrum lacusprofundi is an extreme halophile within the archaeal phylum Euryarchaeota. The type strain ACAM 34 was isolated from Deep Lake, Antarctica. H. lacusprofundi is of phylogenetic interest because it is distantly related to the haloarchaea that have previously been sequenced. It is also of interest because of its psychrotolerance. We report here the complete genome sequence of H. lacusprofundi type strain ACAM 34 and its annotation. This genome is part of a 2006 Joint Genome Institute Community Sequencing Program project to sequence genomes of diverse Archaea.
Halorubrum lacusprofundi is an extremely halophilic archaeon belonging to the class Halobacteria within the phylum Euryarchaeota . The species is represented by the type strain, ACAM 34 (= DSM 5036 = ATCC 49239 = JCM 8891), and a second strain, ACAM 32, both of which were isolated from Deep Lake, Antarctica . This organism was first described as Halobacterium lacusprofundi but was later transferred to the genus Halorubrum . Members of the genus Halorubrum have been found not only in Antarctica, but also in Africa , Asia , and North America , where they are usually found in saline lakes or salterns. Most members of the genus are neutrophiles, but some are haloalkaliphiles [6, 7]. H. lacusprofundi (Fig. 1) was proposed for sequencing as part of a 2006 Joint Genome Institute Community Sequencing Program project because of its ability to grow at low temperature and its phylogenetic distance from other halophiles with sequenced genomes (Fig. 2).
Classification and features
Halorubrum lacusprofundi ACAM 34 was isolated from a water-sediment sample from Deep Lake, Antarctica . The water-sediment sample was incubated in the light at 18 °C, and after 3 months developed a reddish color. H. lacusprofundi was isolated from the sample by streaking on Deep Lake vitamin agar, which was composed of Lake Deep water with 1 g/L yeast extract, 15 g/L agar, and vitamin solution. The physiological characteristics of H. lacusprofundi were described as follows . Cells were pleomorphic. Motility was not observed, and no flagella were present. Cells grew at a temperature range of −1 °C to 40 °C with an optimal growth temperature of 36 °C . Growth was observed at NaCl concentration of 1.5 M to 4.5 M with an optimum salt concentration of 3.5 M. Cells lysed in distilled water. The optimum magnesium concentration for growth was 0.1 M. No growth was observed at magnesium concentrations of 0 M or 1.0 M. Ammonium could not be used as a nitrogen source; complex media such as yeast extract or peptone was required. Growth was stimulated by addition of glucose, galactose, mannose, ribose, lactose, glycerol, succinate, lactate, formate, acetate, propionate, and ethanol. Growth was not stimulated by addition of glycine. Acid was not produced from sugars.
Genome sequencing information
Genome project history
H. lacusprofundi was selected for sequencing based upon its phylogenetic position relative to other haloarchaea and its cold tolerance (Table 1). It is part of a 2006 Joint Genome Institute Community Sequencing Program project that included six diverse archaeal genomes. Sequencing was done at the JGI Production Genomics Facility. Finishing was done at Los Alamos National Laboratory. Annotation was done at Oak Ridge National Laboratory and JGI. The complete genome sequence was finished in September, 2008 and was released to the public in GenBank in February, 2009. A summary of the project information is shown in Table 2.
Growth conditions and genomic DNA preparation
H. lacusprofundi ATCC 49239 was grown in Franzmann medium (180 g NaCl, 75 g MgCl2 · 6H2O, 7.4 g MgSO4 · 7H2O, 7.4 g KCl, 1 g CaCl2 · 2H2O, 10 g C4H4O4Na2 · 6H2O per liter, pH 7.4 with addition of 10 ml vitamin solution) . The vitamin solution contained 0.1 g biotin, 0.1 g cyanocobalamin, and 0.1 g thiamine HCl per liter. Cells were grown with shaking at 220 rpm at 4 °C with illumination.
The DNA extraction method was modified from . Cells were grown to OD600 = 0.8, collected by centrifugation at 8000 rpm for 10 min at 4 °C, resuspended in 1/20 volume basal salts and lysed by addition of 2 volumes of deionized water and mixing at room temperature. Next, proteinase K was added to a final concentration of 100 μg/ml, mixed gently, and incubated for 1 h at 37 °C. The lysate was extracted using an equal volume of phenol, mixed gently by inverting at room temperature for 5 min, and then spinning at 8000 g for 15 min at 4 °C. The aqueous and interphase was collected and the phenol extraction was repeated twice more. The aqueous and interphase were then dialyzed against TE overnight at 4 °C with one change of buffer. The dialyzed solution was collected and RNase A was added to a final concentration of 50 μg/ml, the solution was mixed and incubated for 2 h at 37 °C with gentle shaking. Proteinase K was added to a final concentration of 100 μg/ml, mixed and incubated for an additional hour at 37 °C. The RNase A and proteinase K steps were repeated. The DNA was then dialyzed overnight against TE at 4 °C with one buffer change.
Genome sequencing and assembly
The genome of H. lacusprofundi was sequenced at the Joint Genome Institute using a combination of 3 kb, 8 kb, and fosmid DNA libraries. All general aspects of library construction and sequencing were performed at the JGI . Draft assemblies were based on 40,800 total reads. All libraries provided 12.5× coverage. The Phred/Phrap/Consed software package was used for sequence assembly and quality assessment [11–13]. After the shotgun stage, reads were assembled with parallel phrap (High Performance Software, LLC). Possible mis-assemblies were corrected with Dupfinisher  or transposon bombing of bridging clones (Epicentre Biotechnologies, Madison, WI). Gaps between contigs were closed by editing in Consed, custom primer walk or PCR amplification (Roche Applied Science, Indianapolis, IN). A total of 1722 additional reactions were necessary to close gaps and to raise the quality of the finished sequence. The completed genome sequence of H. lacusprofundi contains 54,250 reads, achieving an average of 11.8× and 13.8× coverage in the chromosomes per base with an error rate of less than 1 in 50,000 bp.
Protein-coding genes were identified using a combination of CRITICA  and Glimmer  followed by a round of manual curation using the JGI GenePRIMP pipeline . GenePRIMP points out cases where gene start sites may be incorrect based on alignment with homologous proteins. It also highlights genes that appear to be broken into two or more pieces, due to a premature stop codon or frameshift, and genes that are disrupted by transposable elements. All of these types of broken and interrupted genes are labeled as pseudogenes. Genes that may have been missed by the gene calling programs are also identified in intergenic regions. The predicted CDSs were translated and used to search the National Center for Biotechnology Information nonredundant database, UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and Interpro databases. Signal peptides were identified with SignalP , and transmembrane helices were determined with TMHMM . CRISPR elements were identified with the CRISPR Recognition Tool . Paralogs are hits of a protein against another protein within the same genome with an e-value of 10−2 or lower. The tRNAScanSE tool  was used to find tRNA genes. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes Expert Review (IMG-ER)  and HaloWeb  platform.
The genome of H. lacusprofundi consists of two chromosomes of length 2,735,295 bp (Chromosome 1) and 525,943 bp (Chromosome 2 or pHL500) and one plasmid of length 431,338 bp (pHL400) (Table 3). The map of the genome is available on HaloWeb . Partial sequence was obtained from a second smaller plasmid, but it appeared to be present in a minority of the cells and its complete sequence could not be determined. The GC content of the large chromosome (67 %) is larger than those of the small chromosome (57 %) and the plasmid (55 %). There are 2801 genes on the large chromosome, 522 genes on the smaller chromosome, and 402 genes on the plasmid. Two of the ribosomal RNA operons are on the large chromosome and one is found on the smaller chromosome. The properties and statistics of the genome are summarized in Table 4, and genes belonging to COG functional categories are listed in Table 5.
The Halorubrum lacusprofundi genome sequence is the first established from a cold-adapted haloarchaeon. The genome has features typical of halophilic Archaea, including high G + C-content, large extrachromosomal replicons, and eukaryotic-like DNA replication and transcription genes. Encoded proteins are highly acidic with properties that suggest looser packing and greater flexibility important for function at cold temperatures [25–28]. H. lacusprofundi co-exists in a community of three major haloarchaea in Deep Lake, Antarctica [29, 30].
Coding region identification tool invoking comparative analysis
PRofils pour l’Identification Automatique du Métabolisme
Kyoto Encyclopedia of Genes and Genomes
Clusters of Orthologous Groups
Transmembrane hidden Markov model
Clustered regularly interspaced short palindromic repeats
Franzmann PD, Stackebrandt E, Sanderson K, Volkman JK, Cameron DE, Stevenson PL, McMeekin TA, Burton HR. Halobacterium lacusprofundi sp. nov., a halophilic bacterium isolated from Deep Lake, Antarctica. Syst Appl Microbiol. 1988;11:20–7.
McGenity TJ, Grant WD. Transfer of Halobacterium saccharovorum, Halobacterium sodomense, Halobacterium trapanicum NRC 34021 and Halobacterium lacusprofundi to the genus Halorubrum gen. nov., as Halorubrum saccharovorum comb. nov., Halorubrum sodomense comb. nov., Halorubrum trapanicum comb. nov., and Halorubrum lacusprofundi comb. nov. Syst Appl Microbiol. 1995;18:237–43.
Kharroub K, Quesada T, Ferrer R, Fuentes S, Aguilera M, Boulahrouf A, Ramos-Cormenzana A, Monteoliva-Sánchez M. Halorubrum ezzemoulense sp. nov., a halophilic archaeon isolated from Ezzemoul sabkha, Algeria. Int J Syst Evol Microbiol. 2006;56:1583–8.
Cui H-L, Tohty D, Zhou P-J, Liu S-J. Halorubrum lipolyticum sp. nov. and Halorubrum aidingense sp. nov., isolated from two salt lakes in Xin-Jiang, China. Int J Syst Evol Microbiol. 2006;56:1631–4.
Pesenti PT, Sikaroodi M, Gillevet PM, Sánchez-Porro C, Ventosa A, Litchfield CD. Halorubrum californiense sp. nov., an extreme archaeal halophile isolated from a crystallizer pond at a solar salt plant in California, USA. Int J Syst Evol Microbiol. 2008;58:2710–5.
Mwatha WE, Grant WD. Natronobacterium vacuolata sp. nov., a haloalkaliphilic archaeon isolated from Lake Magadi, Kenya. Int J Syst Bacteriol. 1993;43:401–4.
Fan H, Xue Y, Ma Y, Ventosa A, Grant WD. Halorubrum tibetense sp. nov., a novel haloalkaliphilic archaeon from Lake Zabuye in Tibet, China. Int J Syst Evol Microbiol. 2004;54:1213–6.
Reid IN, Sparks WB, Lubow S, McGrath M, Livio M, Valenti J, Sowers KR, Shukla HD, MacAuley S, Miller T, Suvanasuthi R, Belas R, Colman A, Robb FT, DasSarma P, Müller JA, Coker JA, Cavicchioli R, Chen F, DasSarma S. Terrestrial models for extraterrestrial life: methanogens and halophiles at Martian temperatures. Int J Astrobiol. 2006;5:89–97.
DasSarma S, Fleischmann EM, Robb FT, Place AR, Sowers KR, Schreier HJ, editors. Archaea–A Laboratory Manual–Halophiles, Cold Spring Harbor. New York: Cold Spring Harbor Press; 1995.
Joint Genome Institute, Our Projects. http://www.jgi.doe.gov/sequencing/protocols/prots_production.html. Accessed 27 Jun 2016.
Ewing B, Green P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 1998;8:186–94.
Ewing B, Hillier L, Wendl MC, Green P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 1998;8:175–85.
Gordon D, Abajian C, Green P. Consed: a graphical tool for sequence finishing. Genome Res. 1998;8:195–202.
Han CS, Chain P. Finishing repeat regions automatically with Dupfinisher. In: Arabnia HR, Valafar H, editors. Proceedings of the 2006 international conference on bioinformatics & computational biology. Las Vegas: CSREA Press; 2006. p. 141–6.
Badger JH, Olsen GJ. CRITICA: coding region identification tool invoking comparative analysis. Mol Biol Evol. 1999;16:512–24.
Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. Improved microbial gene identification with GLIMMER. Nucleic Acids Res. 1999;27:4636–41.
Pati A, Ivanova NN, Mikhailova N, Ovchinnikova G, Hooper SD, Lykidis A, Kyrpides NC. GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes. Nat Methods. 2010;7:455–7.
Emanuelsson O, Brunak S, von Heijne G, Nielsen H. Locating proteins in the cell using TargetP, SignalP and related tools. Nat Protoc. 2007;2:953–71.
Krogh A, Larsson B, von Heijne G, Sonnhammer EL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001;305:567–80.
Bland C, Ramsey TL, Sabree F, Lowe M, Brown K, Kyrpides NC, Hugenholtz P. CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats. BMC Bioinform. 2007;8:209.
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–64.
Markowitz VM, Mavromatis K, Ivanova NN, Chen IMA, Chu K, Kyrpides NC. IMG ER: a system for microbial genome annotation expert review and curation. Bioinform. 2009;25:2271–8.
DasSarma SL, Capes MD, DasSarma P, DasSarma S. HaloWeb: the haloarchaeal genomes database. Saline Sys. 2010;6:12.
Halorubrum lacusprofundi Genome. http://halo.umbc.edu/cgi-bin/haloweb/hla.pl?operation=map_query1. Accessed 27 Jun 2016.
DasSarma S, DasSarma P. Halophiles and their enzymes: negativity put to good use. Curr Opin Microbiol. 2015;25:120–6.
Karan R, Capes MD, DasSarma S. Function and biotechnology of extremophilic enzymes in low water activity. Aquat Biosyst. 2012;8:4. doi:10.1186/2046-9063-8-4.
Karan R, Capes MD, DasSarma P, DasSarma S. Cloning, overexpression, purification, and characterization of a polyextremophilic β-galactosidase from the Antarctic haloarchaeon Halorubrum lacusprofundi. BMC Biotechnol. 2013;13:3.
DasSarma S, Capes MD, Karan R, DasSarma P. Amino acid substitutions in cold-adapted proteins from Halorubrum lacusprofundi, an extremely halophilic microbe from antarctica. PLoS One. 2013;8:e58587.
Williams TJ, Allen MA, DeMaere MZ, Kyrpides NC, Tringe SG, Woyke T, Cavicchioli R. Microbial ecology of an Antarctic hypersaline lake: genomic assessment of ecophysiology among dominant haloarchaea. ISME J. 2014;8:1645–58. Erratum in: ISME J. 2014;8:1752.
Tschitschko B, Williams TJ, Allen MA, Zhong L, Raftery MJ, Cavicchioli R. Ecophysiological Distinctions of Haloarchaea from a Hypersaline Antarctic Lake as Determined by Metaproteomics. Appl Environ Microbiol. 2016;82:3165–73.
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ, Angiuoli SV, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26:541–7.
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci U S A. 1990;87:4576–9.
Garrity GM, Holt JG. Phylum AII. Euryarchaeota phy. nov. In: Boone DR, Castenholz RW, Garrity GM, editors. Bergey’s Manual of Systematic Bacteriology, vol. 1. 2nd ed. New York: Springer; 2001. p. 211–355.
List Editor. Validation of publication of new names and new combinations previously effectively published outside the IJSEM. Validation List no. 85. Int J Syst Evol Microbiol. 2002;52:685–90.
Grant WD, Kamekura M, McGenity TJ, Ventosa A. Class III. Halobacteria class. nov. In: Boone DR, Castenholz RW, Garrity GM, editors. Bergey’s Manual of Systematic Bacteriology, vol. 1. 2nd ed. New York: Springer; 2001. p. 294–334.
Grant WD, Larsen H. Group III. Extremely halophilic archaeobacteria. Order Halobacteriales ord. nov. In: Staley JT, Bryant MP, Pfennig N, Holt JG, editors. Bergey’s Manual of Systematic Bacteriology, vol. 3. 1st ed. Baltimore: The Williams & Watkins Co; 1989. p. 2216–28.
Gibbons NE. Family V. Halobacteriaceae fam. nov. In: Bergey’s Manual of Determinative Bacteriology. 8th ed. Baltimore: The Williams & Watkins Co; 1974. p. 279.
Ashburner M, Ball CA, BLake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25:25–9.
Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673–80.
Perrière G, Gouy M. WWW-Query: an on-line retrieval system for biological sequence banks. Biochim. 1996;78:364–9.
This work was performed under the auspices of the US Department of Energy’s Office of Science, Biological and Environmental Research Program, and by the University of California, Lawrence Berkeley National Laboratory under Contract No. DE-AC02-05CH11231, Lawrence Livermore National Laboratory under Contract No. DE-AC52-07NA27344, and Los Alamos National Laboratory under Contract No. DE-AC02-06NA25396. L. H. and M. L. were supported by the Department of Energy under contract DE-AC05-000R22725. P. D. and S. D. were supported by NASA grants NNX10AP47G and NNX15AM07G.
All authors contributed to the generation, analysis and interpretation of the data and manuscript preparation. PD cultured the microbe and prepared the genomic DNA. CW conceived the project. IA led the genomics efforts and IA, PD and SD prepared the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.