- Short report
- Open Access
High quality draft genome sequence of an extremely halophilic archaeon Natrinema altunense strain AJ2T
Standards in Genomic Sciencesvolume 12, Article number: 25 (2017)
Natrinema altunense strain AJ2T, a halophilic archaeal strain, was isolated from a high-altitude (3884 m) salt lake in Xinjiang, China. This strain requires at least 1.7 M NaCl to grow and can grow anaerobically in the presence of nitrate. To understand the genetics underlying its extreme phenotype, we de novo assembled the entire genome sequence of AJ2T (=CGMCC 1.3731T=JCM 12890T). We assembled 3,774,135 bp of a total of 4.4 Mb genome in only 20 contigs and noted its high GC content (64.6%). Subsequently we predicted the gene content and generated genome annotation to identify the relationship between the epigenetic characteristics and genomic features. The genome sequence contains 52 tRNA genes, 3 rRNA genes and 4,462 protein-coding genes, 3792 assigned as functional or hypothetical proteins in nr database. This Whole Genome Shotgun project was deposited in DDBJ/EMBL/GenBank under the accession JNCS00000000. We performed a Bayesian (Maximum-Likelihood) phylogenetic analysis using 16S rRNA sequence and obtained its relationship to other strains in the Natrinema and Haloterrigena genera. We also confirmed the ANI value between every two species of Natrinema and Haloterrigena genera. In conclusion, our analysis furthered our understanding of the extreme-environment adapted strain AJ2T by characterizing its genome structure, gene content and phylogenetic placement. Our detailed case study will contribute to our overall understanding of why Natrinema strains can survive in such a high-altitude salt lake.
When the genus Natrinema was first described in 1998, it contained two species, Natrinema pellirubrum and Natrinema pallidum . The genus Natrinema belongs to family Halobacteriaceae , phylum Euryarchaeota . Five more species of this genus were isolated and characterized since then, including N. versiforme , N. altunense , N. gari , N. ejinorense  and N. salaciae . For now, the genomic sequences of all but N. ejinorense and N. salaciae in the genus Natrinema are publicly available on Genomes Online Database  and/or NCBI Genbank. Our lab first identified the N. altunense strain AJ2T in 2005 in a salt lake . Living cells in salt lake have made numerous adaptations to this special ecosystem, allowing them to flourish in a very harsh environment. To determine if the AJ2T genome contains genes for adaptation to a particular set of environmental restrictions and supply a version of genome assembly in the database, we sequenced its whole genome in 2011 and published the whole genome sequence in the WGS database in May, 2014 as the first reported whole genome sequence of its species.
We isolated the strain AJ2T from a water sample collected from the edge of Ayakekum salt lake (37°37′ N, 89°29′ E) in Altun Mountain (Altyn-Tagh) National Nature Reserve in Xinjiang, China (Table 1). This salt lake is cold and exposed to strong ultraviolet radiation throughout the year due to its high altitude. It also has high salinity and lacks the common organic nutrients for microorganisms .
Classification and features
N. altunense strain AJ2T is an extremely halophilic archaea growing at 1.7–4.3 M NaCl and 0.005–1.0 M MgCl2. Colonies in the agar plate have a vivid orange or red colour. Cells are rod-shaped, but can become pleomorphic under unfavourable conditions as reported in 2005 . The 16S rRNA gene sequence analysis was submitted to the EzTaxon-e service  and revealed 95.77–98.50% sequence similarity to members of the genus Natrinema . Strain AJ2T exhibited the highest 16S rRNA gene sequence similarity with N. gari HIS40-3T (98.50%). Phylogenetic analysis based on 16S rRNA gene sequences showed that strain AJ2T clustered with most type strains of the genus Natrinema with a high bootstrap value (Fig. 1). The other three type strains, N. pellirubrum DSM 15624 T, N. salaciae MDB25T and N. ejinorense EJ-57T, were clustered with the genus Haloterrigena . In the 16S rRNA gene trees (Fig. 1) and rpoB’ (RNA polymerase subunit B′) gene trees , these three type strains of genus Natrinema showed unclear taxonomic positions . The mixture phylogenetic relationship between these strains in the Natrinema and Haloterrigena genera were reported in 2003 . This suggests that Haloterrigena maybe a later synonym (heterotypic) of genus Natrinema . The cell morphology and flagellum of N. altunense strain AJ2T were examined using transmission electron microscopy (JEM-1230, JEOL). The cells of strain AJ2T are straight and rods and have a diameter ranging 0.3–0.8 μm and length of 0.9–4.0 μm (Fig. 2). The cells are motile and their growth requires at least 1.7 M NaCl and 0.005–1 M MgCl2 (optimal 3.0–4.3 M NaCl and 0.05–0.2 M MgCl2). This strain is chemo-organotrophic and can anaerobically grow in the presence of nitrate. The strain had oxidase and catalase activity. The strain can reduce nitrate and nitrite and produce N2 gas. This strain can also hydrolyse gelatine and tweens 20, 40 and 80 as well as produce H2S from thiosulfate .
Genome sequencing information
Genome project history
We selected N. altunense AJ2T for sequencing because its halophilic properties and high-altitude habitat may have caused interesting changes in its genome. Additionally, the five other members of genus Natrinema were sequenced and could be compared to our sequence (Table 2). This Whole Genome Shotgun project has been deposited in the DDBJ/EMBL/GenBank under the accession JNCS00000000. The version described in this paper is version JNCS00000000.1. Table 3 presents the project information and its association with MIGS version 2.0 compliance .
Growth conditions and genomic DNA preparation
N. altunense strain AJ2T was aerobically cultivated at 37 °C for 3 days in modified CM medium, which contained the following (per liter distilled water): 7.5 g Casamino acid (Bacto), 10 g yeast extract (OXOID), 3 g trisodium citrate, 2 g KCl, 20 g MgSO4 · 7H2O and 200 g NaCl (pH 7.2). Genomic DNA was extracted according to the method described by Marmur & Doty . The cells were suspended from 250 ml CM medium and washed once with 20% (w/v) NaCl solution. After extraction, the genomic DNA was dissolved in 1 ml of TE buffer. The quality and quantity of the genomic DNA was determined by 0.7% agarose gel electrophoresis with λ-Hind III digest and λ-EcoT14 I digest DNA marker (TaKaRa, Dalian, China) as well as by the DU800 spectrophotometer (Beckman Coulter, Inc.) with the nucleotide acid analysis method. The OD260/280 of genomic DNA was 1.92.
Genome sequencing and assembly
The next-generation genome sequencing of N. altunense strain AJ2T and quality control was performed using pyrosequencing technology on a GS FLX+ system (454 Life Sciences, Roche). One library with an insert size 2,000 bp was constructed and a total of 380 Mb clean data was obtained after filtering the adapter, artificial or low quality sequence. In other words we sequenced for a genome-wide average coverage of 87. A total of 630,866 reads were used for assembly and produced 20 contigs using the Newbler v.2.5 (454 Life Sciences, Roche). The average contig size was 188,706 bp and the largest contig size was 837,556 bp with the N50 size of 425,349 bp.
The tRNA genes of strain AJ2T were identified using tRNAscan-SE 1.21  with an archaeal model, and its rRNA genes were found via RNAmmer 1.2 Server . Other ORFs were predicted using Glimmer3 . The predicted ORFs were translated and analysed using the BLASTp program (BLAST 2.2.26+) against the non-redundant, Swiss-Prot , Pfam  and COG  databases. Only results with an e-value smaller than 1 × e−5 were kept. For cross-validation purposes, we annotated the genome with a RAST server online . KAAS  was used to assign the predicted amino acids into the KEGG Pathway  with the BBH method. Genes with transmembrane helices were predicted using TMHMM Server v.2.0 . We attempted to predict signal peptides using SignalP 4.1 Server , but because there were not enough experimentally confirmed signal peptides in the Uni-Prot database , the online server failed to provide the archaeal group model. The circular map of the genome was obtained using a local CGView application  with adjusted parameters (−size medium -title ‘AJ2T’ -draw_divider_rings T -gene_decoration arc -linear circular). We uploaded the whole genome sequences in FASTA files and calculated the ANI value between every two genome sequences within the genus Natrinema and Haloterrigena on the EzGenome online server [25, 26]. Genome accession numbers for all five published Natrinema and Haloterrigena strains are listed as follows: N. altunense AJ2 (JNCS00000000); N. versiforme JCM 10478 (AOID00000000); N. pallidum DSM 3751 (AOII00000000); N. pellirubrum DSM 15624 (CP003372); N. gari JCM 14663 (AOIJ00000000); H. thermotolerans DSM 11522 (AOIR00000000); H. salina JCM 13891 (AOIS00000000); H. limicola JCM 13563 (AOIT00000000); H. turkmenica DSM 5511 (CP001860); and H. jeotgali A29 (JDTG00000000). Unless otherwise specified, we used default parameters for all software.
This high-quality draft genome sequence of N. altunense AJ2T revealed a genome size of 3,774,135 bp (all 20 contigs length, 64.56% GC content). We predicted 4517 genes; 4462 are protein-coding sequences. A total of 3792 protein-coding genes (83.95%) were assigned to a putative function or as hypothetical proteins. We also found 52 tRNA genes (removed 1 Pseudo tRNA) and 3 rRNA genes (one 23 S rRNA, one 16 S rRNA and one 5 S rRNA). We assigned 1929 protein-coding genes (42.71%) to Pfam domains and categorized 2255 (49.92%) protein-coding genes into COGs functional groups (Table 4 and Fig. 3). This genome has a gene content redundancy of 36.11%, and there are 1631 protein coding genes belonging to 540 paralog clusters. The genomic ANI values within the Natrinema and Haloterrigena genera are listed in Table 5. In the Richter & Rosselló-Móra report, the proposed ANI cut-off for the species boundary is at 95 ~ 96% . According to our calculation data, the ANI values between any two species of Natrinema with published genome sequences were lower than 93.2% and this value was observed between strains AJ2T and Natrinema pallidum DSM 3751 T. We can also easily observe that N. pellirubrum show higher ANI values (>95%) with H. thermotolerans DSM 11522 T (95.4%) and H. jeotgali A29T (95.2%). These data are also identical to the phylogenetic distance in the 16S rRNA maximum-likelihood tree (Fig. 1). In the tree, the other two strains N. salaciae MDB25T and N. ejinorense EJ-57T, which are in the same clade as genus Haloterrigena , lack of genome information for considering their ANI values in this study.
Insights from the genome sequence
We compared all sequenced strains in the genus Natrinema with strain AJ2T according to the contig numbers, G + C content, predicted protein numbers, total length and N50, which are listed below (Table 6). The other relevant genomic features were listed in Table 7. According to the chemotaxonomic information and characteristic features of strain AJ2T that was mentioned before, the strain contains a flagellin domain protein in its genomic features to support cell motility. It also has DNA repair systems for protecting the stability of its genome from potential damage caused by UV radiation. Additionally, the energy converting system and light-driven pumps are introduced below.
The strict living environment and lack of nutritious carbon/nitrogen sources cause diversification of metabolic pathway strain AJ2T and similar halophilic archaea, as well as for haloarchaea, with more resources. Strain AJ2T might use sunlight to produce ATP. We predicted the existence of two light-energy-converting system genes in the AJ2T genome, namely bop and hop. The two encode homologous proteins bacteriorhodopsin and halorhodopsin, respectively. Bacteriorhodopsin and halorhodopsin share 36% of the amino acid residues in the transmembrane part and 19% in the surface connecting loops .
Bacteriorhodopsin is an integral membrane protein, called purple membrane, located in the archaea cell membrane, and it acts as a light-driven proton pump. It is mainly found in the Halobacteriaceae family [28, 29]. It captures and uses light energy to move protons out of the cell membrane, resulting in a proton electrochemical gradient. Subsequently, the gradient is converted into chemical energy through ATP synthesis or is used to fuel flagellar motility and other energy requiring processes . We obtained the complete bop gene (AY279548, JQ406920, and AFB77278) in the strain AJ2T by the LPA method. We then successfully expressed the AJ2T bacteriorhodopsin protein in E.coli BL21 with recombinant pET28a plasmid. This result indicates that the prediction of the bop gene is correct. Halorhodopsin is a light-activated chloride pump that is also found in archaea. It utilizes light to transfer the chloride ions into the cytoplasm and increase the electrochemical potential of the proton gradient . This gene is extremely important for salty environment tolerance and, by reporting the existence of a hop gene in the N. altunense strain AJ2T, we shed light on the potential mechanism of its adaptation to high salinity.
Bacteriorhodopsin, halorhodopsin and several related bacterio-opsin activator HTH domain proteins were also found in the other sequenced type strains N. pellirubrum , N. pallidum , N. gari and strain Natrinema sp. J7-2 (listed in Table 8). As the haloarchaea species of the genus Natrinema typically live in similar environment, this type of bacteriorhodopsin/halorhodopsin-based phototrophy can help them adapt to extremely hypersaline and oligotrophic niches.
The genome of strain AJ2T did not have the longest length in the sequenced strains of Natrinema , but it had most predicted proteins. Meanwhile, the assembled result in the strain AJ2T had the lowest contig numbers and largest N50 length. This indicated the larger size of the library (2000 bp library) and the longer read length (up to 1000 bp with an average read length 603 bp) may significantly improve the assembling quality.
Our genomic analysis of strain AJ2T shed light on its ability to survive in the Ayakekum salt lake of Altun Mountain National Nature Reserve in Xinjiang, China. This lake is regarded as a relatively extreme environment with low nutrient levels, a cool temperature, strong sunlight and high-altitude. We found evidence for an alternative energy converting system to gain a supplementary energy source. The energy converting system, bacteriorhodopsin, halorhodopsin and HTH domain proteins, were also found in comparison it to all other sequenced strains in the genus Natrinema and they mostly share this energy-producing pathway.
More intensive study and data-mining need to be considered in genomes of the genus Natrinema or another halophilic archaeon. Then, we might find some reasons for these ancient archaeon to have so much vitality and prosperity in extreme environment on planet Earth.
Average Nucleotide Identity
Bi-directional Best Hit
KEGG Automatic Annotation Server
Ligation-mediated PCR Amplification
McGenity TJ, Gemmell RT, Grant WD. Proposal of a new halobacterial genus Natrinema gen. nov., with two species Natrinema pellirubrum nom. nov. and Natrinema pallidum nom. nov. Int J Syst Bacteriol. 1998;48:1187–96.
Xin H, Itoh T, Zhou P, Suzuki K-i, Kamekura M, Nakase T. Natrinema versiforme sp. nov., an extremely halophilic archaeon from Aibi salt lake, Xinjiang, China. Int J Syst Evol Microbiol. 2000;50:1297–303.
Xu X-W, Ren P-G, Liu S-J, Wu M, Zhou P-J. Natrinema altunense sp. nov., an extremely halophilic archaeon isolated from a salt lake in Altun Mountain in Xinjiang, China. Int J Syst Evol Microbiol. 2005;55:1311–4.
Tapingkae W, Tanasupawat S, Itoh T, Parkin KL, Benjakul S, Visessanguan W, Valyasevi R. Natrinema gari sp. nov., a halophilic archaeon isolated from fish sauce in Thailand. Int J Syst Evol Microbiol. 2008;58:2378–83.
Castillo A, Gutiérrez M, Kamekura M, Xue Y, Ma Y, Cowan D, Jones B, Grant W, Ventosa A. Natrinema ejinorense sp. nov., isolated from a saline lake in Inner Mongolia, China. Int J Syst Evol Microbiol. 2006;56:2683–7.
Albuquerque L, Taborda M, La Cono V, Yakimov M, da Costa MS. Natrinema salaciae sp. nov., a halophilic archaeon isolated from the deep, hypersaline anoxic Lake Medee in the Eastern Mediterranean Sea. Syst Appl Microbiol. 2012;35:368–73.
Pagani I, Liolios K, Jansson J, Chen I-MA, Smirnova T, Nosrat B, Markowitz VM, Kyrpides NC. The Genomes OnLine Database (GOLD) v. 4: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res. 2012;40:D571–9.
Kim B-C, Poo H, Lee KH, Kim MN, Park D-S, Oh HW, Lee JM, Shin K-S. Simiduia areninigrae sp. nov., an agarolytic bacterium isolated from sea sand. Int J Syst Evol Microbiol. 2012;62:906–11.
Tindall B. Taxonomic problems arising in the genera Haloterrigena and Natrinema. Int J Syst Evol Microbiol. 2003;53:1697–8.
Minegishi H, Kamekura M, Itoh T, Echigo A, Usami R, Hashimoto T. Further refinement of the phylogeny of the Halobacteriaceae based on the full-length RNA polymerase subunit B′(rpoB′) gene. Int J Syst Evol Microbiol. 2010;60:2398–408.
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ, Angiuoli SV, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26:541–7.
Marmur J, Doty P. Thermal renaturation of deoxyribonucleic acids. J Mol Biol. 1961;3:585–94.
Schattner P, Brooks AN, Lowe TM. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005;33:W686–9.
Lagesen K, Hallin P, Rødland EA, Stærfeldt H-H, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35:3100–8.
Delcher AL, Bratke KA, Powers EC, Salzberg SL. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics. 2007;23:673–9.
Boeckmann B, Bairoch A, Apweiler R, Blatter M-C, Estreicher A, Gasteiger E, Martin MJ, Michoud K, O’Donovan C, Phan I. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003;31:365–70.
Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, Heger A, Hetherington K, Holm L, Mistry J, et al. Pfam: the protein families database. Nucleic Acids Res. 2014;42:D222–30.
Tatusov RL, Natale DA, Garkavtsev IV, Tatusova TA, Shankavaram UT, Rao BS, Kiryutin B, Galperin MY, Fedorova ND, Koonin EV. The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 2001;29:22–8.
Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75.
Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007;35:W182–5.
Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30.
Krogh A, Larsson B, Von Heijne G, Sonnhammer EL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001;305:567–80.
Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011;8:785–6.
Stothard P, Wishart DS. Circular genome visualization and exploration using CGView. Bioinformatics. 2005;21:537–9.
Richter M, Rosselló-Móra R. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci. 2009;106:19126–31.
Goris J, Konstantinidis KT, Klappenbach JA, Coenye T, Vandamme P, Tiedje JM. DNA–DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol. 2007;57:81–91.
Blanck A, Oesterhelt D. The halo-opsin gene. II. Sequence, primary structure of halorhodopsin and comparison with bacteriorhodopsin. EMBO J. 1987;6:265–73.
Dunn R, McCoy J, Simsek M, Majumdar A, Chang SH, RajBhandary UL, Khorana HG. The bacteriorhodopsin gene. Proc Natl Acad Sci. 1981;78:6744–8.
Trivedi S, Prakash Choudhary O, Gharu J. Different proposed applications of bacteriorhodopsin. Recent Pat DNA Gene Seq. 2011;5:35–40.
Oesterhelt D, Stoeckenius W. Functions of a new photoreceptor membrane. Proc Natl Acad Sci. 1973;70:2853–7.
Sharma AK, Walsh DA, Bapteste E, Rodriguez-Valera F, Doolittle WF, Papke RT. Evolution of rhodopsin ion pumps in haloarchaea. BMC Evol Biol. 2007;7:79.
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci. 1990;87:4576–9.
Validation of publication of new names and new combinations previously effectively published outside the IJSEM. International Journal of Systematic and Evolutionary Microbiology. Int J Syst Evol Microbiol. 2002;52:685–690. http://ijs.microbiologyresearch.org/content/journal/ijsem/10.1099/00207713-52-3-685.
Garrity GM, Holt JG. Phylum AII. Euryarchaeota phy. nov. In: Boone DR, Castenholz RW, Garrity GM, editors. Bergey’s Manual® of Systematic Bacteriology, The Archaea and the deeply branching and phototrophic Bacteria, vol. 1. Secondth ed. New York: Springer; 2001. p. 211–355.
Grant WD, Larsen H. Group III. Extremely halophilic archaeobacteria. Order Halobacteriales ord. nov. In: Staley JT, Bryant MP, Pfennig N, Holt JG, editors. Bergey’s Manual® of Systematic Bacteriology, vol. 3. firstth ed. Baltimore: The Williams & Wilkins Co; 1989. p. 2216–8.
Validation of the Publication of New Names and New Combinations Previously Effectively Published Outside the IJSB: List No. 31. Int J Syst Bacteriol. 1989;39:495–497.
Skerman VBD, McGowan V, Sneath PHA. Approved Lists of Bacterial Names. Int J Syst Bacteriol. 1980;30:225–420.
Gibbons NE. Family V. Halobacteriaceae Fam. nov. In: Buchanan RE, Gibbons NE, editors. Bergey’s Manual of Determinative Bacteriology. eightth ed. Baltimore: The Williams & Wilkins Co; 1974. p. 269.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. Gene Ontol Consortium Nat Genet. 2000;25:25–9.
Pruesse E, Peplies J, Glöckner FO. SINA: Accurate high-throughput multiple sequence alignment of ribosomal RNA genes. Bioinformatics. 2012;28:1823–9.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30:2725–9.
Jukes TH, Cantor CR. Mammalian Protein Metabolism: Evolution of protein molecules. 3rd ed. New York: Academic; 1969. p. 21–132.
We would like to thank Qi-Lan Wang for help with coding custom scripts and performing gene annotations. We also thank Chenling Antelope for language editing and her advice about the phylogenetic analysis. This work was supported by the National Natural Science Foundation of China (Project No. 31170001, No. 31470005, No. 41276173), Zhejiang Provincial Natural Science Foundation of China (No. LQ13D060002) and Scientific Research Fund of the Second Institute of Oceanography, SOA (No. JT1305).
Hong Cheng designed and performed experiments, analysed the data and wrote the paper; Ying-Yi Huo performed experiments and edited the paper; Jing Hu collected and analysed genome data; and Xue-Wei Xu and Min Wu conceived of the experiments and wrote the paper. All authors read and approved of the final manuscript.
The authors declare that they have no competing interests.