- Extended genome report
- Open Access
Draft genome of Myxosarcina sp. strain GI1, a baeocytous cyanobacterium associated with the marine sponge Terpios hoshinota
Standards in Genomic Sciencesvolume 10, Article number: 28 (2015)
To date, genome sequences (complete or in draft form) from only six baeocytous cyanobacteria in four genera have been reported: Xenococcus, Chroococcidiopsis, Pleurocapsa, and Stanieria. To expand our knowledge on the diversity of baeocytous cyanobacteria, this study sequenced the genome of GI1, which is a Myxosarcina-like baeocytous cyanobacterium. GI1 is of interest not only because of its phylogenetic niche, but also because it is a cyanobiont isolated from the marine cyanobacteriosponge Terpios hoshinota, which has been shown to cause the death of corals. The ~7 Mb draft GI1 genome contains 6,891 protein-coding genes and 62 RNA genes. A comparison of genomes among the sequenced baeocytous cyanobacterial strains revealed the existence or absence of numerous discrete genes involved in nitrogen metabolism. It will be interesting to determine whether these genes are important for cyanobacterial adaptations and interactions between cyanobionts and their marine sponge hosts.
In the latest (second) edition of Bergey’s Manual of Systematic Bacteriology, cyanobacteria are classified into five subsections (“orders”) . All members in Subsection II (order Pleurocapsales) reproduce (exclusively or partially) via multiple fission, which produces small reproductive cells called baeocytes ; these species are thus described as “baeocytous”. Baeocytous species are further divided into seven genera according to developmental characteristics, such as: the contribution of baeocyte formation to reproduction, the morphology of cell aggregates associated with successive binary fission in vegetative cells, and the presence of fibrous cell walls at the onset of baeocyte formation. The seven genera are Cyanocystis , Dermocarpella , Stanieria , Xenococcus , Chroococcidiopsis , Pleurocapsa , and Myxosarcina . The taxa in Subsection II present considerable diversity in terms of physiology and ecology. Most baeocytous species are solitary (free-living) entities, which can be found in water or on land. Intertidal zones show a particularly rich diversity of baeocytous species, most of which are epilithic or endolithic . A number of species associate with lichen  or sporadically occur as extracellular symbionts of marine sponges . Terpios hoshinota is a marine cyanobacteriosponge that infests coral reefs in west Indo-Pacific regions . T. hoshinota infestations have been named “black disease” because these sponges tend to overgrow live corals, resulting in the formation of black encrustations, which can spread within a few days and shut down photosynthesis. This causes the death of the coral, with none of the coral pulps able to regenerate following encrustation. In 2006, an unprecedented outbreak of black disease occurred in the waters of Green Island, located southeast of Taiwan. In that outbreak, more than 30% of coral were overgrown by sponge . Little is known about the nature of coral black disease, although Montipora aequituberculae corals appear to be particularly susceptible. T. hoshinota is associated with a substantial quantity of cyanobacteria, mainly Aphanocaps type, which lives intercellularly within the sponge . The cyanobacteria associated with T. hoshinota are able to perform photosynthesis; therefore, it has been suggested that cyanobionts provide nutrients to support the spread of their host . In this work, we succeeded in purifying a cyanobacterium associated with T. hoshinota from Green Island, called GI1. We then cultivated the organisms in the laboratory to study their taxonomy and physiology. Specifically, we describe the morphological, biochemical, and genomic properties of GI1, which resemble those of a Myxosarcina species . The genome sequence of GI1 may also provide insight into symbiotic interactions between cyanobionts and their marine sponge hosts.
Classification and features
A coral sample (Montipora sp.) overgrown by T. hoshinota was collected from the sub-tidal zone of Green Island in 2007. Black scrapings from the surface of the sample were suspended in sterile seawater and then streaked onto plates prepared by supplementing ASN-III medium  with 0.8% agar (ASN-III agar plates). After two months, only one type of cyanobacterium, characterized by a punctiform shape and blackish color, was found on the plate. This cyanobacterium was purified by successively transferring and streaking onto the same type of plates at two month intervals. An axenic culture was then established and added to our collection as strain GI1. This strain produced coccoid and motile baeocytes, which reacted photactically and lost mobility as they enlarged into spherical vegetative cells. Most of the vegetative cells performed successive binary fission in three planes, which resulted in the formation of cubic or irregular cell aggregates and eventually produced baeocytes (Figure 1). Baeocyte diameters (2.3 ± 0.2 μm) differed little from those of parental (mature) vegetative cells (3.7 ± 0.7 μm) that were preparing to release baeocytes. The vegetative cells in GI1 had an average volume only 4.2 times larger than that of a newly released baeocytes; thus, each vegetative cell could produce no more than 4 baeocytes. These characteristics suggest that GI1 belongs to the Myxosarcina . Phylogenetic analysis of 16S rRNA gene sequences led to the segregation of Subsection II cyanobacteria into two groups in the tree (Figure 2). The first group contained Chroococcidiopsis cluster 1, which is similar to heterocyst-forming cyanobacteria; the second group contained the bulk of Subsection II cyanobacteria, including GI1. Note that with high bootstrap support, GI1 did not form a sister clade with Myxosarcina PCC 7325, which was located in the same clade containing Pleurocapsa , Dermocarpella , and Stanieria cluster 2. Stanieria cluster 2 also failed to form a sister clade with Stanieria cluster 1. These observations suggest that the phylogeny of the 16S rRNA gene sequence is not consistent with the taxonomic relationships among baeocytous cyanobacteria. GI1 is a facultative photoheterotroph. Supplementing the ASN-III medium with yeast extract and glucose in 1 and 2 g/L−1 stimulated the growth of GI1 but inhibited the growth of Myxosarcina strain PCC 7312, indicating that the ability of GI1 to use organic resources exceeds that of PCC 7312. The classification and general features of Myxosarcina sp. strain GI1 are summarized in Table 1.
Genome sequencing information
Genome project history
The project information and its association with MIGS version 2.0 compliance  are summarized in Table 2. The genome was first sequenced in 2010 and this work provides a high-quality draft of genome. The assembled contigs have been deposited in NCBI.
Growth conditions and genomic DNA preparation
A single colony of GI1 was selected from the ASN-III agar plate and transferred into a 1 L serum bottle with 200 mL of ASN-III medium. The culture was shaken (90 rpm), aerated (0.2 volume per volume per minute, VVM), and illuminated laterally at 27.0 μmol photons · m−2 · s−1, as measured at the surface of the bottle. Cells were then cultivated in a 12:12 light–dark cycle until the late exponential phase of growth. The cells in each culture were harvested by centrifugation at 5,000 × g for 15 min, rinsed twice using 10 mL deionized water, and extracted using Tri-Total Nucleic Acid Isolation Reagent (Geneaid, New Taipei City, Taiwan) to obtain genomic DNA. Extraction was performed according to manufacturer guidelines. Genomic DNA of GI1 was quantified using the Quant-iT dsDNA BR Assay Kit (Invitrogen, Carlsbad, CA, USA) and quality checked on 0.6% agarose gel. Twenty micrograms of DNA was sheared using a Bioruptor ultrasonicator (Diagenode, Liège, Belgium) set at power on for 30 sec and power off for 30 sec. The fragmented DNA was then separated using either 1.5% or 0.6% agarose gel electrophoresis to obtain DNA strands of various lengths.
Genome sequencing and assembly
One paired-end (PE) and three mate-pair (MP) libraries of GI1 genomic DNA were prepared. The PE library (insert size: 143 ± 50 bp) was sequenced on Illumina GAIIx at the Bioresource Research Center, National Cheng Kung University, Taiwan. The three MP libraries (insert sizes of approximately 3, 5, and 9 kb) were sequenced on Illumina HiSeq 2000 at Yourgene Bioscience, Taiwan. Prior to assembly, low quality reads in the PE and 3 kb MP libraries were filtered out. A read was considered low quality if (1) it contained an unknown base “N”, (2) the lowest quality score was less than 30, or (3) more than 95% of the bases were identical. The reads of the 3 kb MP library were further trimmed to a length of 60 bp. The 5 kb and 9 kb MP libraries were prepared using Illumina’s Nextera kit. Reads containing the Nextera adaptor sequence were retained and the adaptor parts were removed. Resulting reads shorter than 40 bp or containing an unknown base “N” were further discarded. The processed reads of all four libraries accounted for an 1186X coverage based on a 7 Mb genome. These reads were assembled using ALLPATHS-LG (v47833) [9,10] with all parameters set to default.
Protein coding genes were predicted using Glimmer 3  and annotated using the RAST webserver [12,13]. The tRNA genes and rRNA genes were identified using tRNAscanSE (v1.3.1) [14,15] and RNAmmer (v1.2) , respectively. For annotations of COG, Pfam, TIGRfam, and PRK, Conserved Domain Database [17-23] was downloaded from NCBI and the predicted proteins were aligned to each dataset using RPSblast (v2.2.29) ; all parameters were set to default. For each protein, the best alignment (highest score) was selected for annotation. To facilitate genome comparison, protein and nucleotide sequences of the six baeocytous cyanobacterial strains were obtained from either NCBI FTP site ( Chroococcidiopsis thermalis PCC 7203, Pleurocapsa sp. PCC 7319 and PCC 7327, Stanieria cyanosphaera PCC 7437, and Xenococcus sp. PCC 7305) or JGI database ( Chroococcidiopsis sp. PCC 6712) for annotation as described above.
The draft genome of GI1 contained 7.06 M bp in 76 contigs (or 21 scaffolds); the N50 length of the contigs was 195,043 bp (Table 3). The GC content was 40.1%. Gene annotation revealed 6891 protein coding genes, 6 rRNA genes, and 56 tRNA genes. COG annotations of protein coding genes are presented in Table 4. Figure 3 presents the genome atlas of GI1.
Insights from the genome sequence
The seven baeocytous cyanobacterial genomes (including GI1) are compared in Table 5. By comparing COG annotations, we identified 13 genes that existed in all baeocytous cyanobacteria except GI1 (Table 6) and 36 genes that only appeared in GI1 (Table 7). Many products of these genes (e.g. UreE, SpeA, and GltD in Table 6 and ArgR, COG2070, HutG, COG4262, and NtrB in Table 7) are related to nitrogen metabolism. It can therefore be surmised that these genes participate in nitrogen cycles between cyanobionts and their hosts. Moreover, many GI1-specific genes are involved in processing a wide range of organic compounds as carbon, nitrogen, or energy sources. The putative products encoded by these genes include COG 2070 (dioxygenases related to 2-nitropropane dioxygenase) , HutG (N-formylglutamate amidohydrolase) , CelA (cellobiohydrolase A) , and Hdrc (heterodisulfide reductase, subunit C) . These enzymes are rarely found in cyanobacteria but are common among heterotrophic bacteria and fungi. Exploring the origins and functions of these genes in GI1 will no doubt produce interesting results.
The assembly and analysis of GI1 genome revealed distinctive genes involved in nitrogen metabolism and utilization of a large array of organic compounds. The GI1 genome is thus valuable for studying interactions between GI1 and its marine sponge host.
Volume per volume per minute
Castenholz RW. General characteristics of the cyanobacteria. In: Boone DR, Castenholz RW, editors. Bergey's Manual of Systematic Bacteriology. 2nd ed. New York: Springer; 2001. p. 474–87.
Rippka R, Waterbury JB, Herdman M, Castenholz RW. Subsection II. (Formerly Pleurocapsales Geitler 1925, emend. Waterbury and Stanier 1978). In: Boone DR, Castenholz RW, editors. Bergey's Manual of Systematic Bacteriology. 2nd edn. Volume 514–539. New York: Springer; 2001.
Büdel B, Henssen A. Chroococcidiopsis (Cyanophyceae), a phycobiont in the lichen family Lichinaceae1. Phycologia. 1983;22(4):367–75.
Alex A, Vasconcelos V, Tamagnini P, Santos A, Antunes A. Unusual symbiotic cyanobacteria association in the genetically diverse intertidal marine sponge Hymeniacidon perlevis (Demospongiae, Halichondrida). PLoS One. 2012;7(12):e51834.
Rützler K, Muzik K. Terpios hoshinota, a new cyanobacteriosponge threatening Pacific reefs. Sci Mar. 1993;57:395–403.
Liao MH, Tang SL, Hsu CM, Wen KC, Wu H, Chen WM, et al. The "black disease" of reef-building corals at Green Island, Taiwan - outbreak of cyanobacteriosponge, Terpios hoshinota (Suberitidae; Hadromerida). Zool Stud. 2007;46:520.
Waterbury JB, Stanier RY. Patterns of growth and development in Pleurocapsalean cyanobacteria. Microbiol Mol Biol Rev. 1978;42(1):2–44.
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26(5):541–7.
Butler J, MacCallum I, Kleber M, Shlyakhter IA, Belmonte MK, Lander ES, et al. ALLPATHS: de novo assembly of whole-genome shotgun microreads. Genome Res. 2008;18(5):810–20.
Maccallum I, Przybylski D, Gnerre S, Burton J, Shlyakhter I, Gnirke A, et al. ALLPATHS 2: small genomes assembled accurately and with high continuity from short paired reads. Genome Biol. 2009;10(10):R103.
Delcher AL, Bratke KA, Powers EC, Salzberg SL. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics. 2007;23(6):673–9.
Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75.
Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, et al. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res. 2014;42(Database issue):D206–14.
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25(5):955–64.
Schattner P, Brooks AN, Lowe TM. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005;33(Web Server issue):W686–9.
Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35(9):3100–8.
Marchler-Bauer A, Panchenko AR, Shoemaker BA, Thiessen PA, Geer LY, Bryant SH. CDD: a database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res. 2002;30(1):281–3.
Marchler-Bauer A, Anderson JB, DeWeese-Scott C, Fedorova ND, Geer LY, He S, et al. CDD: a curated Entrez database of conserved domain alignments. Nucleic Acids Res. 2003;31(1):383–7.
Marchler-Bauer A, Anderson JB, Cherukuri PF, DeWeese-Scott C, Geer LY, Gwadz M, et al. CDD: a Conserved Domain Database for protein classification. Nucleic Acids Res. 2005;33(Database issue):D192–6.
Marchler-Bauer A, Anderson JB, Derbyshire MK, DeWeese-Scott C, Gonzales NR, Gwadz M, et al. CDD: a conserved domain database for interactive domain family analysis. Nucleic Acids Res. 2007;35(Database issue):D237–40.
Marchler-Bauer A, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, et al. CDD: specific functional annotation with the Conserved Domain Database. Nucleic Acids Res. 2009;37(Database issue):D205–10.
Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, et al. CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res. 2011;39(Database issue):D225–9.
Derbyshire MK, Lanczycki CJ, Bryant SH, Marchler-Bauer A. Annotation of functional sites with the Conserved Domain Database. Database (Oxford) 2012;2012:bar058.
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–402.
Ha JY, Min JY, Lee SK, Kim HS, Kim DJ, Kim KH, et al. Crystal structure of 2-nitropropane dioxygenase complexed with FMN and substrate: Identification of the catalytic base. J Biol Chem. 2006;281(27):18660–7.
Hu L, Mulfinger LM, Phillips AT. Purification and properties of formylglutamate amidohydrolase from Pseudomonas putida. J Bacteriol. 1987;169:4696–702.
Wang H-C, Chen Y-C, Huang C-T, Hseu R-S. Cloning and characterization of a thermostable and pH-stable cellobiohydrolase from Neocallimastix patriciarum J11. Protein Expr Purif. 2013;90(2):153–9.
Mander GJ, Duin EC, Linder D, Stetter KO, Hedderich R. Purification and characterization of a membrane-bound enzyme complex from the sulfate-reducing archaeon Archaeoglobus fulgidus related to heterodisulfide reductase from methanogenic archaea. Eur J Biochem. 2002;269(7):1895–904.
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci. 1990;87(12):4576–9.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene Ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25–9.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10):2731–9.
We thank M. Ash for assistance in the editing of the manuscript. We are grateful to Ministry of Science and Technology (MOST 103-2221-E-006-180 and MOST 103-2313-B-006-002-MY3) and the Headquarters of University Advancement at National Cheng Kung University for supporting this research.
The authors declare that they have no competing interests.
YC and TL designed and carried out the experiments. CY, CL, HS, TC, and CH performed the data analysis and drafted the manuscript. All authors read and approved the final manuscript.