Draft genome of Myxosarcina sp. strain GI1, a baeocytous cyanobacterium associated with the marine sponge Terpios hoshinota

Yu, Chun-Hui; Lu, Chung-Kuang; Su, Huang-Ming; Chiang, Tzen-Yuh; Hwang, Chi-Chuan; Liu, Tsunglin; Chen, Yi-Min

doi:10.1186/s40793-015-0011-3

Extended genome report
Open access
Published: 27 May 2015

Draft genome of Myxosarcina sp. strain GI1, a baeocytous cyanobacterium associated with the marine sponge Terpios hoshinota

Chun-Hui Yu^1,2,
Chung-Kuang Lu^3,4,
Huang-Ming Su⁵,
Tzen-Yuh Chiang⁶,
Chi-Chuan Hwang²,
Tsunglin Liu¹ &
…
Yi-Min Chen⁵

Standards in Genomic Sciences volume 10, Article number: 28 (2015) Cite this article

Abstract

To date, genome sequences (complete or in draft form) from only six baeocytous cyanobacteria in four genera have been reported: Xenococcus, Chroococcidiopsis, Pleurocapsa, and Stanieria. To expand our knowledge on the diversity of baeocytous cyanobacteria, this study sequenced the genome of GI1, which is a Myxosarcina-like baeocytous cyanobacterium. GI1 is of interest not only because of its phylogenetic niche, but also because it is a cyanobiont isolated from the marine cyanobacteriosponge Terpios hoshinota, which has been shown to cause the death of corals. The ~7 Mb draft GI1 genome contains 6,891 protein-coding genes and 62 RNA genes. A comparison of genomes among the sequenced baeocytous cyanobacterial strains revealed the existence or absence of numerous discrete genes involved in nitrogen metabolism. It will be interesting to determine whether these genes are important for cyanobacterial adaptations and interactions between cyanobionts and their marine sponge hosts.

Introduction

In the latest (second) edition of Bergey’s Manual of Systematic Bacteriology, cyanobacteria are classified into five subsections (“orders”) [1]. All members in Subsection II (order Pleurocapsales) reproduce (exclusively or partially) via multiple fission, which produces small reproductive cells called baeocytes [2]; these species are thus described as “baeocytous”. Baeocytous species are further divided into seven genera according to developmental characteristics, such as: the contribution of baeocyte formation to reproduction, the morphology of cell aggregates associated with successive binary fission in vegetative cells, and the presence of fibrous cell walls at the onset of baeocyte formation. The seven genera are Cyanocystis , Dermocarpella , Stanieria , Xenococcus , Chroococcidiopsis , Pleurocapsa , and Myxosarcina [2]. The taxa in Subsection II present considerable diversity in terms of physiology and ecology. Most baeocytous species are solitary (free-living) entities, which can be found in water or on land. Intertidal zones show a particularly rich diversity of baeocytous species, most of which are epilithic or endolithic [2]. A number of species associate with lichen [3] or sporadically occur as extracellular symbionts of marine sponges [4]. Terpios hoshinota is a marine cyanobacteriosponge that infests coral reefs in west Indo-Pacific regions [5]. T. hoshinota infestations have been named “black disease” because these sponges tend to overgrow live corals, resulting in the formation of black encrustations, which can spread within a few days and shut down photosynthesis. This causes the death of the coral, with none of the coral pulps able to regenerate following encrustation. In 2006, an unprecedented outbreak of black disease occurred in the waters of Green Island, located southeast of Taiwan. In that outbreak, more than 30% of coral were overgrown by sponge [6]. Little is known about the nature of coral black disease, although Montipora aequituberculae corals appear to be particularly susceptible. T. hoshinota is associated with a substantial quantity of cyanobacteria, mainly Aphanocaps type, which lives intercellularly within the sponge [5]. The cyanobacteria associated with T. hoshinota are able to perform photosynthesis; therefore, it has been suggested that cyanobionts provide nutrients to support the spread of their host [6]. In this work, we succeeded in purifying a cyanobacterium associated with T. hoshinota from Green Island, called GI1. We then cultivated the organisms in the laboratory to study their taxonomy and physiology. Specifically, we describe the morphological, biochemical, and genomic properties of GI1, which resemble those of a Myxosarcina species [2]. The genome sequence of GI1 may also provide insight into symbiotic interactions between cyanobionts and their marine sponge hosts.

Organism information

Classification and features

A coral sample (Montipora sp.) overgrown by T. hoshinota was collected from the sub-tidal zone of Green Island in 2007. Black scrapings from the surface of the sample were suspended in sterile seawater and then streaked onto plates prepared by supplementing ASN-III medium [7] with 0.8% agar (ASN-III agar plates). After two months, only one type of cyanobacterium, characterized by a punctiform shape and blackish color, was found on the plate. This cyanobacterium was purified by successively transferring and streaking onto the same type of plates at two month intervals. An axenic culture was then established and added to our collection as strain GI1. This strain produced coccoid and motile baeocytes, which reacted photactically and lost mobility as they enlarged into spherical vegetative cells. Most of the vegetative cells performed successive binary fission in three planes, which resulted in the formation of cubic or irregular cell aggregates and eventually produced baeocytes (Figure 1). Baeocyte diameters (2.3 ± 0.2 μm) differed little from those of parental (mature) vegetative cells (3.7 ± 0.7 μm) that were preparing to release baeocytes. The vegetative cells in GI1 had an average volume only 4.2 times larger than that of a newly released baeocytes; thus, each vegetative cell could produce no more than 4 baeocytes. These characteristics suggest that GI1 belongs to the Myxosarcina [2]. Phylogenetic analysis of 16S rRNA gene sequences led to the segregation of Subsection II cyanobacteria into two groups in the tree (Figure 2). The first group contained Chroococcidiopsis cluster 1, which is similar to heterocyst-forming cyanobacteria; the second group contained the bulk of Subsection II cyanobacteria, including GI1. Note that with high bootstrap support, GI1 did not form a sister clade with Myxosarcina PCC 7325, which was located in the same clade containing Pleurocapsa , Dermocarpella , and Stanieria cluster 2. Stanieria cluster 2 also failed to form a sister clade with Stanieria cluster 1. These observations suggest that the phylogeny of the 16S rRNA gene sequence is not consistent with the taxonomic relationships among baeocytous cyanobacteria. GI1 is a facultative photoheterotroph. Supplementing the ASN-III medium with yeast extract and glucose in 1 and 2 g/L⁻¹ stimulated the growth of GI1 but inhibited the growth of Myxosarcina strain PCC 7312, indicating that the ability of GI1 to use organic resources exceeds that of PCC 7312. The classification and general features of Myxosarcina sp. strain GI1 are summarized in Table 1.

Table 1 Classification and general features of Myxosarcina sp. strain GI1 according to MIGS recommendations [8]

Full size table

Genome sequencing information

Genome project history

The project information and its association with MIGS version 2.0 compliance [8] are summarized in Table 2. The genome was first sequenced in 2010 and this work provides a high-quality draft of genome. The assembled contigs have been deposited in NCBI.

Table 2 Project information

Full size table

Growth conditions and genomic DNA preparation

A single colony of GI1 was selected from the ASN-III agar plate and transferred into a 1 L serum bottle with 200 mL of ASN-III medium. The culture was shaken (90 rpm), aerated (0.2 volume per volume per minute, VVM), and illuminated laterally at 27.0 μmol photons · m⁻² · s⁻¹, as measured at the surface of the bottle. Cells were then cultivated in a 12:12 light–dark cycle until the late exponential phase of growth. The cells in each culture were harvested by centrifugation at 5,000 × g for 15 min, rinsed twice using 10 mL deionized water, and extracted using Tri-Total Nucleic Acid Isolation Reagent (Geneaid, New Taipei City, Taiwan) to obtain genomic DNA. Extraction was performed according to manufacturer guidelines. Genomic DNA of GI1 was quantified using the Quant-iT dsDNA BR Assay Kit (Invitrogen, Carlsbad, CA, USA) and quality checked on 0.6% agarose gel. Twenty micrograms of DNA was sheared using a Bioruptor ultrasonicator (Diagenode, Liège, Belgium) set at power on for 30 sec and power off for 30 sec. The fragmented DNA was then separated using either 1.5% or 0.6% agarose gel electrophoresis to obtain DNA strands of various lengths.

Genome sequencing and assembly

One paired-end (PE) and three mate-pair (MP) libraries of GI1 genomic DNA were prepared. The PE library (insert size: 143 ± 50 bp) was sequenced on Illumina GAIIx at the Bioresource Research Center, National Cheng Kung University, Taiwan. The three MP libraries (insert sizes of approximately 3, 5, and 9 kb) were sequenced on Illumina HiSeq 2000 at Yourgene Bioscience, Taiwan. Prior to assembly, low quality reads in the PE and 3 kb MP libraries were filtered out. A read was considered low quality if (1) it contained an unknown base “N”, (2) the lowest quality score was less than 30, or (3) more than 95% of the bases were identical. The reads of the 3 kb MP library were further trimmed to a length of 60 bp. The 5 kb and 9 kb MP libraries were prepared using Illumina’s Nextera kit. Reads containing the Nextera adaptor sequence were retained and the adaptor parts were removed. Resulting reads shorter than 40 bp or containing an unknown base “N” were further discarded. The processed reads of all four libraries accounted for an 1186X coverage based on a 7 Mb genome. These reads were assembled using ALLPATHS-LG (v47833) [9,10] with all parameters set to default.

Genome annotation

Protein coding genes were predicted using Glimmer 3 [11] and annotated using the RAST webserver [12,13]. The tRNA genes and rRNA genes were identified using tRNAscanSE (v1.3.1) [14,15] and RNAmmer (v1.2) [16], respectively. For annotations of COG, Pfam, TIGRfam, and PRK, Conserved Domain Database [17-23] was downloaded from NCBI and the predicted proteins were aligned to each dataset using RPSblast (v2.2.29) [24]; all parameters were set to default. For each protein, the best alignment (highest score) was selected for annotation. To facilitate genome comparison, protein and nucleotide sequences of the six baeocytous cyanobacterial strains were obtained from either NCBI FTP site ( Chroococcidiopsis thermalis PCC 7203, Pleurocapsa sp. PCC 7319 and PCC 7327, Stanieria cyanosphaera PCC 7437, and Xenococcus sp. PCC 7305) or JGI database ( Chroococcidiopsis sp. PCC 6712) for annotation as described above.

Genome properties

The draft genome of GI1 contained 7.06 M bp in 76 contigs (or 21 scaffolds); the N50 length of the contigs was 195,043 bp (Table 3). The GC content was 40.1%. Gene annotation revealed 6891 protein coding genes, 6 rRNA genes, and 56 tRNA genes. COG annotations of protein coding genes are presented in Table 4. Figure 3 presents the genome atlas of GI1.

Table 3 Genome statistics

Full size table

Table 4 Number of genes associated with the 25 general COG functional categories

Full size table

Insights from the genome sequence

The seven baeocytous cyanobacterial genomes (including GI1) are compared in Table 5. By comparing COG annotations, we identified 13 genes that existed in all baeocytous cyanobacteria except GI1 (Table 6) and 36 genes that only appeared in GI1 (Table 7). Many products of these genes (e.g. UreE, SpeA, and GltD in Table 6 and ArgR, COG2070, HutG, COG4262, and NtrB in Table 7) are related to nitrogen metabolism. It can therefore be surmised that these genes participate in nitrogen cycles between cyanobionts and their hosts. Moreover, many GI1-specific genes are involved in processing a wide range of organic compounds as carbon, nitrogen, or energy sources. The putative products encoded by these genes include COG 2070 (dioxygenases related to 2-nitropropane dioxygenase) [25], HutG (N-formylglutamate amidohydrolase) [26], CelA (cellobiohydrolase A) [27], and Hdrc (heterodisulfide reductase, subunit C) [28]. These enzymes are rarely found in cyanobacteria but are common among heterotrophic bacteria and fungi. Exploring the origins and functions of these genes in GI1 will no doubt produce interesting results.

Table 5 Genome statistics comparison among baeocytous cyanobacteria

Full size table

Table 6 Putative gene products (obtained from genome wide COG assignment) existing in all sequenced baeocytous cyanobacteria except GI1

Full size table

Table 7 Putative gene products (obtained from genome wide COG assignment) that only appeared in GI1

Full size table

Conclusions

The assembly and analysis of GI1 genome revealed distinctive genes involved in nitrogen metabolism and utilization of a large array of organic compounds. The GI1 genome is thus valuable for studying interactions between GI1 and its marine sponge host.

Abbreviations

MP:: Mate-pair
PE:: Paired-end
VVM:: Volume per volume per minute

References

Castenholz RW. General characteristics of the cyanobacteria. In: Boone DR, Castenholz RW, editors. Bergey's Manual of Systematic Bacteriology. 2nd ed. New York: Springer; 2001. p. 474–87.
Google Scholar
Rippka R, Waterbury JB, Herdman M, Castenholz RW. Subsection II. (Formerly Pleurocapsales Geitler 1925, emend. Waterbury and Stanier 1978). In: Boone DR, Castenholz RW, editors. Bergey's Manual of Systematic Bacteriology. 2nd edn. Volume 514–539. New York: Springer; 2001.
Büdel B, Henssen A. Chroococcidiopsis (Cyanophyceae), a phycobiont in the lichen family Lichinaceae1. Phycologia. 1983;22(4):367–75.
Article Google Scholar
Alex A, Vasconcelos V, Tamagnini P, Santos A, Antunes A. Unusual symbiotic cyanobacteria association in the genetically diverse intertidal marine sponge Hymeniacidon perlevis (Demospongiae, Halichondrida). PLoS One. 2012;7(12):e51834.
Article PubMed Central CAS PubMed Google Scholar
Rützler K, Muzik K. Terpios hoshinota, a new cyanobacteriosponge threatening Pacific reefs. Sci Mar. 1993;57:395–403.
Google Scholar
Liao MH, Tang SL, Hsu CM, Wen KC, Wu H, Chen WM, et al. The "black disease" of reef-building corals at Green Island, Taiwan - outbreak of cyanobacteriosponge, Terpios hoshinota (Suberitidae; Hadromerida). Zool Stud. 2007;46:520.
Google Scholar
Waterbury JB, Stanier RY. Patterns of growth and development in Pleurocapsalean cyanobacteria. Microbiol Mol Biol Rev. 1978;42(1):2–44.
CAS Google Scholar
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26(5):541–7.
Article PubMed Central CAS PubMed Google Scholar
Butler J, MacCallum I, Kleber M, Shlyakhter IA, Belmonte MK, Lander ES, et al. ALLPATHS: de novo assembly of whole-genome shotgun microreads. Genome Res. 2008;18(5):810–20.
Article PubMed Central CAS PubMed Google Scholar
Maccallum I, Przybylski D, Gnerre S, Burton J, Shlyakhter I, Gnirke A, et al. ALLPATHS 2: small genomes assembled accurately and with high continuity from short paired reads. Genome Biol. 2009;10(10):R103.
Article PubMed Central PubMed Google Scholar
Delcher AL, Bratke KA, Powers EC, Salzberg SL. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics. 2007;23(6):673–9.
Article PubMed Central CAS PubMed Google Scholar
Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75.
Article PubMed Central PubMed Google Scholar
Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, et al. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res. 2014;42(Database issue):D206–14.
Article PubMed Central CAS PubMed Google Scholar
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25(5):955–64.
Article PubMed Central CAS PubMed Google Scholar
Schattner P, Brooks AN, Lowe TM. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005;33(Web Server issue):W686–9.
Article PubMed Central CAS PubMed Google Scholar
Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35(9):3100–8.
Article PubMed Central CAS PubMed Google Scholar
Marchler-Bauer A, Panchenko AR, Shoemaker BA, Thiessen PA, Geer LY, Bryant SH. CDD: a database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res. 2002;30(1):281–3.
Article PubMed Central CAS PubMed Google Scholar
Marchler-Bauer A, Anderson JB, DeWeese-Scott C, Fedorova ND, Geer LY, He S, et al. CDD: a curated Entrez database of conserved domain alignments. Nucleic Acids Res. 2003;31(1):383–7.
Article PubMed Central CAS PubMed Google Scholar
Marchler-Bauer A, Anderson JB, Cherukuri PF, DeWeese-Scott C, Geer LY, Gwadz M, et al. CDD: a Conserved Domain Database for protein classification. Nucleic Acids Res. 2005;33(Database issue):D192–6.
Article PubMed Central CAS PubMed Google Scholar
Marchler-Bauer A, Anderson JB, Derbyshire MK, DeWeese-Scott C, Gonzales NR, Gwadz M, et al. CDD: a conserved domain database for interactive domain family analysis. Nucleic Acids Res. 2007;35(Database issue):D237–40.
Article PubMed Central CAS PubMed Google Scholar
Marchler-Bauer A, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, et al. CDD: specific functional annotation with the Conserved Domain Database. Nucleic Acids Res. 2009;37(Database issue):D205–10.
Article PubMed Central CAS PubMed Google Scholar
Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, et al. CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res. 2011;39(Database issue):D225–9.
Article PubMed Central CAS PubMed Google Scholar
Derbyshire MK, Lanczycki CJ, Bryant SH, Marchler-Bauer A. Annotation of functional sites with the Conserved Domain Database. Database (Oxford) 2012;2012:bar058.
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–402.
Article PubMed Central CAS PubMed Google Scholar
Ha JY, Min JY, Lee SK, Kim HS, Kim DJ, Kim KH, et al. Crystal structure of 2-nitropropane dioxygenase complexed with FMN and substrate: Identification of the catalytic base. J Biol Chem. 2006;281(27):18660–7.
Article CAS PubMed Google Scholar
Hu L, Mulfinger LM, Phillips AT. Purification and properties of formylglutamate amidohydrolase from Pseudomonas putida. J Bacteriol. 1987;169:4696–702.
PubMed Central CAS PubMed Google Scholar
Wang H-C, Chen Y-C, Huang C-T, Hseu R-S. Cloning and characterization of a thermostable and pH-stable cellobiohydrolase from Neocallimastix patriciarum J11. Protein Expr Purif. 2013;90(2):153–9.
Article CAS PubMed Google Scholar
Mander GJ, Duin EC, Linder D, Stetter KO, Hedderich R. Purification and characterization of a membrane-bound enzyme complex from the sulfate-reducing archaeon Archaeoglobus fulgidus related to heterodisulfide reductase from methanogenic archaea. Eur J Biochem. 2002;269(7):1895–904.
Article CAS PubMed Google Scholar
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci. 1990;87(12):4576–9.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene Ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25–9.
Article PubMed Central CAS PubMed Google Scholar
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10):2731–9.
Article PubMed Central CAS PubMed Google Scholar

Download references

Acknowledgements

We thank M. Ash for assistance in the editing of the manuscript. We are grateful to Ministry of Science and Technology (MOST 103-2221-E-006-180 and MOST 103-2313-B-006-002-MY3) and the Headquarters of University Advancement at National Cheng Kung University for supporting this research.

Author information

Authors and Affiliations

Institute of Bioinformatics and Biosignal Transduction, National Cheng Kung University, Tainan, 701, Taiwan
Chun-Hui Yu & Tsunglin Liu
Department of Engineering Science, National Cheng Kung University, Tainan, 701, Taiwan
Chun-Hui Yu & Chi-Chuan Hwang
National Research Institute of Chinese Medicine, Taipei, 112, Taiwan
Chung-Kuang Lu
Department of Life Sciences and Institute of Genome Sciences, School of Life Science, National YangMing University, Taipei, 112, Taiwan
Chung-Kuang Lu
Institute of Biotechnology, National Cheng Kung University, Tainan, 701, Taiwan
Huang-Ming Su & Yi-Min Chen
Department of Life Sciences, National Cheng Kung University, Tainan, 701, Taiwan
Tzen-Yuh Chiang

Authors

Chun-Hui Yu
View author publications
You can also search for this author in PubMed Google Scholar
Chung-Kuang Lu
View author publications
You can also search for this author in PubMed Google Scholar
Huang-Ming Su
View author publications
You can also search for this author in PubMed Google Scholar
Tzen-Yuh Chiang
View author publications
You can also search for this author in PubMed Google Scholar
Chi-Chuan Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Tsunglin Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Min Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Tsunglin Liu or Yi-Min Chen.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

YC and TL designed and carried out the experiments. CY, CL, HS, TC, and CH performed the data analysis and drafted the manuscript. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver (https://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Yu, CH., Lu, CK., Su, HM. et al. Draft genome of Myxosarcina sp. strain GI1, a baeocytous cyanobacterium associated with the marine sponge Terpios hoshinota . Stand in Genomic Sci 10, 28 (2015). https://doi.org/10.1186/s40793-015-0011-3

Download citation

Received: 23 September 2014
Accepted: 16 April 2015
Published: 27 May 2015
DOI: https://doi.org/10.1186/s40793-015-0011-3

Draft genome of Myxosarcina sp. strain GI1, a baeocytous cyanobacterium associated with the marine sponge Terpios hoshinota

Abstract

Introduction