Draft genome sequence of Arthrospira platensis C1 (PCC9438)
Standards in Genomic Sciences volume 6, pages 43–53 (2012)
Arthrospira platensis is a cyanobacterium that is extensively cultivated outdoors on a large commercial scale for consumption as a food for humans and animals. It can be grown in monoculture under highly alkaline conditions, making it attractive for industrial production. Here we describe the complete genome sequence of A. platensis C1 strain and its annotation. The A. platensis C1 genome contains 6,089,210 bp including 6,108 protein-coding genes and 45 RNA genes, and no plasmids. The genome information has been used for further comparative analysis, particularly of metabolic pathways, photosynthetic efficiency and barriers to gene transfer.
Arthrospira platensis is a cyanobacterium that contains large amounts of proteins, vitamins, lipids and pigments . It is widely used as a human food and an animal feed. In addition, its extracts can enhance the immune system and promote health [1,2]. As the natural habitat is soda lakes, Arthrospira spp are cultivated under highly alkaline conditions in open ponds on a large commercial scale. This condition can minimize and sometimes prevent the culture from contamination . Unlike many plant food products whose nutritional value rapidly deteriorates at high temperatures, the nutritional value of Arthrospira products is maintained even when the cells are processed at high temperatures . In contrast to many cyanobacteria, there is no report of toxicity of Arthrospira for humans, animals or environments .
The genome sequences of Arthrospira spp. have been the subject of immense interest due to the beneficial properties of these organisms in the biotechnology and environmental fields [5,6]. A. platensis C1 is the fifth complete genome report for a member of the genus Arthrospira. A. platensis C1 has long been used as a laboratory strain for physiological and molecular studies due to its non-gliding property, which enables single colonies formation. This property facilitates studies at the molecular level and strain improvement, particularly, the development of a transformation system. Currently, a successful transformation system for Arthrospira has not yet been established. Thus the genome sequences may help to identify barriers responsible for the instability of the transformants. Here, we present a summary classification and a set of features of A. platensis C1 together with the complete genomic sequence and its annotation.
Classification and features
Historically, the classification of the Arthrospira and Spirulina genera [Figure 1] was a subject of controversy. For the commercial strain, Arthrospira or Spirulina was used interchangeably. Both Arthrospira and Spirulina are similar in morphological characters; cylindrical, multicellular, filamentous cyanobacteria with an open, left-handed helical shape [Table 1]. They both belong to the Phylum Cyanobacteria, Order Oscillatoriales and Family Oscillatoriaceae . However, they can be differentiated by the presence of cell septa: Arthrospira possess septa, whereas Spirulina do not .
Arthrospira platensis C1 or Arthrospira sp. PCC 9438, as designated based on its morphology by Pasteur Institute, Paris, France, was originally classified as Spirulina platensis C1 [1,3]. This reclassification was in agreement with the presence of γ-linolenic acid (GLA) in the fatty acid profile, a chemotaxonomic marker of Arthrospira, while GLA is absence in Spirulina [16–18].
The phylogenetic tree of cyanobacteria was reconstructed with evolutionary information embedded in conserved, concatenated ribosomal proteins. A. platensis C1 was positioned into the genus Arthrospira with 100% of the bootstrapping value. The closest strain is Arthrospira sp. PCC 8005 with 97.43% sequence identity, whereas the other strains of this genus share 94.93-96.58% sequence identity. The nearest related order is Nostocales with approximately 70% sequence identity.
Genome sequencing and annotation
Genome project history
Arthrospira genome sequencing projects have been carried out in 5 research centers: Genoscope, France; the DOE Joint Genome Institute, USA; the National Institute of Technology and Evaluation (NITE), Japan; the University of Applied Sciences, Switzerland, and King Mongkut’s University of Technology Thonburi (KMUTT), Thailand, using various Arthrospira strains (A. platensis PCC8005, A. maxima CS-328, A. platensis NIES-39, A. platensis Paraca and A. platensis C1 (this study), respectively). In this study, the genome of A. platensis C1 has been sequenced, and the results provide data that can be used for the further study of its biological functions.
The genome project is deposited in the GenBank Database (NCBI ID 67617 and accession number AFXD00000000). DNA sequencing and finishing were performed in collaboration between the Genome Institute, BIOTEC-NSTDA, Thailand and Kazusa DNA Research Institute, Japan. The genome assembly and annotation steps were performed in collaboration between KMUTT, the Genome Institute, BIOTEC-NSTDA, Chiang Mai University, Thailand and Kazusa DNA Research Institute. The summary of the project information is shown in Table 2.
Growth conditions and DNA isolation
A. platensis strain C1 was obtained from Prof. Avigad Vonshak (Algal Biotechnology, Ben-Gurion University of the Negev, Israel). The cells were routinely grown at 35 °C with constant shaking at 150 rpm in Zarrouk’s medium , under aeration of 1% (v/v) CO2 and illumination of 100 µE m-2 s-1 from fluorescent lamps. For stock cultures, cells were transferred into new Zarrouk’s medium  every two weeks.
For the shotgun genome library preparation, a total DNA sample was sheared by nebulization with nitrogen at 45 psi for 1 min and purified using a QIAGEN Purification kit (QIAGEN, Valencia, CA). DNA fragments with sizes greater than 300 bp were selected by Ampure Bead purification (Beckman Coulter, Brea, CA) and blunt-ended with T4 DNA polymerase and T4 polynucleotide kinase (454 Life Sciences, Roche, Branford, CT). After DNA purification, 454-library adaptors were added to both ends of the fragments, which were immobilized using Library Immobilization Beads for further DNA sequencing.
For paired-end library preparation, a total DNA sample was sheared by nebulization with nitrogen at 10 psi for 2 min to obtain 2.2 kb fragments with hairpin adaptors added to both ends of the DNA fragments. The DNA fragments with hairpin adaptors were immobilized using Library Immobilization Beads and ligated to paired-end adaptors. DNA libraries were linked to the capture beads and amplified by emulsion PCR (Roche Applied Science protocol).
Genome sequencing and assembly
The genome of A. platensis C1 was sequenced using a hybrid method between the 454 Life Sciences technology on the Genome Sequencer (GS) FLX System and BigDye Terminator v3.1 Cycle sequencing. Pyrosequencing reads were assembled using the Newbler de novo sequence assembly software version 2.0.0 (Roche). The Phred/Phrap/Consed software package  was used for sequence assembly and quality assessment in the finishing process. The remaining gaps between contigs were closed by custom primer walk or PCR amplification and then editing in Consed. The final assembly contains 739,684 reads from pyrosequencing and 45,959 reads from Sanger sequencing, resulting in 28× coverage of the genome. Employing A. maxima CS-328 contigs  as a reference, the A. platensis C1 circular genome of 6.08 Mb total size with 1 scaffold and 63 gaps has been constructed.
In agreement with the result from the Integrated Microbial Genomes Expert Review (IMG-ER) platform , all the genes in the A. platensis C1 genome were identified using the GLIMMER 3.0 program in our Microbial Inhouse Annotation Pipeline. Initial criteria for automated functional assignment required a minimum of 50% residue identity with over 80% length match for BLASTP alignments to the NCBI nonredundant database, InterPro, SwissProt, SignalP, COG, and KEGG databases. The tRNAscan-SE tool  was used to find tRNA genes, whereas ribosomal RNAs were found by using the tool RNAmmer . Additional gene prediction analysis and functional annotation were performed within IMG-ER platform  and a round of manual curation, including confirmation with proteomic data [25,26].
The genome of A. platensis C1, with a total of 6.08 Mbp (6,089,210 base pairs), contains 44.68% G+C (Table 3) and, in agreement with the findings of Fujisawa et al. (2010) , no plasmid DNA. Our results confirm the presence of a single genome in A. platensis C1. The A. platensis C1 circular genome of 6.08 Mbp was compared with the A. maxima CS-328 and A. platensis NIES-39 genomes . A total of 6,153 open reading frames (orfs) were predicted. Of these, 3,757 were annotated as coding for known protein functions and 45 for RNA genes (6 for rRNA and 39 for tRNA). The distribution of genes into COGs is presented in Table 4.
The genome atlas of A. platensis C1 is shown in Figure 2. The atlas was created based on one FASTA file containing the nucleotide sequences in one piece. Together with the annotation file, positions of genes were used for the map.
In the genome atlas, the gene annotation, base content, AT and GC skew, percent AT and some structural properties of the DNA are shown. The structural properties are Position Preference, Stacking Energy and Intrinsic Curvature, all of which are related to the flexibility and strength of the DNA molecule .
The genome sequences from all Arthrospira strains (A. maxima CS-328, A. platensis NITE-39, A. platensis PCC8005, A. platensis Paraca, and A. platensis C1) provided the data for a comparative genome analysis of these strains. Based on the genome statistics comparison (Table 5), all of the Arthrospira spp. genomes revealed highly conserved sequences. Interestingly, the number of signal peptides in A. platensis C1 has been reported to be the lowest among the Arthrospira spp. Further study of these primary sequences may reveal the importance of signal peptides in protein targeting in this cyanobacterial species compared with other laboratory and commercial strains.
Insights from the genome sequence
Analysis of the genome of Arthrospira spp. compared to other cyanobacteria confirmed that the Arthrospira are non-nitrogen fixing, filamentous cyanobacteria. They are nontoxic, which might be due to the absence of polyketide and non-ribosomal peptide-producing genes.
Genes involved in gliding motility in A. platensis NIES-39 (a vigorously motile strain utilizing type IV pili as the major mechanism for gliding)  have been compared with those in A. platensis C1 (a nonmotile strain). Interestingly, all the genes involved in type IV pili are present in the A. platensis C1 genome, however, the lack of gliding ability in A. platensis C1 is due to an unknown mechanism. Further studies are needed to elucidate this mechanism of cell motility.
Cellular defense mechanism
Like other genomes of Arthrospira spp., the genome of A. platensis C1 contains highly interspersed repetitive sequences that account for 9% of its genome. Genome comparisons among cyanobacteria revealed unusual genes involved in defense mechanisms, including restriction and modification enzymes, group II introns, insertion elements and CRISPR. These genes are considered to be major barriers for stable transformation. Therefore, these genes have been targeted for the development of a stable gene transformation system for Arthrospira spp. Because of the nonmotile property of A. platensis C1, single colonies can be selected and used for further strain improvement and genetic manipulation experiments.
By combining a stable transformation system with the advantage of colony-forming ability, we should be able to harness A. platensis C1 for many biotechnological applications using gene manipulation and systems biology.
Transporter Characteristic of A. platensis C1
Because of the ability of cyanobacteria to adapt to extremely different habitats, the relationship between their membrane transporter proteins and their habitats has been a focus of interest. Membrane transporters are proteins that allow cell membranes to deliver essential nutrients, eject waste products, and help the cell sense environmental conditions around it .
Interestingly, all Arthrospira species contain genes for Na+/H+ antiporters. The NapA-type Na+/H+ antiporter homolog, which is reported to be involved in salt tolerance at alkaline pH in some cyanobacteria, is found in all Arthrospira genomes. Arthrospira species live in high-alkalinity environments, and there are many alkali transporters in their genomes. These results revealed the relationship between the transporters and the lifestyles and niche adaptations of cyanobacteria.
Habib MAB, Pariv M, Huntington TC, Hasan MR. A review on culture, production and use of spirulina as food for humans and feeds for domestic animals and fish. Rome: Food and Agriculture Organisation (FAO) of the United Nations. 2008; 1–33.
Richmond A. Spirulina, Microalgal Biotechnology. In: Borowitzka M, Borowitzka L, editors. Microalgal Biotechnology. New York: Cambridge University Press; 1987. p. 85–121.
Vonshak A. Spirulina: Growth, physiology and biochemistry. In: Vonshak A, editor. Spirulina platensis (Arthrospira). London: Taylor & Francis; 1997, p. 43–65.
Henrikson R. Spirulina World Food: How this micro algae can transform your health and our planet. Ronore Enterprises, Inc. PO Box 909, Hana, Maui, Hawaii 96718; 2010.
Janssen PJ, Morin N, Mergeay M, Leroy B, Wattiez R, Vallaeys T, Waleron K, Waleron M, Wilmotte A, Quillardet P, et al. Genome sequence of the edible cyanobacterium Arthrospira sp. PCC 8005. J Bacteriol 2010; 192:2465–2466. PubMed http://dx.doi.org/10.1128/JB.00116-10
Fujisawa T, Narikawa R, Okamoto S, Ehira S, Yoshimura H, Suzuki I, Masuda T, Mochimaru M, Takaichi S, Awai K, et al. Genomic structure of an economically important cyanobacterium, Arthrospira (Spirulina) platensis NIES-39. DNA Res 2010; 17:85–103. Med http://dx.doi.org/10.1093/dnares/dsq004
Schirrmeister BE, Antonelli A, Bagheri HC. The origin of multicellularity in cyanobacteria. BMC Evol Biol 2011; 11:45. Med http://dx.doi.org/10.1186/1471-2148-11-45
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ, Angiuoli SV, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol 2008; 26:541–547. Med http://dx.doi.org/10.1038/nbt1360
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci USA 1990; 87:4576–4579. Med http://dx.doi.org/10.1073/pnas.87.12.4576
Woese CR, Stackebrandt E, Macke TJ, Fox GE. A phylogenetic definition of the major eubacterial taxa. Syst Appl Microbiol 1985; 6:143–151. PubMed http://dx.doi.org/10.1016/S0723-2020(85)80047-3
McNeill J, Barrie FR, Burdet HM, Demoulin V, Hawksworth DL, Marhold K, Nicolson DH, Prado J, Silva PC, Skog JE, et al. International Code of Botanical Nomenclature, A.R.G. Ganter, Königstein, 2006, p. 1.
Woese CR, Stackebrandt E, Macke TJ, Fox GE. A phylogenetic definition of the major eubacterial taxa. Syst Appl Microbiol 1985; 6:143–151. PubMed http://dx.doi.org/10.1016/S0723-2020(85)80047-3
Castenholz, R.W. 2001. Oxygenic photosynthetic bacteria. In: Boone D.R. and Castenholz R.W. (eds), Bergey’s Manual of Systematic Bacteriology (2nd ed.), Volume 1, Springer-Verlag, New York, pp. 473–600.
Tomaselli L, Maria Rosa P, Mario RT. On the correct use of the Spirulina designation. Algol Stud 1996; 83:539–548.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000; 25:25–29. Med http://dx.doi.org/10.1038/75556
Romano I, Bellitti MR, Nicolaus B, Lama L, Manca MC, Pagnotta E, Gambacorta A. Lipid profile: a useful chemotaxonomic marker for classification of a new cyanobacterium in Spirulina genus. Phytochemistry 2000; 54:289–294. Med http://dx.doi.org/10.1016/S0031-9422(00)00090-X
Cohen Z, Margheri MC, Tomaselli L. Chemotaxonomy of cyanobacteria. Phytochemistry 1995; 40:1155–1158. http://dx.doi.org/10.1016/0031-9422(95)00335-5
Cohen Z, Vonshak A. Fatty acid composition of Spirulina and Spirulina-like cyanobacteria in relation to their chemotaxonomy. Phytochemistry 1991; 30:205–206. http://dx.doi.org/10.1016/0031-9422(91)84125-C
Zarrouk C. Contribution a’ l‘e’tude d’une cyanophyce’e. Influence de divers facteurs physiques et chimiques sur la croissance et la photosynthe’se de Spirulina maxima. Ph. D. Thesis, Universite’ de Paris. 1966.
Phrap and Phred for Windows, MacOS, Linux, and Unix.
Arthrospira maxima CS-328 analysis files. http://genome.ornl.gov/microbial/amax
Markowitz VM, Mavromatis K, Ivanova NN, Chen IM, Chu K, Kyrpides NC. IMG ER: a system for microbial genome annotation expert review and curation. Bioinformatics 2009; 25:2271–2278. PubMed http://dx.doi.org/10.1093/bioinformatics/ Med http://dx.doi.org/10.1093/bioinformatics/btp393
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 1997; 25:955–964. Med http://dx.doi.org/10.1093/nar/25.5.955
Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 2007; 35:3100–3108. Med http://dx.doi.org/10.1093/nar/gkm160
Hongsthong A, Sirijuntarut M, Prommeenate P, Lertladaluck K, Porkaew K, Cheevadhanarak S, Tanticharoen M. Proteome analysis at the subcellular level of the cyanobacterium Spirulina platensis in response to low-temperature stress conditions. FEMS Microbiol Lett 2008; 288:92–101. PubMed http://dx.doi.org/10.1574-6968.2008.01330.x
Hongsthong A, Sirijuntarut M, Yutthanasirikul R, Senachak J, Kurdrid P, Cheevadhanarak S, Tanticharoen M. Subcellular proteomic characterization of the high-temperature stress response of the cyanobacterium Spirulina platensis. Proteome Sci 2009; 7:33. PubMed http://dx.doi.org/10.1186/1477-5956-7-33
Hallin PF, Binnewies TT, Ussery DW. The genome BLASTatlas-a GeneWiz extension for visualization of whole-genome homology. Mol Biosyst 2008; 4:363–371. Med http://dx.doi.org/10.1039/b717118h
Ren Q, Paulsen IT. Comparative analyses of fundamental differences in membrane transport capabilities in prokaryotes and eukaryotes. PLOS Comput Biol 2005; 1:e27. PubMed http://dx.doi.org/10.1371/journal.pcbi.00100_2Z
This work was supported by the National Center for Genetic Engineering and Biotechnology (BIOTEC), Thailand under the grant numbers P-00-20061 and P-09-00186. The authors would like to thank Prof. Avigad Vonshak (Algal Biotechnology, Ben-Gurion University of the Negev, Israel) for the A. platensis C1 strain used in our experiments and Prof. Norio Murata for all his valuable suggestions during this project.
About this article
Cite this article
Cheevadhanarak, S., Paithoonrangsarid, K., Prommeenate, P. et al. Draft genome sequence of Arthrospira platensis C1 (PCC9438). Stand in Genomic Sci 6, 43–53 (2012). https://doi.org/10.4056/sigs.2525955
- Large Commercial Scale
- Integrate Microbial Genome Expert Review
- Genome Atlas
- Shotgun Genome Library