- Short genome report
- Open Access
High-quality permanent draft genome sequence of Bradyrhizobium sp. Ai1a-2; a microsymbiont of Andira inermis discovered in Costa Rica
Standards in Genomic Sciences volume 10, Article number: 33 (2015)
Bradyrhizobium sp. Ai1a-2 is is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen fixing root nodule of Andira inermis collected from Tres Piedras in Costa Rica. In this report we describe, for the first time, the genome sequence information and annotation of this legume microsymbiont. The 9,029,266 bp genome has a GC content of 62.56% with 247 contigs arranged into 246 scaffolds. The assembled genome contains 8,482 protein-coding genes and 102 RNA-only encoding genes. This rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal.
Bradyrhizobium sp. strain Ai1a.2 is a representative of a distinctive lineage affiliated with the Bradyrhizobium elkanii superclade . The B. elkanii superclade is one of the three main branches of the genus, together with the B. japonicum / B. diazoefficiens superclade [2,3], and the group encompassing photosynthetic Aeschynomene symbionts .
Members of the lineage represented by strain Ai1a.2 are readily diagnosed because they share a distinctive length variant in helix 9 within the 5′ intervening sequence region of the 23S rRNA gene . Strain Ai1a.2 and its relatives have an insertion of 16 nucleotides in this region in comparison to B. elkanii USDA76, which can be identified by a straightforward PCR assay . In a survey of 420 Bradyrhizobium strains from 25 countries , only 2% of the strains had this 23S rRNA length variant. These strains all clustered together into a strongly supported clade based on concatenated data for 23S rRNA and five protein-coding genes .
This clade was placed as the most basally diverging lineage within the B. elkanii superclade, and it included strains from three locations: Central America, the Caribbean, and South Africa. Strain Ai1a.2 was sampled in Costa Rica from the tree Andira inermis , and highly similar strains are also known to occur as symbionts of the same host legume in Panama . Parker and Rousteau  also detected strains from this group in nodule samples from the beach legume Canavalia rosea in two Caribbean locations (Guadeloupe and Puerto Rico). Two Bradyrhizobium strains from distantly related legume hosts (Leobordea spp.) in South Africa (WSM2632, WSM2783) also belong to this clade .
Andira inermis , the host of strain Ai1a.2, is a large tree (up to 35 m height) commonly found in riparian habitats from southern Mexico through northern South America . Andira was traditionally considered to be an early-diverging lineage within the Tribe Dalbergieae , but more recent phylogenetic analyses have suggested that it forms a separate lineage with unclear relationship to dalbergioid legumes . Here we provide an analysis of the high-quality permanent draft genome sequence of Bradyrhizobium strain Ai1a.1. The fact that the genome of its close relative WSM2783 has also been sequenced as part of the Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project  will enable detailed comparative analysis of this group.
Classification and features
Bradyrhizobium sp. Ai1a-2 is a motile, non-sporulating, non-encapsulated, Gram-negative strain in the order Rhizobiales of the class Alphaproteobacteria . The rod shaped form has dimensions of approximately 0.5 μm in width and 1.5-2.0 μm in length (Figure 1 Left and Center). It is relatively slow growing, forming colonies after 6–7 days when grown on half strength Lupin Agar (½LA) , tryptone-yeast extract agar (TY)  or a modified yeast-mannitol agar (YMA)  at 28°C. Colonies on ½LA are opaque, slightly domed and moderately mucoid with smooth margins (Figure 1 Right).
Figure 2 shows the phylogenetic relationship of Bradyrhizobium sp. Ai1a-2 in a 16S rRNA gene sequence based tree. The 16S rRNA gene sequence of Aia1-2 (using a 1,370 bp intragenic sequence) is identical to that of Bradyrhizobium sp. WSM2783. Bradyrhizobium sp. Ai1a-2 is also closely related to Bradyrhizobium sp. Cp5.3 and Bradyrhizobium sp. Th.b2 with 16S rRNA gene sequence identities of 99.77% and 99.23%, respectively, as determined using NCBI BLAST analysis . The highest identity (99.16%) of the 16S rRNA gene sequence of strain Ai1a-2 to type strain sequences occurs with Bradyrhizobium icense LMTR 13T and Bradyrhizobium paxllaeri LMTR 21T based on alignment using the EzTaxon-e server [18,19].
Strain Ai1a.2 was isolated from the tree Andira inermis , Costa Rica . The authentication of the symbiotic ability could not be performed using this host because seeds could not be accessed. The symbiotic capability of strain Ai1a.2 was tested on Macroptilium atropurpureum and this strain was able to nodulate this host. Acetylene reduction assays showed established nodules contained active nitrogenase, indicating an effective symbiosis with this host .
Genome sequencing information
Genome project history
This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Genomic Encyclopedia of Bacteria and Archaea, Root Nodulating Bacteria (GEBA-RNB) project at the U.S. Department of Energy, Joint Genome Institute (JGI). The genome project is deposited in the Genomes OnLine Database  and a high-quality permanent draft genome sequence in IMG . Sequencing, finishing and annotation were performed by the JGI using state of the art sequencing technology . A summary of the project information is shown in Table 2.
Growth conditions and genomic DNA preparation
Bradyrhizobium sp. Ai1a-2 was cultured to mid logarithmic phase in 60 ml of TY rich media on a gyratory shaker at 28°C . DNA was isolated from the cells using a CTAB (Cetyl trimethyl ammonium bromide) bacterial genomic DNA isolation method .
Genome sequencing and assembly
The draft genome of Bradyrhizobium sp. Ai1a–2 was generated at the DOE Joint Genome Institute (JGI) using the Illumina technology . An Illumina standard shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 21,669,974 reads totaling 3,250.5 Mbp. All general aspects of library construction and sequencing were performed at the JGI and details can be found on the JGI website . All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, Copeland A, Han J, Unpublished). Following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet (version 1.1.04) , (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim , (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r42328) . Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: −very_clean yes –exportFiltered yes –min_contig_lgth 500 –scaffolding no –cov_cutoff 10) 2) wgsim (−e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs: PHRED_64 = 1 PLOIDY = 1 FRAG_COVERAGE = 125 JUMP_COVERAGE = 25 LONG_JUMP_COV = 50, RunAllpathsLG: THREADS = 8 RUN = std_shredpairs TARGETS = standard VAPI_WARN_ONLY = True OVERWRITE = True). The final draft assembly contained 247 contigs in 246 scaffolds. The total size of the genome is 9.0 Mbp and the final assembly is based on 1,081.2 Mbp of Illumina data, which provides an average 119.7X coverage of the genome.
Genes were identified using Prodigal , as part of the DOE-JGI genome annotation pipeline [31,32]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) non-redundant database, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool  was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA . Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL . Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes-Expert Review (IMG-ER) system  developed by the Joint Genome Institute, Walnut Creek, CA, USA.
The genome is 9,029,266 nucleotides with 62.56% GC content (Table 3) and comprised of 246 scaffolds. From a total of 8,584 genes, 8,482 were protein encoding and 102 RNA only encoding genes. The majority of genes (75.10%) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COGs functional categories is presented in Table 4.
Bradyrhizobium sp. Ai1a-2 is a member of a widely distributed Bradyrhizobium lineage, isolated from diverse legume hosts in North, Central and South America and South Africa. Little is currently known of the symbiotic associations of its host Andira inermis , apart from the discovery that the Puerto Rican isolate Bradyrhizobium sp. EC3.3 can also establish a symbiosis with this host . The Costa Rican isolate Aia1-2 16S rRNA gene sequence is distinct to that of EC3.3 but identical to the 16S rRNA sequence of South African isolate Bradyrhizobium sp. WSM2783. Phylogentically, Ai1a-2 is closely related to Bradyrhizobium sp. Cp5.3 and Bradyrhizobium sp. Th.b2 from Panama and USA, respectively. The genome of Bradyrhizobium 1a-2 and Ai sp.WSM2783 were sequenced along with 23 other Bradyrhizobium genomes as a part of the GEBA-RNB project. Of these 25 sequenced strains, the Bradyrhizobium spp. Ai1a-2, WSM2783, Cp5.3, Th.b2 and B. elkanii USDA76T are affiliated with the Bradyrhizobium elkanii superclade. The Bradyrhizobium Ai1a-2 genome has the 2nd lowest genome size (9 Mbp), gene count (8,584) and signal peptide percentage (9.75%) among these five strains. Comparing the genome attributes of Bradyrhizobium sp. Ai1a-2 along with other sequenced Bradyrhizobium genomes will be important for the understanding of the biogeography of Bradyrhizobium spp. interactions required for the successful establishments of effective symbioses with their diverse hosts.
Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria
Joint Genome Institute
Half strength Lupin Agar
Yeast mannitol agar
Cetyl trimethyl ammonium bromide
Parker MA. The spread of Bradyrhizobium lineages across host legume clades: from Abarema to Zygia. Microb Ecol. 2014;69:630–640.
Willems A, Coopman R, Gillis M. Phylogenetic and DNA-DNA hybridization analyses of Bradyrhizobium species. Int J Syst Evol Microbiol. 2001;51:111–7.
Delamuta JR, Ribeiro RA, Ormeno-Orrillo E, Melo IS, Martinez-Romero E, Hungria M. Polyphasic evidence supporting the reclassification of Bradyrhizobium japonicum group Ia strains as Bradyrhizobium diazoefficiens sp. nov. Int J Syst Evol Microbiol. 2013;63:3342–51.
Chaintreuil C, Arrighi JF, Giraud E, Miche L, Moulin L, Dreyfus B, et al. Evolution of symbiosis in the legume genus Aeschynomene. New Phytol. 2013;200:1247–59.
Evguenieva-Hackenberg E, Klug G. RNase III processing of intervening sequences found in helix 9 of 23S rRNA in the alpha subclass of Proteobacteria. J Bacteriol. 2000;182:4719–29.
Parker MA. rRNA and dnaK relationships of Bradyrhizobium sp. nodule bacteria from four papilionoid legume trees in Costa Rica. Syst Appl Microbiol. 2004;27:334–42.
Parker MA. Symbiotic relationships of legumes and nodule bacteria on Barro Colorado Island, Panama: a review. Microb Ecol. 2008;55:662–72.
Parker MA, Rousteau A. Mosaic origins of Bradyrhizobium legume symbionts on the Caribbean island of Guadeloupe. Mol Phylogenet Evol. 2014;77:110–5.
Ardley JK, Reeve WG, O’Hara GW, Yates RJ, Dilworth MJ, Howieson JG. Nodule morphology, symbiotic specificity and association with unusual rhizobia are distinguishing features of the genus Listia within the Southern African crotalarioid clade Lotononis s.l. Ann Bot. 2013;112:1–15.
Weaver PL. Andira inermis (W. Wright) DC. Silvics of forest trees of the American tropics. SO-ITF-SM-20. New Orleans, LA: USDA Forest Service, Southern Forest Experiment Station; 1989. p. 7. 7.
Lewis G SB, Mackinder B, Lock M. Legumes of the world Royal Botanic Gardens, Kew, UK; 2005.
Cardoso D PR, de Queiroz LP, Boatwright JS, Van Wyk BE, Wojciechowski MF, Lavin M. Reconstructing the deep-branching relationships of the papilionoid legumes. South Afr J Bot. 2013;89:58–75
Reeve W, Ardley J, Tian R, Eshragi L, Yoon JW, Ngamwisetkun P, et al. A genomic encyclopedia of the root nodule bacteria: assessing genetic diversity through a systematic biogeographic survey. Stand Genomic Sci. 2015;10:14.
Howieson JG, Ewing MA, D’antuono MF. Selection for acid tolerance in Rhizobium meliloti. Plant Soil. 1988;105:179–88.
Beringer JE. R factor transfer in Rhizobium leguminosarum. J Gen Microbiol. 1974;84:188–98.
Vincent JM. A manual for the practical study of the root-nodule bacteria. International Biological Programme. UK: Blackwell Scientific Publications, Oxford; 1970.
NCBI BLAST [http://blast.ncbi.nlm.nih.gov/Blast.cgi]
Kim O-S, Cho Y-J, Lee K, Yoon S-H, Kim M, Na H, et al. Introducing EzTaxon-e: a prokaryotic 16S rRNA gene sequence database with phylotypes that represent uncultured species. Int J Syst Evol Microbiol. 2012;62:716–21.
Pagani I, Liolios K, Jansson J, Chen IM, Smirnova T, Nosrat B, et al. The Genomes OnLine Database (GOLD) v. 4: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res. 2012;40:D571–9.
Markowitz VM, Chen I-MA, Palaniappan K, Chu K, Szeto E, Pillay M, et al. IMG 4 version of the integrated microbial genomes comparative analysis system. Nucleic Acids Res. 2014;42:D560–7.
Mavromatis K, Land ML, Brettin TS, Quest DJ, Copeland A, Clum A, et al. The fast changing landscape of sequencing technologies and their impact on microbial genome assemblies and annotation. PLoS One. 2012;7, e48837.
Reeve WG, Tiwari RP, Worsley PS, Dilworth MJ, Glenn AR, Howieson JG. Constructs for insertional mutagenesis, transcriptional signal localization and gene regulation studies in root nodule and other bacteria. Microbiology. 1999;145:1307–16.
Protocols and sample preparation information [http://jgi.doe.gov/collaborate-with-jgi/pmo-overview/protocols-sample-preparation-information/]
Bennett S. Solexa Ltd. Pharmacogenomics. 2004;5:433–8.
JGI:Joint Genome Institute [http://www.jgi.doe.gov]
Zerbino D, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18:821–9.
Reads simulator wgsim [https://github.com/lh3/wgsim]
Validation of publication of new names and new combinations previously effectively published outside the IJSEM. List no. 106. Int J Syst Evol Microbiol. 2005;55:2235–38.
Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119.
Mavromatis K, Ivanova NN, Chen IM, Szeto E, Markowitz VM, Kyrpides NC. The DOE-JGI standard operating procedure for the annotations of microbial genomes. Stand Genomic Sci. 2009;1:63–7.
Chen IM, Markowitz VM, Chu K, Anderson I, Mavromatis K, Kyrpides NC, et al. Improving microbial genome annotations in an integrated database context. PLoS One. 2013;8, e54859.
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–64.
Pruesse E, Quast C, Knittel K, Fuchs BM, Ludwig W, Peplies J, et al. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res. 2007;35:7188–96.
Infernal: inference of RNA alignments [http://infernal.janelia.org/]
Markowitz VM, Mavromatis K, Ivanova NN, Chen IM, Chu K, Kyrpides NC. IMG ER: a system for microbial genome annotation expert review and curation. Bioinformatics. 2009;25:2271–8.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28:2731–9.
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, et al. Towards a richer description of our complete collection of genomes and metagenomes “Minimum Information about a Genome Sequence” (MIGS) specification. Nature Biotechnol. 2008;26:541–7.
Field D, Amaral-Zettler L, Cochrane G, Cole JR, Dawyndt P, Garrity GM, et al. The Genomic Standards Consortium. PLoS Biol. 2011;9, e1001088.
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. P Natl A Sci USA. 1990;87:4576–9.
Garrity GM, Bell JA, Lilburn T. hylum XIV. Proteobacteria phyl. nov. In: Garrity GM, Brenner DJ, Kreig NR, Staley JT, editors. Bergey’s manual of systematic bacteriology. Volume 2. Second ed. New York: Springer - Verlag; 2005. p. 1.
Gnerre S, MacCallum I, Przybylski D, Ribeiro FJ, Burton JN, Walker BJ, et al. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. P Natl A Sci. 2011;108:1513–8.
Garrity GM, Bell JA, Lilburn T. Class I. Alphaproteobacteria class. In: Garrity GM, Brenner DJ, Kreig NR, Staley JT, editors. Bergey’s manual of systematic bacteriology. Volume 2. Second ed. New York: Springer - Verlag; 2005. p. 1.
Kuykendall LD. Order VI. Rhizobiales ord. nov. In: Garrity GM, Brenner DJ, Kreig NR, Staley JT, editors. Bergey’s manual of systematic bacteriology. Second ed. New York: Springer - Verlag; 2005. p. 324.
Garrity GM, Bell JA, Lilburn T. Family VII. Bradyrhizobiaceae fam. nov. In Bergey’s Manual of Systematic Bacteriology. Volume 2. edition. Edited by Brenn DJ. New York: Springer - Verlag; 2005: 438
Jordan DC. Transfer of Rhizobium japonicum Buchanan 1980 to Bradyrhizobium gen. nov., a genus of slow-growing, root nodule bacteria from leguminous plants. Int J Syst Bacteriol. 1982;32:136–9.
Biological Agents: Technical rules for biological agents. TRBA:466.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet. 2000;25:25–9.
Guide to GO evidence codes [http://www.geneontology.org/GO.evidence.shtml]
GOLD ID for Bradyrhizobium elkanii WSM1741 [https://gold.jgi-psf.org/projects?id=9846]
This work was performed under the auspices of the US Department of Energy’s Office of Science, Biological and Environmental Research Program, and by the University of California, Lawrence Berkeley National Laboratory under contract No. DE-AC02-05CH11231. We thank Gordon Thompson (Murdoch University) for the preparation of SEM and TEM photos. We would also like to thank the Center of Nanotechnology at King Abdulaziz University for their support.
The authors declare that they have no competing interests.
MP supplied the strain and background information for this project and the DNA to the JGI, TR performed all imaging, TR and WR drafted the paper, MNB and NAB provided financial support and all other authors were involved in sequencing the genome and/or editing the final paper. All authors read and approved the final manuscript.