- Open Access
Complete genome sequence of Rhizobium leguminosarum bv trifolii strain WSM2304, an effective microsymbiont of the South American clover Trifolium polymorphum
Standards in Genomic Sciences volume 2, pages 66–76 (2010)
Rhizobium leguminosarum bv trifolii is the effective nitrogen fixing microsymbiont of a diverse range of annual and perennial Trifolium (clover) species. Strain WSM2304 is an aerobic, motile, non-spore forming, Gram-negative rod, isolated from Trifolium polymorphum in Uruguay in 1998. This microsymbiont predominated in the perennial grasslands of Glencoe Research Station, in Uruguay, to competitively nodulate its host, and fix atmospheric nitrogen. Here we describe the basic features of WSM2304, together with the complete genome sequence, and annotation. This is the first completed genome sequence for a nitrogen fixing microsymbiont of a clover species from the American center of origin. We reveal that its genome size is 6,872,702 bp encoding 6,643 protein-coding genes and 62 RNA only encoding genes. This multipartite genome was found to contain 5 distinct replicons; a chromosome of size 4,537,948 bp and four circular plasmids of size 1,266,105 bp, 501,946 bp, 308,747 bp and 257,956 bp.
Since ancient times, crop fields have been regularly rotated with legumes, and this continues in the modern world because of the recognition that the productivity of agricultural systems is nitrogen dependent . Legumes may redress nitrogen deficiency through the fixation of atmospheric nitrogen by rhizobia in root nodules . Today, despite the ready availability of nitrogen-fertilizer manufactured through the Haber-Bosch process, globally in excess of 400 million ha of agricultural land are sustained by nitrogen derived from forage legumes . These forages are grown for animal feed, for rotation with cereal crops, as disease breaks or as cover crops for tree plantations. Amongst the forage legumes, the Trifolium spp. (clovers) are acknowledged as one of the most important genera, with 237 species distributed across the temperate and sub-tropical regions of North and South America, Europe, Africa and Australasia .
These clovers are nodulated by R. leguminosarum bv trifolii, which is one of the most exploited species of root-nodule bacteria in world agriculture. However, because clovers are geographically widely distributed, and phenologically variable (they may be either annual [e.g. T. subterraneum] or perennial [e.g. T. pratense, T. raepens and T. polymorphum]), it is rare that a single strain of R. leguminosarum bv trifolii can effectively fix nitrogen across a wide diversity of clovers, especially those from different geographical and phenological backgrounds .
Rhizobium leguminosarum bv trifolii strain WSM2304 was isolated from a nodule recovered from the roots of the perennial clover Trifolium polymorphum growing at Glencoe Research Station near Tacuarembó, Uruguay in December 1998. WSM2304 is of particular interest because it is a highly effective microsymbiont of a perennial clover of South American origin, has a narrow, specialized host range for nitrogen fixation , and is highly competitive for nodulation of T. polymorphum in the acid, infertile soils of Uruguay . WSM2304 has also been implicated in host mediated selection for an effective microsymbiont under competitive conditions for nodulation .
Here we present a summary classification and a set of features for R. leguminosarum bv trifolii strain WSM2304 (Table 1), together with the description of the complete genome sequence and annotation.
Classification and features
R. leguminosarum bv trifolii strain WSM2304 is a motile, Gram-negative, non-spore-forming rod (Figure 1 A and B) in the Rhizobiaceae family of the class Alphaproteobacteria that forms mildly mucoid colonies (Figure 1 C) on solid media . It has a mean generation time of 3.5 h in rich medium at the optimal growth temperature of 28°C .
Figure 2 shows the phylogenetic neighborhood of R. leguminosarum bv trifolii strain WSM2304 in a 16S rRNA-based tree. An intragenic fragment of 1,440 bp was chosen since the 16S rRNA gene has not been completely sequenced in many type strains. A comparison of the entire 16S rRNA gene of WSM2304 to completely sequenced 16S rRNA genes of other rhizobia revealed 100% gene sequence identity with R. leguminosarum bv trifolii strain WSM1325 but a 1 bp difference from the 16S rRNA gene of R. leguminosarum bv viciae strain 3841.
R. leguminosarum bv trifolii WSM2304 nodulates (Nod+) and fixes nitrogen effectively (Fix+) with the South American perennial clover T. polymorphum . WSM2304 is Nod+, Fix− with Mediterranean annual clovers T. subterraneum and T. glanduliferum, in contrast to R. leguminosarum bv trifolii WSM1325 [5,29]. When inoculated onto perennial clovers of either North American or Mediterranean origin WSM2304 is variably Nod+, but always Fix− [5,6,30]. Under conditions of competitive nodulation, WSM2304 may preferentially nodulate T. polymorphum even when outnumbered 100:1 by WSM1325 .
Genome sequencing and annotation information
Genome project history
This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Community Sequencing Program at the Department of Energy Joint Genome Institute (JGI) for projects of relevance to DOE missions. The genome project is deposited in the Genomes OnLine Database  and the complete genome sequence in GenBank. Sequencing, finishing and annotation were performed by the DOE Joint Genome Institute (JGI). A summary of the project information is shown in Table 2 and sequence data statistics from the trace archive for this project are presented in Table 3.
Growth conditions and DNA isolation
R. leguminosarum bv trifolii WSM2304 was grown to mid logarithmic phase in TY medium (a rich medium)  on a gyratory shaker at 28°C. DNA was isolated from 60 ml of cells using a CTAB (Cetyl trimethylammonium bromide) bacterial genomic DNA isolation method (http://my.jgi.doe.gov/general/index.html).
Genome sequencing and assembly
The genome was sequenced using a combination of Sanger and 454 sequencing platforms. All general aspects of library construction and sequencing performed at the JGI can be found at the JGI website (http://www.jgi.doe.gov/). 454 Pyrosequencing reads were assembled using the Newbler assembler version 1.1.02.15 (Roche). Large Newbler contigs were broken into 5,676 fragments of 1,500 bp with 100 bp overlap and entered into the assembly as pseudo-reads. The sequences were assigned quality scores based on Newbler consensus q-scores with modifications to account for overlap redundancy and to adjust inflated q-scores. A hybrid 454/Sanger assembly was made using the phrap assembler. Possible mis-assemblies were corrected and gaps between contigs were closed by custom primer walks from sub-clones or PCR products. A total of 1,826 Sanger finishing reads were produced. Illumina reads were used to improve the final consensus quality using an in-house developed tool (the Polisher). The final assembly consists of 168,617 Sanger reads in addition to 5,663 454 pseudo reads. The error rate of the completed genome sequence is less than 1 in 100,000. Together all sequence types provided about 31.4× coverage of the genome.
Genes were identified using Prodigal  as part of the Oak Ridge National Laboratory genome annotation pipeline, followed by a round of manual curation using the JGI GenePRIMP pipeline . The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) nonredundant database, UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and InterPro databases. Additional gene prediction analyses and functional annotation were performed within the Integrated Microbial Genomes platform (http://img.jgi.doe.gov/er) .
The genome is 6,872,702 bp long with a 61.18% GC content, (Table 4) and comprised of 5 replicons; 1 circular chromosome of size 4,537,948 bp (Figure 3) and 4 circular plasmids of size 4,537,948, 1,266,105, 501,946, 308,747 and 257,956 bp (Figure 4). Of the 6,643 genes predicted, 6,581 were protein coding genes, and 62 RNA only encoding genes. In addition, 166 pseudogenes were identified. The majority of the genes (72.44%) were assigned a putative function whilst the remaining ones were annotated as hypothetical proteins. The distribution of genes into COGs functional categories is presented in Table 5.
Hamblin J. Preface, 1998, pp xi–xiii. In: Lupins as Crop Plants. Biology, Production and Utilization. Gladstones JS, Atkins CA, Hamblin J (Eds.). CAB International, Madison, NY.
Sprent JI. Legume nodulation: a global perspective. 2009. Oxford, Wiley-Blackwell.
Herridge DF, Peoples MB, Boddey RM. Global inputs of biological nitrogen fixation in agricultural systems. Marschner Review. Plant Soil 2008; 311:1–18. doi:10.1007/s11104-008-9668-3
Zohary M, Heller D. The Genus Trifolium. The Israel Academy of Sciences and Humanities, Ahva Printing Press 1984, Jerusalem.
Howieson JG, Yates RJ, O’Hara GW, Ryder M, Real D. The interactions of Rhizobium leguminosarum biovar trifolii in nodulation of annual and perennial Trifolium spp from diverse centres of origin. Aust J Exp Agric 2005; 45:199–207. doi:10.1071/EA03167
Yates RJ, Howieson JG, Real D, Reeve WG, Vivas-Marfisi A, O’Hara GW. Evidence of selection for effective nodulation in the Trifolium spp. symbiosis with Rhizobium leguminosarum biovar trifolii. Aust J Exp Agric 2005; 45:189–198. doi:10.1071/EA03168
Yates RJ, Howieson JG, Reeve WG, Brau L, Speijers J, Nandasena K, Real D, Sezmis E, O’Hara GW. Host-strain mediated selection for an effective nitrogen-fixing symbiosis between Trifolium spp. and Rhizobium leguminosarum biovar trifolii. Soil Biol Biochem 2008; 40:822–833. doi:10.1016/j.soilbio.2007.11.001
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ, Angiuoli SV. Towards a richer description of our complete collection of genomes and metagenomes: the “Minimum Information about a Genome Sequence” (MIGS) specification. Nat Biotechnol 2008; 26:541–547. PubMed doi:10.1038/nbt1360
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci USA 1990; 87: 4576–4579. PubMed doi:10.1073/pnas.87.12.4576
Garrity GM, Holt JG. The Road Map to the Manual. In: Garrity GM, Boone DR, Castenholz RW (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 1, Springer, New York, 2001, p. 119–169.
Garrity GM, Bell JA, Lilburn T. Class I. Alphaproteobacteria class. nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 2, Part C, Springer, New York, 2005, p. 1.
List editor. Validation List No. 107. List of new names and new combinations previously effectively, but not validly, published. Int J Syst Evol Microbiol 2006; 56: 1–6. PubMed doi:10.1099/ijs.0.64188-0
Kuykendall LD. Order VI. Rhizobiales ord. nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 2, Part C, Springer, New York, 2005.
Skerman VBD, McGowan V, Sneath PHA. Approved Lists of Bacterial Names. Int J Syst Bacteriol 1980; 30: 225–420.
Frank B. Über die Pilzsymbiose der Leguminosen. Ber Dtsch Bot Ges 1889; 7: 332–346.
Jordan DC, Allen ON. Genus I. Rhizobium Frank 1889, 338; Nom. gen. cons. Opin. 34, Jud. Comm. 1970, 11. In: Buchanan RE, Gibbons NE (eds), Bergey’s Manual of Determinative Bacteriology, Eighth Edition, The Williams and Wilkins Co., Baltimore, 1974, p. 262–264.
Young JM, Kuykendall LD, Martínez-Romero E, Kerr A, Sawada H. A revision of Rhizobium Frank 1889, with an emended description of the genus, and the inclusion of all species of Agrobacterium Conn 1942 and Allorhizobium undicola de Lajudie et al. 1998 as new combinations: Rhizobium radiobacter, R. rhizogenes, R. rubi, R. undicola and R. vitis.. Int J Syst Evol Microbiol 2001; 51: 89–103. PubMed
Editorial Secretary (for the Judicial Commission of the International Committee on Nomenclature of Bacteria). OPINION 34: Conservation of the Generic Name Rhizobium Frank 1889. Int J Syst Bacteriol 1970; 20: 11–12; doi:10.1099/00207713-20-1-11.
Ramírez-Bahena MH, García-Fraile P, Peix A, Valverde A, Rivas R, Igual JM, Mateos PF, Martínez-Molina E, Velázquez E. Revision of the taxonomic status of the species Rhizobium leguminosarum (Frank 1879) Frank 1889AL, Rhizobium phaseoli Dangeard 1926AL and Rhizobium trifolii Dangeard 1926AL. R. trifolii is a later synonym of R. leguminosarum. Reclassification of the strain R. leguminosarum DSM 30132 (=NCIMB 11478) as Rhizobium pisi sp. nov.. Int J Syst Evol Microbiol 2008; 58: 2484–2490. PubMed doi:10.1099/ijs.0.65621-0
Kuykendall LD, Hashem F, Wang ET. Genus VII. Rhizobium, 2005, pp 325–340. In: Bergey’s Manual of Systematic Bacteriology. Second Edition. Volume 2 The Proteobacteria. Part C The Alpha-, Delta-, and Epsilonproteobacteria. Brenner DJ, Krieg NR, Staley JT (Eds.), Garrity GM (Editor in Chief) Springer Science and Business Media Inc, New York, USA.
Biological Agents. Technical rules for biological agents www.baua.de TRBA 466.
Liolios K, Mavromatis K, Tavernarakis N, Kyrpides NC. The Genomes OnLine Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res 2008; 36:D475–D479.PubMed doi:10.1093/nar/gkm884
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. The Gene Ontology Consortium. Gene ontology: tool for the unification of biology. Nat Genet 2000; 25:25–29. PubMed doi:10.1038/75556
Howieson JG, Ewing MA, D’Antuono MF. Selection for acid tolerance in Rhizobium meliloti. Plant Soil 1988; 105:179–188. doi:10.1007/BF02376781
Kumar S, Tamura K, Nei M. MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief Bioinform 2004; 5:150–163. PubMed PubMed doi:10.1093/bib/5.2.150
Kimura M. A simple model for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol 1980; 16:111–120. PubMed doi:10.1007/BF01731581
Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution 1985; 39:783–791. doi:10.2307/2408678
Saitou N, Nei M. Reconstructing phylogenetic trees. Mol Biol Evol 1987; 4:406–425. PubMed
Bullard GK, Roughley RJ, Pulsford DJ. The legume inoculant industry and inoculant quality control in Australia: 1953–2003. Aust J Exp Agric 2005; 45:127–140. doi:10.1071/EA03159
Centre for Rhizobium Studies. Annual Report. JG Howieson (Ed). 2001. Murdoch University Print, Perth, Australia.
Reeve WG, Tiwari RP, Worsely PS, Dilworth MJ, Glenn AR, Howieson JG. Constructs for insertional mutagenesis, transcriptional signal localization and gene regulation studies in root nodule and other bacteria. Microbiology 1999; 145:1307–1316. PubMed doi:10.1099/13500872-145-6-1307
Anonymous. Prodigal Prokaryotic Dynamic Programming Genefinding Algorithm. Oak Ridge National Laboratory and University of Tennessee 2009. http://compbio.ornl.gov/prodigal
Pati A, Ivanova N, Mikhailova N, Ovchinikova G, Hooper SD, Lykidis A, Kyrpides NC. GenePRIMP: A Gene Prediction Improvement Pipeline for microbial genomes. (Submitted) 2009
Markowitz VM, Szeto E, Palaniappan K, Grechkin Y, Chu K, Chen IMA, Dubchak I, Anderson I, Lykidis A, Mavromatis K, et al. The Integrated Microbial Genomes (IMG) system in 2007: data content and analysis tool extensions. Nucleic Acids Res 2008; 36:D528–D533. PubMed doi:10.1093/nar/gkm846
This work was performed under the auspices of the US Department of Energy’s Office of Science, Biological and Environmental Research Program, and by the University of California, Lawrence Berkeley National Laboratory under contract No. DE-AC02-05CH11231, Lawrence Livermore National Laboratory under Contract No. DE-AC52-07NA27344, and Los Alamos National Laboratory under contract No. DE-AC02-06NA25396. We thank Gordon Thompson (Murdoch University) for the preparation of SEM and TEM photos. We gratefully acknowledge the funding received from Murdoch University Strategic Research Fund through the Crop and Plant Research Institute (CaPRI), and the Grains Research and Development Corporation (GRDC), to support the National Rhizobium Program (NRP) and the Centre for Rhizobium Studies (CRS) at Murdoch University.
About this article
Cite this article
Reeve, W., O’Hara, G., Chain, P. et al. Complete genome sequence of Rhizobium leguminosarum bv trifolii strain WSM2304, an effective microsymbiont of the South American clover Trifolium polymorphum. Stand in Genomic Sci 2, 66–76 (2010). https://doi.org/10.4056/sigs.44642
- Gram-negative rod
- root-nodule bacteria
- nitrogen fixation