- Short genome report
- Open access
- Published:
Complete genome sequence of the potato pathogen Ralstonia solanacearum UY031
Standards in Genomic Sciences volume 11, Article number: 7 (2016)
Abstract
Ralstonia solanacearum is the causative agent of bacterial wilt of potato. Ralstonia solanacearum strain UY031 belongs to the American phylotype IIB, sequevar 1, also classified as race 3 biovar 2. Here we report the completely sequenced genome of this strain, the first complete genome for phylotype IIB, sequevar 1, and the fourth for the R. solanacearum species complex. In addition to standard genome annotation, we have carried out a curated annotation of type III effector genes, an important pathogenicity-related class of genes for this organism. We identified 60 effector genes, and observed that this effector repertoire is distinct when compared to those from other phylotype IIB strains. Eleven of the effectors appear to be nonfunctional due to disruptive mutations. We also report a methylome analysis of this genome, the first for a R. solanacearum strain. This analysis helped us note the presence of a toxin gene within a region of probable phage origin, raising the hypothesis that this gene may play a role in this strain’s virulence.
Introduction
Ralstonia solanacearum is the causal agent of bacterial wilt, one of the most devastating plant diseases worldwide [1]. It is a highly diversified bacterial plant pathogen in terms of host range, geographical distribution, pathogenicity, epidemiological relationships, and physiological properties [2]. Strains are divided in four phylotypes, corresponding roughly to their geographic origin: Asia (phylotype I), the Americas (II), Africa (III), and Indonesia (IV) [3]. Strain UY031 belongs to phylotype IIB, sequevar 1 (IIB1), the group considered mainly responsible for bacterial wilt of potato in cold and temperate regions [4]. Phylotype IIB, sequevar 1 is also traditionally classified as race 3 biovar 2.
Strain UY031 was isolated in Uruguay from infected potato tubers in 2003 and displays high aggressiveness both on potato and tomato hosts [5]. This strain is being used as a model in plant-pathogen gene expression studies carried out by our group; having its genome available greatly facilitates the identification of pathogenicity-related genes. Four other IIB1 R. solanacearum strains have been partially sequenced: UW551 [6], IPO1609 [7], NCPPB909 [8], and CFIA906 [8]. This is the first genome of this group to be completely sequenced, and the fourth within the R. solanacearum species complex (the other three are strains GMI1000 [9], Po82 [10] , and PSI07 [11]).
Organism information
Classification and features
Ralstonia solanacearum UY031 strain is classified within the order Burkholderiales of the class Betaproteobacteria . It is an aerobic, non-sporulating, Gram-negative bacterium with rod-shaped cells ranging from 0.5 to 1.5 μm in length (Fig. 1, (a) and (b)). The strain is moderately fast-growing, forming 3–4 mm colonies within 2–3 days at 28 °C. On a general nutrient medium containing tetrazolium chloride and high glucose content, strain UY031 usually produces a diffusible brown pigment and develops pearly cream-white, flat, irregular, and fluidal colonies with characteristic pink whorls in the centre (Fig. 1, (c)). Strain UY031 was isolated from a naturally infected potato tuber showing typical brown rot symptoms (creamy exudates from the vascular rings and eyes of the tuber). This strain is highly pathogenic in different solanaceous hosts including important crops like tomato and potato [5]. Pathogenicity of this strain was also confirmed in several accessions of Solanum commersonii Dunal, a wild species considered as a valuable source of resistance for potato breeding. Due to its great aggressiveness, strain UY031 is being used for selection of resistant germplasm as part of the potato breeding program developed in Uruguay. This strain has been deposited in the CFBP collection of plant-associated bacteria, and has received code CFBP 8401. Minimum Information about the Genome Sequence of R. solanacearum strain UY031 is summarized in Table 1, and a phylogenetic tree is shown in Fig. 2.
Genome sequencing information
Genome project history
This sequencing project was carried out in 2015; the result is a complete and finished genome. Project data is available from GenBank (Table 2). Accession codes for reads in the Sequence Read Archive are SRP064191, SRR2518086, and SRZ132405.
Growth conditions and genomic DNA preparation
R. solanacearum strain UY031 was routinely grown in rich B medium (10 g/l bactopeptone, 1 g/l yeast extract and 1 g/l casaminoacids). Genomic DNA was extracted from a bacterial culture grown to stationary phase to avoid over-representation of genomic sequences close to the origin of replication. Twelve ml of a culture grown for 16 h at 30 °C and shaking at 200 rpm (OD600 = 0.87) were used to extract DNA with Blood & Cell Culture DNA Midi kit (Qiagen), following manufacturer’s instructions for gram-negative bacteria. DNA concentration and quality were measured in a Nanodrop (ND-8000 8-sample spectrophotometer).
Genome sequencing and assembly
Whole-genome sequencing was performed on the PacBio RS II platform at the Duke Center for Genomic and Computational Biology (USA). P5-C3 chemistry and a single SMRTcell were used, and quality control was performed with DUGSIM. The number of Pre-Filter Polymerase Read Bases was greater than 749 million (>130x genome coverage). Reads were assembled using RS_HGAP_Assembly.2 protocol from SMRT Analysis 2.3 [12]. This resulted in one circular chromosome (3,412,138 bp) and one circular megaplasmid (1,999,545 bp). These lengths are very similar to those of the corresponding replicons in R. solanacearum Po82, a IIB sequevar 4 strain, also a potato pathogen and which has also been completely sequenced [10]. The origin of replication was defined for both replicons based on the putative origin for reference strain GMI1000 [9].
An assembly quality assessment was performed before all downstream analyses. All reads were mapped back to the assembled sequences using RS_Resequencing.1 protocol from SMRT Analysis 2.3. This analysis revealed that chromosome and megaplasmid sequences had 100 % of bases called (percentage of assembled sequence with coverage > = 1) and 99.9999 % and 99.9992 %, respectively, of consensus concordance.
Genome annotation
Genome annotation was done using Prokka [13] with the option for ncRNA search. Type III effectors of strain UY031 were identified and annotated in three steps: First, 17 of the T3Es from the R. solanacearum species complex [14] were identified based on the Prokka annotations. Second, the 15 T3Es annotated as “Type III Effector Protein”, “Probable Type III Effector Protein” or “Putative Type III Effector Protein” by Prokka were manually annotated using the first BLAST [15] hits (usually 100 % identity) of their DNA sequences against genome sequences of phylotype IIB strains MOLK2 and Po82. Third, the UY031 genome was uploaded to the “ Ralstonia T3E” web interface tool [14] to search for additional T3Es not annotated as such with Prokka. The additional 28 T3E genes identified were manually annotated as above. Homologous Gene Group clustering was performed with get_homologues [16] using the orthoMCL program [17] and requiring a minimum sequence identity in BLAST query/subject pairs of 30 %.
The sequencing plataform used to assemble the genome (PacBio RS II) also gives kinectics information about the sequenced genome. The presence of a methylated base in the DNA template delays the incorporation of the complementary nucleotide; such modifications in the kinectics may be used to characterize modified bases by methylation including: 6-mA, 5-mC and 4-mC [18]. The analysis of these modifications in a genome-wide and single-base-resolution scale allowed us to characterize the ‘methylome’ of this strain. These epigenetic marks are commonly used by bacteria, and its implications vary from a defense mechanism, protecting the cell from invading bacteriophages or other foreign DNA, to the bacterial virulence itself [19–21]. We performed methylome analysis and motif detection using RS_Modification_and_Motif_analysis.1 protocol from SMRT Analysis 2.3. Such epigenetic marks arise from DNA methyl-transferases, sometimes coupled with a restriction endonuclease (a Restriction-Modification System). We further characterized which genes give rise to the modified motifs using tools available at REBASE [22].
Genome properties
The genome of R. solanacearum strain UY031 has one chromosome (3,412,138 bp) and one circular megaplasmid (1,999,545 bp) (Table 3). The average GC content of the chromosome is 66.5 % while that of the megaplasmid is 66.7 %. A total of 4,778 genes (4,683 CDSs and 95 RNAs) were predicted. Of the protein-coding genes, 3,566 (76.1 %) had functions assigned while 1,212 were considered hypothetical (Table 4). Of all CDSs, 76.6 % could be assigned to one COG functional category and for 83.1 % one or more conserved PFAM-A domains were identified (Table 5).
Insights from the genome sequence
We performed a pan-genome analysis of the R. solanacearum UY031 genome, comparing it to four other genomes: two closely-related R. solanacearum strains (UW551 and IPO1609) and two others with complete genome sequences available (GMI1000 and Po82). The pan-genome consists of 7,594 HGGs while the core genome consists of 2,958 HGGs; the variable genome consists of 2,643 HGGs, and the number of strain-specific HGGs ranges from 193 to 774 (Fig. 3). We identified 193 HGGs that are UY031-specific; 75.1 % of them were annotated as hypothetical proteins.
Type III effector genes are among the most important for virulence determinants in bacterial plant pathogens such as R. solanacearum [14]. Based on comparisons with effector gene sequences in public databases (see above) we have identified 60 T3Es (Table 6), of which 11 appear to be nonfunctional due to frameshifts or other mutations that disrupt the coding sequence. For example, the effector RipS5 is encoded by a gene that has been clearly interrupted by a 34 kbp prophage. Table 6 also shows the orthologs of these genes in the related strains GMI1000, Po82, IPO1609, and UW551. In the table it can be seen that the genes that code for RipAA and RipAR have frameshifts or truncations in strain UY031 only. The absence of a particular effector may be enough for a pathogen to avoid host defenses, and therefore cause disease. These two genes are therefore a good starting point for additional investigations of phenotypic differences between these strains. Other effector genes of interest are those that are present and do not have disrupting mutations in UY031 but are absent or appear to be nonfunctional in other strains. We have found several such cases (Table 6), but in all cases there is at least one other strain that also has the same gene in what appears to be a functional state.
Our modification analysis revealed two motifs that are essentially always methylated, namely: CAACRAC and GTWWAC. Both are fairly frequent in the genome, occurring respectively 2144 and 716 times. Motif CAACRAC is associated with the product of gene RSUY_11320 (R. Roberts, personal communication), which is hypothesized to be an enzyme of the Restriction-Modification System, with a restriction nuclease and a DNA methyltransferase role. This gene does not have homologs in other R. solanacearum strains and is located close to a region containing phage-related genes. This region contains gene RSUY_11410, which has been annotated as encoding a zonular occludens toxin. The provenance of this annotation is an enterotoxin gene found in Vibrio cholera [23]; in R. solanacearum the role of this toxin gene is still unclear [24]. Motif GTWWAC is probably associated with the product of gene RSUY_22890 (R. Roberts, personal communication), which is hypothesized to be a solitary DNA methyltransferase (no restriction endonuclease linked). This gene does have homologs in other R. solanacearum strains (GMI1000, IPO1609, Po82 and PSI07). To our knowledge this is the first R. solanacearum genome with a methylome profile available.
Conclusions
The complete sequence of R. solanacearum UY031 strain presented here should provide a rich platform upon which additional plant-pathogen studies can be carried out. Even though this is the fifth phylotype IIB1 sequenced, we found many differences with respect to the genomes of the other strains. In particular, the repertoire of T3E genes has many variations among these strains, and this may help explain some of the most relevant pathogenicity-related phenotypes described in the literature, opening the way to new control methods for bacterial wilt.
Abbreviations
- IIB1:
-
Phylotype IIB, sequevar 1
- T3E:
-
Type III effectors
- HGG:
-
Homologous gene groups
References
Mansfield J, Genin S, Magori S, Citovsky V, Sriariyanum M, Ronald P, et al. Top 10 plant pathogenic bacteria in molecular plant pathology. Mol Plant Pathol. 2012;13(6):614–29.
Genin S, Denny TP. Pathogenomics of the Ralstonia solanacearum species complex. Annu Rev Phytopathol. 2012;50:67–89.
Fegan M, Prior P. How complex is the ‘Ralstonia solanacearum’ species complex? In: Allen CP, editor. Bacterial wilt: The disease and the ralstonia solanacearum species complex. Prior, Hayward AC. St. Paul, MN: American Phytopathological Society; 2005. p. 449–61.
Janse JD, van den Beld HE, Elphinstone J, Simpkins S, Tjou-Tam-Sin NNA, van Vaerenbergh J. Introduction to Europe of Ralstonia solanacearum biovar 2, race 3 in Pelargonium zonale cuttings. J Plant Pathol. 2004;86(2):147–55.
Siri MI, Sanabria A, Pianzolla MJ. Genetic diversity and aggressiveness of Ralstonia solanacearum strains causing bacterial wilt of potato in Uruguay. Plant Dis. 2011;95(10):1292–301.
Gabriel DW, Allen C, Schell M, Denny TP, Greenberg JT, Duan YP, et al. Identification of open reading frames unique to a select agent: Ralstonia solanacearum race 3 biovar 2. MPMI. 2006;19(1):69–79.
Guidot A, Elbaz M, Carrere S, Siri MI, Pianzzola MJ, Prior P, et al. Specific genes from the potato brown rot strains of Ralstonia solanacearum and their potential use for strain detection. Phytopathology. 2009;99(9):1105–12.
Yuan KX, Cullis J, Levesque CA, Tambong J, Chen W, Lewis CT, et al. Draft genome sequences of Ralstonia solanacearum race 3 biovar 2 strains with different temperature adaptations. Genome Announc. 2015;3(4).
Salanoubat M, Genin S, Artiguenave F, Gouzy J, Mangenot S, Arlat M, et al. Genome sequence of the plant pathogen Ralstonia solanacearum. Nature. 2002;415(6871):497–502.
Xu J, Zheng HJ, Liu L, Pan ZC, Prior P, Tang B, et al. Complete genome sequence of the plant pathogen Ralstonia solanacearum strain Po82. J Bacteriol. 2011;193(16):4261–2.
Remenant B, Coupat-Goutaland B, Guidot A, Cellier G, Wicker E, Allen C, et al. Genomes of three tomato pathogens within the Ralstonia solanacearum species complex reveal significant evolutionary divergence. BMC Genomics. 2010;11:379.
Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods. 2013;10(6):563–9.
Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9.
Peeters N, Carrere S, Anisimova M, Plener L, Cazale AC, Genin S. Repertoire, unified nomenclature and evolution of the type III effector gene set in the Ralstonia solanacearum species complex. BMC Genomics. 2013;14:859.
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–402.
Contreras-Moreira B, Vinuesa P. GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pangenome analysis. Appl Environ Microbiol. 2013;79(24):7696–701.
Li L, Stoeckert Jr CJ, Roos DS. OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003;13(9):2178–89.
Flusberg BA, Webster DR, Lee JH, Travers KJ, Olivares EC, Clark TA, et al. Direct detection of DNA methylation during single-molecule, real-time sequencing. Nat Methods. 2010;7(6):461–5.
Sanchez-Romero MA, Cota I, Casadesus J. DNA methylation in bacteria: from the methyl group to the methylome. Curr Opin Microbiol. 2015;25:9–16.
Garcia-Del Portillo F, Pucciarelli MG, Casadesus J. DNA adenine methylase mutants of Salmonella typhimurium show defects in protein secretion, cell invasion, and M cell cytotoxicity. Proc Natl Acad Sci U S A. 1999;96(20):11578–83.
Heithoff DM, Sinsheimer RL, Low DA, Mahan MJ. An essential role for DNA adenine methylation in bacterial virulence. Science. 1999;284(5416):967–70.
Roberts RJ, Vincze T, Posfai J, Macelis D. REBASE--a database for DNA restriction and modification: enzymes, genes and genomes. Nucleic Acids Res. 2015;43(Database issue):D298–9.
Di Pierro M, Lu R, Uzzau S, Wang W, Margaretten K, Pazzani C, et al. Zonula occludens toxin structure-function analysis. Identification of the fragment biologically active on tight junctions and of the zonulin receptor binding domain. J Biol Chem. 2001;276(22):19160–5.
Murugaiyan S, Bae JY, Wu J, Lee SD, Um HY, Choi HK, et al. Characterization of filamentous bacteriophage PE226 infecting Ralstonia solanacearum strains. J Appl Microbiol. 2011;110(1):296–303.
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–7.
Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59(3):307–21.
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26(5):541–7.
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci U S A. 1990;87(12):4576–9.
Garrity GM, Bell JA, Lilburn T. Phylum XIV. Proteobacteria phyl. nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT, editors. Bergey’s manual of systematic bacteriology, vol. 2. Second ed. New York: Springer; 2005. p. Part B:1.
Garrity GM, Bell JA, Lilburn T. Class II. Betaproteobacteria class. nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT, editors. Bergey’s manual of systematic bacteriology, vol. 2. Second ed. New York: Springer; 2005. p. 575. part C.
List Editor. Validation List Number 107. List of new names and new combinations previously effectively, but not validly, published. Int J Syst Evol Microbiol. 2006,56:1–6
Garrity GM, Bell JA, Lilburn T. Order I. Burkholderiales ord. nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT, editors. Bergey’s manual of systematic bacteriology, vol. 2. Second ed. New York: Springer; 2005. p. 575. part C.
Garrity GM, Bell JA, Lilburn T. Family I. Burkholderiaceae fam. nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT, editors. Bergey’s manual of systematic bacteriology, vol. 2. Second ed. New York: Springer; 2005. p. 575. part C.
Yabuuchi E, Kosako Y, Yano I, Hotta H, Nishiuchi Y. Transfer of two Burkholderia and an Alcaligenes species to Ralstonia gen. Nov.: proposal of Ralstonia pickettii (Ralston, Palleroni and Doudoroff 1973) comb. nov., Ralstonia solanacearum (Smith 1896) comb. nov. and Ralstonia eutropha (Davis 1969) comb. Nov. Microbiol Immunol. 1995;39(11):897–904.
List Editor. Validation List No. 57. Validation of the publication of new names and new combinations previously effectively published outside the IJSB. Int J Syst Bacteriol. 1996,46:625–626
Denny TP, Hayward AC, Schaad NW, Jones JB, Chun W. II. Gram negative bacteria. F. Ralstonia. In: Laboratory guide for identification of plant pathogenic bacteria. Thirdth ed. St. Paul, MN, USA: American Phytopathological Society Press; 2001.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet. 2000;25(1):25–9.
Acknowledgements
We thank Carlos Balsalobre and Cristina Madrid for their helpful advice and for kindly providing materials and protocols; and Carlos Morais for help with NCBI submission. We also thank COST action Sustain from the European Union for funding and Nemo Peeters and Stéphane Genin for hosting MP for a short stay to carry out UY031 effector annotation. RGS has a Ph.D. fellowship from FAPESP, Brazil. JCS has an investigator fellowship from the Conselho Nacional de Desenvolvimento Cientifico e Tecnologico, Brazil.
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have followed all local, national and international guidelines and legislation and obtained the required permissions and/or licenses for this study.
The authors declare that they do not have any financial and non-financial competing interests.
Authors’ contributions
Conceived the project: MV, JCS, RGS. Provided strains and metadata: MIS, MJP. Assembled and annotated the genome: RGS. Performed effector gene annotation: MP, NSC. Analyzed and interpreted results: JCS, MV, MP, NSC, RGS, MIS, MJP. Wrote the manuscript: JCS, MV, MP, RGS, MIS, MJP. All authors read and approved the final manuscript.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Guarischi-Sousa, R., Puigvert, M., Coll, N.S. et al. Complete genome sequence of the potato pathogen Ralstonia solanacearum UY031. Stand in Genomic Sci 11, 7 (2016). https://doi.org/10.1186/s40793-016-0131-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s40793-016-0131-4