- Extended genome report
- Open Access
The genome of the cotton bacterial blight pathogen Xanthomonas citri pv. malvacearum strain MSCT1
Standards in Genomic Sciences volume 12, Article number: 42 (2017)
Xanthomonas citri pv. malvacearum is a major pathogen of cotton, Gossypium hirsutum L.. In this study we report the complete genome of the X. citri pv. malvacearum strain MSCT1 assembled from long read DNA sequencing technology. The MSCT1 genome is the first X. citri pv. malvacearum genome with complete coding regions for X. citri pv. malvacearum transcriptional activator-like effectors. In addition functional and structural annotations are presented in this study that will provide a foundation for future pathogenesis studies with MSCT1.
Xanthomonas citri pv. malvacearum is the causal agent of bacterial blight of cotton ( Gossypium hirsutum L.). Xanthomonas citri pv. malvacearum infects plant tissues and organs of cotton during all stages of development beginning with the seedling stage . Typical disease symptoms caused by X. citri pv. malvacearum include cotyledon/seedling blight, angular leaf spot, systemic vein blight, black arm (of petioles and main stems), boll shedding, and internal boll rot . Histology studies reported that the host cotton plant cells begin to degenerate 3 days post-infection . Over the 3 day period the degradation of host cells begins by; first, the host tissue appearing to loosen, then the granal and stromal membranes of the chloroplasts disappear, followed by the degeneration of the chloroplast and other organelles [2, 3]. At 6 days post-infection, cellular degeneration along with the production of a hydrophilic extracellular polymeric substance by the bacterium, causes water to accumulate in the infected tissues forming lesions known as “water soaked spots”, a classical plant pathogen-associated symptom [2,3,4].
Resistance to X. citri pv. malvacearum has been identified in cotton, as well as additional Gossypium species. Currently, most lines resistant to X. citri pv. malvacearum exist in G. hirsutum cultivars since breeding for X. citri pv. malvacearum resistance has been ongoing since 1939  and continues today as G. hirsutum cultivars and germplasm releases are screened for X. citri pv. malvacearum resistance [6,7,8]. At least 18 genes participate in resistance to X. citri pv. malvacearum [1, 9]. The ability of the X. citri pv. malvacearum strains to escape specific resistance genes resulted in a classification scheme of races. To date, 22 races have been reported and assigned numerical names (i.e. 1 to 22) . Most races are geographically distinct. Of note, bacterial blight in the U.S. is predominantly caused by race 18. Genetic resistance within cotton cultivars is generally attributed to a certain race or multiple races of X. citri pv. malvacearum. The ability of G. hirsutum to mount a defense response to X. citri pv. malvacearum is, at least in some cases, dependent upon the transcription activator-like effector avrBs3/pthA gene family in X. citri pv. malvacearum indicating the presence of a gene-for-gene relationship in X. citri pv. malvacearum-G. hirsutum interactions [9, 10]. With the ever increasing understanding of the importance of TAL effectors in pathogenesis [11,12,13], the objective of this study was to generate the first genome sequence for a X. citri pv. malvacearum strain that contains the TAL effector complement to serve as a foundation for a better understanding of the X. citri pv. malvacearum-G. hirsutum interaction.
To date, four draft genomes of Xanthomonas citri pv. malvacearum have been published. However, all sequenced X. citri pv. malvacearum isolates were obtained from outside the United States [14, 15]. The diversity of the four previously reported draft genomes includes two race 18 isolates, one race 20 isolate, and a highly virulent strain. The project described here was undertaken to provide the first X. citri pv. malvacearum genome sequence from the Mid-South region of the United States, a major production area of upland cotton. The isolate, MSCT1, was isolated during the 2011 outbreak of X. citri pv. malvacearum in the Mississippi Delta (i.e. Mississippi river’s alluvial plain). This outbreak resulted in the greatest estimated X. citri pv. malvacearum-based losses (52,000 bales) in Arkansas and Mississippi as reported by the National Cotton Council Disease Database . This study was undertaken to generate a genome sequence for the X. citri pv. malvacearum strain MSCT1 to identify protein candidates that may be involved in the pathogenesis of bacteria bight of cotton. The genome sequence will also serve as a template for which further studies of genetic diversity of X. citri pv. malvacearum in the United States can be conducted.
Classification and features
Xanthomonas citri pv. malvacearum has gone through a series of name changes over time as additional information has been learned about its biology and genetics. In chronological order, X. citri pv. malvacearum has previously been classified as Pseudomonas malvacearum, Bacterium malvacearum, and Xanthomonas malvacearum . In 2009, Ah-You et al. assigned the X. citri pv. malvacearum moniker [9, 17]. Xanthomonas citri pv. malvacearum is a motile, Gram-negative, rod-shaped bacterium that produces yellow, copiously mucoid, wet, shining growth on 2% w/v peptone-sugar agar . Xanthomonas citri pv. malvacearum, like other Xanthomonas species (xanthomonads), produces the heteropolysaccharide xanthan . Additional characteristics of X. citri pv. malvacearum are provided in Table 1.
For specimen isolation, cotton leaves with the typical blight symptoms (Fig. 1) were collected from a field located north of Yazoo City, Mississippi in Yazoo County, during the 2011 growing season. Strain MSCT1 was isolated using a routine method for foliar bacterial pathogens. In brief, the disease lesions were cut into small pieces (3 × 3 mm) from the junction of diseased and healthy tissues. The cut pieces were transferred into a sterile 1.5 ml microcentrifuge tube and surface-sterilized using 10% sodium hypochlorite (bleach; Clorox) for 1 min. The sterilized tissues were washed twice using sterile water, and then stabbed with a sterile lab needle in 200 μl of sterile water. A full loop of the resulting bacterial suspension was streaked on nutrient broth-yeast extract plates . The streaked nutrient broth-yeast extract plates were incubated at 20 °C for 2 days under ambient laboratory temperatures and a 16:8 day: night photoperiod. Single colonies of the resulting bacterium were isolated in a sterilized loop and streaked onto fresh NBY plates for purification. The pathogenicity of MSCT1 to cause bacterial blight of cotton was confirmed by fulfilling Koch’s Postulates. Briefly, cotton seedlings (cotton cultivar PHY499WRF) were grown in the greenhouse until they reached the three-leaf growth stage. A vacuum system (20″ psi for 10 s) was used to inoculate the seedling leaves with a suspension of MSCT1 (OD 0.3 at 420 nm) suspended in sterile phosphate buffer (0.01 M; pH 7.0). After 10 days the characteristic symptoms of bacterial blight were observed on the inoculated leaf tissues. The X. citri pv. malvacearum strain MSCT1 that is described in this manuscript was deposited in the USDA Agricultural Research Service Culture Collection under deposition number NRRL B-65440. The isolate MSCT1 was confirmed to be Xanthomonas citri pv. malvacearum based on the 16S rRNA sequence analysis, as described previously . Multilocus sequence typing was used to construct a phylogenic tree for Xanthomonas strains based upon three genes from the MLST described by Ah-You et al. 2009 , and included; atpD coding ATP synthase β chain, dnaK coding heat shock protein 70, and gyrB coding the gyrase subunit β (Fig. 2). A transmission election micrograph of MSCT1 was generated by the Mississippi State University’s Institute for Imaging & Analytical Technologies (Fig. 3).
Genome sequencing information
Genome project history
The MSCT1 sequencing project arose from the 2011 outbreak of bacterial blight in the cotton growing regions of the Mississippi Delta. Following MSCT1 isolation, additional testing determined that MSCT1 was capable of producing disease symptoms on several cultivars of upland cotton commonly grown in the Mid-South. Preliminary bioinformatics investigations determined X. citri pv. malvacearum assemblies, generated from short reads, lacked detectable TAL effectors in their genomes, although TAL effectors have been previously described as occurring in X. citri pv. malvacearum [20,21,22]. To better understand the pathology of X. citri pv. malvacearum, and more specifically of isolate MSCT1, we conducted a long read genome sequencing project to identify MSCT1’s effector complement, including the TAL effectors that do not assemble well with short read DNA sequencing technology. The resultant complete genome sequence has been deposited in the NCBI genome database under genome assembly accession GCF_001719155.1.
Growth conditions and genomic DNA preparation
An MSCT1 colony was isolated from a LB plate (pectone 10 g/L, yeast extract 5 g/L sodium chloride 10 g/ L, agarose 15 g/L) and used to inoculate 1.5 ml of LB medium (pectone 10 g/L, yeast extract 5 g/L sodium chloride 10 g/L) in a sterile, plastic culture tube. The culture tube was placed at 25 °C with 200 rpm orbital shaking overnight. The resulting bacterial culture was pelleted by centrifugation at 5000 rpm for 10 min. The pellet was washed twice to remove LB medium; each wash consisted of resuspending the pellet in 1 ml of phosphate buffered saline (PBS; NaCL 8 g/L, KCl 0.2 g/L, Na2HPO4 1.42 g/L, KH2PO4 0.24 g/L), centrifuging the suspension at 5000 rpm for 10 min, and discarding the supernatant. Genomic DNA was isolated using a modified version of the method described in Chen and Kuo 1993 . Briefly, the cell pellet was resuspended in 300 μl of extraction buffer (40 mM Tris-HCl, 1 mM EDTA, 1% w/v SDS, pH 7.8). After adding 50 μl of 10 mg/ml lysozyme (Sigma-Aldrich; St. Louis, MO, USA), the cell suspension was incubated at 37 °C for 30 min with occasional mixing until the cell suspension became clear. The bacterial nucleic acid sample was further purified using a series of phenol, phenol/chloroform, and chloroform extraction steps, then precipitated with two volumes of 100% ethanol. DNA was pelleted by centrifugation at 12,000 rpm for 10 min. After two washes with 70% ethanol (v/v), the nucleic acid pellet was air-dried (approximately 15 min). The pellet was then dissolved in 50 μl of 10 mM Tris buffer (pH 7.5). The bacterial nucleic acid sample was treated with 20 μl of 30 mg/ml RNase A (Sigma-Aldrich; St. Louis, MO, USA) at 37 °C for 20 min, followed by phenol/chloroform and chloroform extraction steps to remove the enzyme. The DNA was precipitated with 100% ethanol and cleaned with 70% ethanol as described above. The air-dried genomic DNA pellet was dissolved in 50 μl of 10 mM Tris buffer (pH 7.5). The resultant DNA was visualized on a 0.8% w/v agarose gel.
Genome sequencing and assembly
Two long read technologies, PacBio (Pacific Biosciences of California, Melon Park, CA, USA) and Nanopore (Oxford Nanopore Technologies, Oxford, UK), were used to sequence MSCT1. A 20 kb PacBio library was prepared and sequenced on two P6-C4 SRMT cells at the University of Delaware Sequencing & Genotyping Center (Newark, DE, USA). Additionally, a Nanopore library was prepared and sequenced on a R9 Nanopore flowcell at the Mississippi State Institute for Genomics, Biocomputing, and Biotechnology (Mississippi State, MS, USA). The PacBio and Nanopore reads were assembled with the Canu long read assembler . The resultant contigs from the assembly were aligned against themselves with blastn to identify the overlapping ends of the assembly for circularization of the DNA molecules. Following circularization, open reading frames (ORFs) were predicted with the getorf program within the ESBOSS software package  and the dnaA coding region for the protein was identified with blastn . The chromosome was rearranged to place the start of the molecule 41 bp from the start of the dnaA coding region. The plasmid molecules were rearranged to put the resultant ends of the circularization within the middle of the molecule while allowing the new cut sight to fall outside a predicted ORF. To ensure the circulation was correct PacBio reads longer than 4000 bp were aligned to the circularized assembly with blasr  and manually checked with IGV [28, 29]. For additional error correction, an Illumina PCR-free DNA library with a DNA insert size of 416 bp was prepared at the Institute of Genomics, Biocomputing and Biotechnology (Mississippi State, MS, USA). The Illumina library was paired-end sequenced (2 × 300 bp) using the Illumina MiSeq. The short read pairs were trimmed with Trimmomatic  and subsequently used to error correct the Canu assembly with Pilon . After Pilon error correction, the resultant assembly was polished with 20 kb PacBio reads using the Quiver algorithm within the PacBio SMRT Analysis software suite (version 22.214.171.124936, Pacific Biosciences of California). The Minimum Information about a Genome Sequence specification was used to report the MSCT1 genome sequencing and assembly methods (Table 2) .
Proteins and noncoding RNAs (including rRNA, tRNA, ncRNA) were predicted with the NCBI Prokaryote Genome Annotation Pipeline . Clusters of Orthologous Groups annotation of the predicted proteins against the COG position-specific scoring matrices downloaded from the NCBI Conserved Domain Database was conducted with RSP-BLAST [34,35,36]. InterProScan V51.0 was used to add Pfam annotations using the Pfam applet . Signal peptides and transmembrane helices were predicted with SignalP  and TMHMM , respectively. Clustered regularly-interspaced short palindromic repeats sequences were identified using CRISPRFinder . Plant inducible promoter sequences in the promoter region (both strands) of genes were identified with the regular expression ‘TTCGN  TTCG’, where N is any nucleotide, as described by Lee et al. 2005 [41,42,43]. EffectiveDB was used to determine if MSCT1 contains functional T3SS, T4SS, and T6SS secretory systems. EffectiveDB also identified eukaryotic-like domains, potential T3SS, and potential T4SS secreted proteins in the MSCT1 predicted proteome. Additionally, blastp was used to align the proteins of the MSCT1 predicted proteome to the 502 proteins representing 53 effector families of Xanthomonas species found in the Xanthomonas effector database (Xanthomonas.org) . Transcription activator-like effectors and Repeat Variable Diresidues were predicted with AnnoTALE . TALgetter  was used to identify the DNA target domain on the G. hirsutum line TM − 1 promoterome .
The MSCT1 long read assembly had a sum length of 5,123,946 bp distributed along one large circular chromosome 5 Mb (Fig. 4) in length and 3 circular plasmids (60, 44, and 15 kb in length) (Table 3). Sequencing depth was 558.48 genome equivalents for the long read sequencing technology and 1820.26 genome equivalents for the Illumina PCR-free DNA library (Table 2). Dot plots determined the MSCT1 chromosome exhibited a high degree of sequence similarity to the circular chromosomes reported in previous X. citri pv. malvacearum assemblies (Fig. 5). A total of 4410 genes were predicted for MSCT1 including 4102 protein coding, 95 rRNA, and 213 pseudogenes (Table 4). The NCBI Prokaryotic Genome Annotation Pipeline added functional annotation to 2843 proteins.
The predominate COG functional classifications were R (general function), E (amino acid transport and metabolism), M (cell wall/membrane biogenesis), and H (coenzyme transport and metabolism), representing 16.31, 11.68, 10.36, and 9.68% of the predicted proteome, respectively (Table 5). InterProScan identified 3302 proteins containing at least one Pfam domain. In total, 3375 proteins contained at least one functional annotation from either the Pfam or COG annotations (Table 4). The rRNA segments were comprised of two copies of each of the 23S, 5S, and 16S rRNA subunits. At least one tRNA for each of the 20 basic amino acids was identified in the 54 predicted tRNA loci. Transmembrane helices prediction identified 911 proteins with at least one predicted transmembrane helix. Signal peptides were identified on 553 proteins; of these, after in silico cleavage of the predicted signal peptide, 23 contained a predicted transmembrane helix leaving 530 proteins that can be secreted from the cell. A single CRISPR sequence with a sequence length of 298 bp was predicted in the genome assembly in the 27,394 to 27,692 bp region of the MSCT1 chromosome. As is common in species of Xanthomonas multiple copies of the transposase coding genes were identified dispersed throughout the genome . In total 26 transpose genes were predicted in MSCT1, making it the fourth most abundant functional annotation in the proteome (Table 6).
Insights from the genome sequence
Functional T3SS, T4SS, and T6SS secretory systems were predicted in MSCT1. Comparison of the MSCT1 predicted proteins with previously described Xanthomonas effectors resulted in the identification of 7 families of effectors common among species of Xanthomonas (Table 7). These classes include AvrBs2, XopAG, XopK, XopP, XopR, XopT, and XopZ1. Effectors AvrBs2, XopK, XopP, XopR, and XopZ1, have been shown to suppress the host disease resistance response and immunity in other plant- Xanthomonas interactions [48,49,50,51,52,53,54]. XopAG effector family members have been shown to be responsible for eliciting the hyper-sensitive response in grapefruit . The predicted protein sequence WP_033481547.1, predicted from the MSCT1 genome, exhibited homology to AvrBs2 effector proteins from several species of Xanthomonas and contained a predicted glycerophosphoryl diester phosphodiesterase family (PF03009) domain characteristic of the AvrBs2 effector family . AvrBs2 produced by Xanthomonas campestris pv. vesicatoria is recognized by a NBS-LRR in peppers containing the Bs2 resistance gene; however, field strains of X. campestris pv. vesicatoria have been identified that evade the recognition [56, 57].
EffectiveDB predicted 408 T3SS and 44 T4SS secreted proteins. MSCT1 predicted secreted proteins that have previously been associated with diseases in G. hirsutum and other plant systems include; endoglucanase , polygalacturonase , glutathione S-transferase , pectate lyase , glutathione peroxidase , as well as catabolic enzymes such as peptidases and lipases. These protein likely aid the mediation of the host disease response as well as the breaking down of host tissues. The PIP-box sequence was identified 78 bp up stream of the start codon for the HrpB1 gene, that indicates gene regulation via PIP targeted transcription factors are present in the MSCT1 genome. EffectiveDB also identified 22 eukaryotic-like domains among 36 MSCT1 proteins. The most represented eukaryotic-like domains were the of M13 peptidase family (PF01431 and PF05649); however, M13 peptidases are commonly identified among bacteria .
AnnoTAL identified 8 potential CDS regions in the MSCT1 genome that could potentially code for TAL effectors (Table 8). AnnoTAL did not predict any TAL sequences in the other four draft X. citri pv. malvacearum genomes reported previously (GCF_000454505.1 (strain: X18), GCF_000454525.1 (strain: X20), GCF_000309925.1 (strain: GSPB22388) and GCF_000309915.1 (strain: GSPB1386)). Interestingly, 7 of the 8 TAL effectors in X. citri pv. malvacearum MSCT1 are located on plasmids. This arrangement is in contrast to other xanthomonads such as Xanthomonas oryzae pv. oryzae and Xanthomonas oryzae pv. oryzicola where the vast majority of TAL effectors are located on the large chromosome. The presence of the X. citri pv. malvacearum TAL effectors in X. citri pv. malvacearum plasmids can be traced back to the initial report by De Feyter et al. 1991 , that described 6 avirulence genes on a 90 kb X. citri pv. malvacearum plasmid [20,21,22]. However, the X. citri species and X. oryzae species such as X. oryzae pv. oryzae and X. oryzae pv. oryzicola exhibit evolutionarily divergence and fall into different clades among the other sequenced xanthomonads in phylogenic analysis . Although, the overall total number of TAL effectors found in MSCT1 (n = 8) is less than what has been previously reported for some X. oryzae pv. oryzae and X. oryzae pv. oryzicola strains it is similar to strains of X. translucens [43, 47, 66].
The variable dinucleotide repeats were identified in the 8 MSCT1 TAL sequences for recognition of the TAL DNA target domain with the previously reported TAL code (Table 9). Due to the inherit degeneracy nature of TAL DTD prediction [12, 45, 67,68,69,70], potential TAL DTDs reported in this study are limited to the top 2 DTD site predictions for each TAL with the additional constraint of being within 150 bp of the gene start codon. Interestingly, MSCT1 TALs (MSCT1-TAL2 and MSCT1-TAL8) with a DTD prediction had predictions on corresponding sections of the A and D sub-genomes of the G. hirsutum TM − 1 assembly . However, these in silico predictions still need to be confirmed with RNA expression data from studies of G. hirsutum undergoing infection by MSCT1. Of note, no MSCT1 TAL DTD was predicted to target any promoter region on G. hirsutum chromosome 14 or 20 that contain the B 2, B 3 and B 12 genes that are a major source of resistance to X. citri pv. malvacearum [71,72,73].
Of the predicted TALs only two, MSCT1-TAL2 and MSCT1-TAL8, had target sequences that fall within 100 bp of the start codon. MSCT1-TAL2 was predicted to target 21 bp from the start codon of the two paralogous proteins (Gh_A04G1143, Gh_D04G1757) found on chromosome 4 of each of the respective sub-genomes of tetraploid cotton. The proteins that MSCT1-TAL2 potential targets contain the ProSiteProfiles NAC domain profile (PS51005). The NAC domain has been reported to participate in both biotic and abiotic stress related responses . MSCT1-TAL8 targeted 19 and 20 base pairs upstream of the paralogous proteins (Gh_A01G1702, Gh_D01G1952) in the A and D sub-genomes of G. hirsutum, respectively.
The MSCT1 genome reported in this study is the first X. citri pv. malvacearum genome to be completed with long read DNA sequencing technology. The long read sequencing and assembly strategy allowed for the identification of eight TAL effectors in X. citri pv. malvacearum and makes the MSCT1 genome assembly the only X. citri pv. malvacearum genome with assembled TAL effectors. In addition to the TAL effector identification, many T3SS effectors were identified in MSCT1 genome. The genome assembly, as outlined in this paper, provides a basis for future epidemiological and pathogenesis studies of the X. citri pv. malvacearum-G. hirsutum pathogen host complex.
DNA target domain
Highly virulent strain
Multilocus sequence typing
Plant inducible promoter
Hillocks R. Bacterial blight. In: Hillocks R, editor. Cotton diseases. Wallingford: CAB International; 1992. p. 39–85.
Morgham AT, Richardson PE, Essenberg M, Cover EC. Effects of continuous dark upon ultrastructure, bacterial populations and accumulation of phytoalexins during interactions between Xanthomonas campestris pv. malvacearum and bacterial blight susceptible and resistant cotton. Physiol Mol Plant Pathol. 1988;32(1):141–62.
Al Mousawi A, Richardson P, Essenberg M, Johnson W. Ultrastructural studies of a compatible interaction between Xanthomonas campestris pv. malvacearum and cotton [Gossypium hirsutum, bacterial blight]. Phytopathology. 1982;72:1222-30.
Rudolph K. Infection of the plant by Xanthomonas. In: Swings JG, Civerolo EL, editors. Xanthomonas. London: Chapman & Hall; 1993. p. 193–264.
Knight R, Clouston T. The genetics of blackarm resistance. J Genet. 1939;38(1):133–59.
Bourland F, Myers GO. Conventional cotton breeding. In: Cotton. Madison: American Society of Agronomy, Inc., Crop Science Society of America, Inc., and Soil Science Society of America, Inc.; 2015.
Bourland FM, Jones DC. Registration of ‘UA103’ cotton cultivar. J Plant Registrations. 2013;7(2):135–9.
Bourland FM, Jones DC. Registration of Arkot 0305, Arkot 0306, Arkot 0309, and Arkot 0316 germplasm lines of cotton. J Plant Registrations. 2015;9(1):94–8.
Delannoy E, Lyon BR, Marmey P, Jalloul A, Daniel JF, Montillet JL, Essenberg M, Nicole M. Resistance of cotton towards Xanthomonas campestris pv. malvacearum. Annu Rev Phytopathol. 2005;43:63–82.
Buttner D, Bonas U. Regulation and secretion of Xanthomonas virulence factors. FEMS Microbiol Rev. 2010;34(2):107–33.
Zhang J, Yin Z, White F. TAL effectors and the executor R genes. Front Plant Sci. 2015;6:641.
Boch J, Bonas U, Lahaye T. TAL effectors – pathogen strategies and plant resistance engineering. New Phytol. 2014;204(4):823–32.
Mak AN-S, Bradley P, Bogdanove AJ, Stoddard BL. TAL effectors: function, structure, engineering and applications. Curr Opin Struct Biol. 2013;23(1):93–9.
Cunnac S, Bolot S, Forero Serna N, Ortiz E, Szurek B, Noel LD, Arlat M, Jacques MA, Gagnevin L, Carrere S, et al. High-quality draft genome sequences of two Xanthomonas citri pv. malvacearum strains. Genome Announc. 2013;1(4):1–2.
Zhai J, Xia Z, Liu W, Jiang X, Huang X. Genomic sequencing globally identifies functional genes and potential virulence-related effectors of Xanthomonas axonopodis pv. malvacearum. Eur J Plant Pathol. 2013;136(4):657–63.
National Cotton Council Disease Database. National Cotton Council of America. 2013. http://www.cotton.org/tech/pest/upload/Disease-Database-2013-2.xls. Accessed 25 Jan 2017.
Ah-You N, Gagnevin L, Grimont PA, Brisse S, Nesme X, Chiroleu F, Bui Thi Ngoc L, Jouen E, Lefeuvre P, Verniere C, et al. Polyphasic characterization of xanthomonads pathogenic to members of the Anacardiaceae and their relatedness to species of Xanthomonas. Int J Syst Evol Microbiol. 2009;59(Pt 2):306–18.
Gross DC, DeVay JE. Production and purification of syringomycin, a phytotoxin produced by Pseudomonas syringae. Physiol Plant Pathol. 1977;11(1):13–28.
Xu J, Deng P, Showmaker KC, Wang H, Baird SM, Lu S-E. The pqqC gene is essential for antifungal activity of Pseudomonas kilonensis JX22 against Fusarium oxysporum f. Sp. lycopersici. FEMS Microbiol Lett. 2014;353(2):98–105.
Yang Y, De Feyter R, Gabriel DW. Host-specific symptoms and increased release of Xanthomonas citri and X. campestris pv. malvacearum from leaves are determined by the 102-bp tandem repeats of pthA and avrb6, respectively. Mol Plant Microbe Interact. 1994;7(3):345–55.
Yang Y, Gabriel DW. Xanthomonas avirulence/pathogenicity gene family encodes functional plant nuclear targeting signals. Mol Plant Microbe Interact. 1995;8(4):627–31.
Boch J, Bonas U. Xanthomonas AvrBs3 family-type III effectors: discovery and function. Annu Rev Phytopathol. 2010;48(1):419–36.
Chen WP, Kuo TT. A simple and rapid method for the preparation of gram-negative bacterial genomic DNA. Nucleic Acids Res. 1993;21(9):2260.
Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27:722–36.
Rice P, Longden I, Bleasby A. EMBOSS: the European molecular biology open software suite. Trends Genet. 2000;16(6):276–7.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–10.
Chaisson MJ, Tesler G. Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory. BMC Bioinformatics. 2012;13(1):238.
Robinson JT, Thorvaldsdottir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP. Integrative genomics viewer. Nat Biotech. 2011;29(1):24–6.
Thorvaldsdóttir H, Robinson JT, Mesirov JP. Integrative genomics viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2013;14(2):178–92.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014;9(11):e112963.
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ, Angiuoli SV, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26(5):541–7.
Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 2016;44(14):6614–24.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421.
Galperin MY, Makarova KS, Wolf YI, Koonin EV. Expanded microbial genome coverage and improved protein family annotation in the COG database. Nucleic Acids Res. 2015;43(Database issue):D261–9.
Marchler-Bauer A, Derbyshire MK, Gonzales NR, Lu S, Chitsaz F, Geer LY, Geer RC, He J, Gwadz M, Hurwitz DI, et al. CDD: NCBI’s conserved domain database. Nucleic Acids Res. 2015;43(Database issue):D222–6.
Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R. InterProScan: protein domains identifier. Nucleic Acids Res. 2005;33(Web Server issue):W116–20.
Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011;8(10):785–6.
Krogh A, Larsson B, von Heijne G, Sonnhammer EL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001;305(3):567–80.
Grissa I, Vergnaud G, Pourcel C. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 2007;35(Web Server issue):W52–7.
Fenselau S, Bonas U. Sequence and expression analysis of the hrpB pathogenicity operon of Xanthomonas campestris pv. vesicatoria which encodes eight proteins with similarity to components of the Hrp, Ysc, spa, and Fli secretion systems. Mol Plant Microbe Interact. 1995;8(6):845–54.
Lee B-M, Park Y-J, Park D-S, Kang H-W, Kim J-G, Song E-S, Park I-C, Yoon U-H, Hahn J-H, Koo B-S, et al. The genome sequence of Xanthomonas oryzae pathovar oryzae KACC10331, the bacterial blight pathogen of rice. Nucleic Acids Res. 2005;33(2):577–86.
Peng Z, Hu Y, Xie J, Potnis N, Akhunova A, Jones J, Liu Z, White FF, Liu S. Long read and single molecule DNA sequencing simplifies genome assembly and TAL effector gene analysis of Xanthomonas translucens. BMC Genomics. 2016;17(1):1–19.
Grau J, Reschke M, Erkes A, Streubel J, Morgan RD, Wilson GG, Koebnik R, Boch J. AnnoTALE: bioinformatics tools for identification, annotation, and nomenclature of TALEs from Xanthomonas genomic sequences. Sci Rep. 2016;6:21077.
Grau J, Wolf A, Reschke M, Bonas U, Posch S, Boch J. Computational predictions provide insights into the biology of TAL effector target sites. PLoS Comput Biol. 2013;9(3):e1002962.
Zhang T, Hu Y, Jiang W, Fang L, Guan X, Chen J, Zhang J, Saski CA, Scheffler BE, Stelly DM, et al. Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM−1) provides a resource for fiber improvement. Nat Biotech. 2015;33(5):531–7.
Booher NJ, Carpenter SCD, Sebra RP, Wang L, Salzberg SL, Leach JE, Bogdanove AJ. Single molecule real-time sequencing of Xanthomonas oryzae genomes reveals a dynamic structure and complex TAL (transcription activator-like) effector gene relationships. Microbial Genomics. 2015;1(4):1–22.
Akimoto-Tomiyama C, Furutani A, Tsuge S, Washington EJ, Nishizawa Y, Minami E, Ochiai H. XopR, a type III effector secreted by Xanthomonas oryzae pv. oryzae, suppresses microbe-associated molecular pattern-triggered immunity in Arabidopsis thaliana. Mol Plant-Microbe Interact. 2011;25(4):505–14.
Ishikawa K, Yamaguchi K, Sakamoto K, Yoshimura S, Inoue K, Tsuge S, Kojima C, Kawasaki T. Bacterial effector modulation of host E3 ligase activity suppresses PAMP-triggered immunity in rice. Nat Commun. 2014;5:1–11.
Li S, Wang Y, Wang S, Fang A, Wang J, Liu L, Zhang K, Mao Y, Sun W. The type III effector AvrBs2 in Xanthomonas oryzae pv. Oryzicola suppresses rice immunity and promotes disease development. Mol Plant-Microbe Interact. 2015;28(8):869–80.
Sinha D, Gupta MK, Patel HK, Ranjan A, Sonti RV. Cell wall degrading enzyme induced rice innate immune responses aresuppressed by the type 3 secretion system effectors XopN, XopQ, XopX and XopZ of Xanthomonas oryzae pv. oryzae. PLoS One. 2013;8(9):e75867.
Song C, Yang B. Mutagenesis of 18 type III effectors reveals virulence function of XopZPXO99 in Xanthomonas oryzae pv. oryzae. Mol Plant-Microbe Interact. 2010;23(7):893–902.
Wang S, Sun J, Fan F, Tan Z, Zou Y, Lu D. A Xanthomonas oryzae pv. oryzae effector, XopR, associates with receptor-like cytoplasmic kinases and suppresses PAMP-triggered stomatal closure. Sci China Life Sci. 2016;59(9):897–905.
Zhao S, Mo W-L, Wu F, Tang W, Tang J-L, Szurek B, Verdier V, Koebnik R, Feng J-X. Identification of non-TAL effectors in Xanthomonas oryzae pv. oryzae Chinese strain 13751 and analysis of their role in the bacterial virulence. World J Microbiol Biotechnol. 2013;29(4):733–44.
Escalon A, Javegny S, Vernière C, Noël LD, Vital K, Poussier S, Hajri A, Boureau T, Pruvost O, Arlat M, et al. Variations in type III effector repertoires, pathological phenotypes and host range of Xanthomonas citri pv. citri pathotypes. Mol Plant Pathol. 2013;14(5):483–96.
Gassmann W, Dahlbeck D, Chesnokova O, Minsavage GV, Jones JB, Staskawicz BJ. Molecular evolution of virulence in natural field strains of Xanthomonas campestris pv. vesicatoria. J Bacteriol. 2000;182(24):7053–9.
Tai TH, Dahlbeck D, Clark ET, Gajiwala P, Pasion R, Whalen MC, Stall RE, Staskawicz BJ. Expression of the Bs2 pepper gene confers resistance to bacterial spot disease in tomato. Proc Natl Acad Sci U S A. 1999;96(24):14153–8.
Wubben MJ, Ganji S, Callahan FE. Identification and molecular characterization of a β−1,4-endoglucanase gene (Rr-eng-1) from Rotylenchulus reniformis. J Nematol. 2010;42(4):342–51.
Showmaker KC, Bednárová A, Gresham C, Hsu C-Y, Peterson DG, Krishnan N. Insight into the salivary gland transcriptome of Lygus lineolaris (Palisot de Beauvois). PLoS One. 2016;11(1):e0147197.
Espada M, Jones JT, Mota M. Characterization of glutathione S-transferases from the pine wood nematode. Nematology. 2016;18(6):697–709.
Danchin EGJ, Rosso M-N, Vieira P, de Almeida-Engler J, Coutinho PM, Henrissat B, Abad P. Multiple lateral gene transfers and duplications have promoted plant parasitism ability in nematodes. Proc Natl Acad Sci. 2010;107(41):17651–6.
Jones JT, Reavy B, Smant G, Prior AE. Glutathione peroxidases of the potato cyst nematode Globodera Rostochiensis. Gene. 2004;324:47–54.
Bianchetti L, Oudet C, Poch O. M13 endopeptidases: new conserved motifs correlated with structure, and simultaneous phylogenetic occurrence of PHEX and the bony fish. Proteins. 2002;47(4):481–8.
De Feyter R, Gabriel DW. At least six avirulence genes are clustered on a 90-kilobase plasmid in Xanthomonas campestris pv. malvacearum. Mol Plant-Microbe Interact. 1991;4(5):423–32.
Rodriguez-R LM, Grajales A, Arrieta-Ortiz ML, Salazar C, Restrepo S, Bernal A. Genomes-based phylogeny of the genus Xanthomonas. BMC Microbiol. 2012;12(1):43.
Hersemann L, Wibberg D, Widmer F, Vorhölter F-J, Kölliker R. Draft genome sequences of three Xanthomonas translucens pathovar reference strains (pv. arrhenatheri, pv. poae and pv. phlei) with different specificities for forage grasses. Stand Genomic Sci. 2016;11(1):50.
Cernadas RA, Doyle EL, Niño-Liu DO, Wilkins KE, Bancroft T, Wang L, Schmidt CL, Caldo R, Yang B, White FF, et al. Code-assisted discovery of TAL effector targets in bacterial leaf streak of rice reveals contrast with bacterial blight and a novel susceptibility gene. PLoS Pathog. 2014;10(2):e1003972.
Moscou MJ, Bogdanove AJ. A simple cipher governs DNA recognition by TAL effectors. Science (New York, NY). 2009;326(5959):1501.
Boch J, Scholze H, Schornack S, Landgraf A, Hahn S, Kay S, Lahaye T, Nickstadt A, Bonas U. Breaking the code of DNA binding specificity of TAL-type III effectors. Science (New York, NY). 326(5959):2009, 1509–1512.
Booher NJ, Bogdanove AJ. Tools for TAL effector design and target prediction. Methods. 2014;69(2):121–7.
Silva RA, Barroso PAV, Hoffmann LV, Giband M, Coutinho WM. A SSR marker linked to the B 12 gene that confers resistance to race 18 of Xanthomonas axonopodis pv. malvacearum in cotton is also associated with other bacterial blight resistance gene complexes. Australas Plant Pathol. 2014;43(1):89–91.
Wallace TP, El-Zik KM. Inheritance of resistance in three cotton cultivars to the HV1 isolate of bacterial blight. Crop Sci. 1989;29(5):1114–9.
Wright RJ, Thaxton PM, El-Zik KM, Paterson AH. D-subgenome bias of Xcm resistance genes in tetraploid Gossypium (cotton) suggests that polyploid formation has created novel avenues for evolution. Genetics. 1998;149(4):1987.
Nuruzzaman M, Sharoni AM, Kikuchi S. Roles of NAC transcription factors in the regulation of biotic and abiotic stress responses in plants. Front Microbiol. 2013;4:248.
Field D, Amaral-Zettler L, Cochrane G, Cole JR, Dawyndt P, Garrity GM, Gilbert J, Glöckner FO, Hirschman L, Karsch-Mizrachi I, et al. The genomic standards consortium. PLoS Biol. 2011;9(6):e1001088.
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, bacteria, and Eucarya. Proc Natl Acad Sci U S A. 1990;87(12):4576–9.
Garrity GM, Bell JA, Lilburn T. Phylum XIV. Proteobacteria phyl. Nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT, editors. Bergey’s manual of systematic bacteriology volume 2, part B. New York: Springer; 2005.
Garrity GM, Bell JA, Class LT. Gammaproteobacteria class. Nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT, editors. Bergey’s manual of systematic bacteriology volume 2, part B. New York: Springer; 2005.
Euzéby J. Validation of publication of new names and new combinations previously effectively published outside the IJSEM. List no. 106. Int J Syst Evol Microbiol. 2005;55:2235–8.
Dowson D. On the systematic position and generic names of the gram negative bacterial plant pathogens. Zentralblatt fur Bakteriologie, Parasitenkunde und Infektionskrankheiten, 2. 1939;100:177–93.
Hayward A, Waterson C. Xanthomonas malvacearum. UK: Commonwealth Mycological Institute Kew; 1964.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet. 2000;25(1):25–9.
Furutani A, Takaoka M, Sanada H, Noguchi Y, Oku T, Tsuno K, Ochiai H, Tsuge S. Identification of novel type III secretion effectors in Xanthomonas oryzae pv. oryzae. Mol Plant-Microbe Interact. 2008;22(1):96–106.
Mutka AM, Fentress SJ, Sher JW, Berry JC, Pretz C, Nusinow DA, Bart R. Quantitative, image-based phenotyping methods provide insight into spatial and temporal dimensions of plant disease. Plant Physiol. 2016;172(2):650–60.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80.
Tamura K, Nei M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. 1993;10(3):512–26.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30(12):2725–9.
Grant JR, Stothard P. The CGView server: a comparative genomics tool for circular genomes. Nucleic Acids Res. 2008;36(Web Server issue):W181–4.
Noé L, Kucherov G. YASS: enhancing the sensitivity of DNA similarity search. Nucleic Acids Res. 2005;33(suppl 2):W540–3.
We would like acknowledge Amanda Lawrence from the Mississippi State University Institute for Imaging & Analytical Technologies for assistance in the generation of the TEM imaging of MSCT1.
This work was funded by Cotton Incorporated 13-479 (to SL and DGP); USDA ARS 58-6402 − 1-644 (to DGP), and USDA ARS 58-6066-6-046 (to DGP).
There are no significant competing financial, professional or personal interests that might have influenced the performance or presentation of the research reported in this manuscript.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Showmaker, K.C., Arick, M.A., Hsu, CY. et al. The genome of the cotton bacterial blight pathogen Xanthomonas citri pv. malvacearum strain MSCT1. Stand in Genomic Sci 12, 42 (2017). https://doi.org/10.1186/s40793-017-0253-3
- Xanthomonas citri pv. malvacearum
- Bacterial blight
- TAL effectors
- Long read DNA sequencing