- Extended genome report
The draft genome of Brucella abortus strain Ba Col-B012, isolated from a dairy farm in Nariño, Colombia, brings new insights into the epidemiology of biovar 4 strains
Standards in Genomic Sciences volume 12, Article number: 89 (2017)
Brucellosis is a commonly diagnosed zoonosis that causes infertility and abortion in cattle; it is acquired by handling infected animals or by consuming contaminated milk or milk products. In Colombia, it is on the official list of notifiable diseases. Despite its relevance, little is known about the origin, epidemiology, and genetic constituents of the strains circulating in dairy farms. Here we present the draft genome of B. abortus Ba Col-B012, an isolate obtained from a female Holstein belonging to a dairy farm in Nariño, Colombia. This genome comprises 3,234,714 bp and 3018 predicted protein-encoding genes. Using comparative genomics and phylogenetic analysis, we found that strain Ba Col-B012 clustered with known biovar 4 variants. The analysis of the core genes allowed the identification of polymorphisms present only in biovar 4 genomes; these regions are proposed as possible targets for identification by PCR. The sequencing of the B. abortus Ba Col-B012 genome provides important insights to improve the diagnosis and the epidemiology of this disease and represents the first report of biovar 4 in Colombia.
Brucellosis is one of the most important zoonotic diseases and causes infertility and abortion in cattle. In livestock, brucellosis is mainly caused by Brucella abortus, a Gram-negative coccobacillus that behaves as a facultative intracellular pathogen. There are up to eight variants of this species that differ in their physiological characteristics and are classified as biovars. However, some of these biovars differ only slightly, and their status as true variants is unresolved. Some biovars have a wide geographic distribution: B. abortus biovar 1 and biovar 2 are found around the world, while others, such as biovar 5, are mainly distributed in Europe. In South America, recent studies have identified several biovars. For instance, a survey of a 30-year B. abortus collection from Brazil found biovars 1, 2, and 3, while in Ecuador biovars 1 and 4 have been reported. However, there is still insufficient information to establish biovar presence and distribution in other countries of the continent. In Colombia, even though there are regions with high prevalence and isolation of B. abortus [4, 5], there are no reports identifying the corresponding biovars.
The genome presented here belongs to a larger collection of pathogens isolated as part of a monitoring program to identify the principal infectious agents related to infertility and abortion in cattle in the southern part of Colombia. During this survey, 12 B. abortus strains were isolated from dairy farms (Nariño, Colombia). Recently, some of these strains were typed using AMOS-ERY-PCR and MLVA methods, and a representative isolate was chosen for sequencing. Here we present the draft-quality genome of this strain, B. abortus Ba Col-B012. This genome contributes to a better understanding of the genomic constituents of local isolates and to the identification of virulence factors and conserved genes that code for immunogenic proteins, which could eventually be used in the development of vaccines and new serological tests.
Classification and features
Brucella abortus is a non-motile, Gram-negative short bacillus measuring about 0.6 to 1.5 μm by 0.5 to 0.7 μm (Fig. 1). The species belongs to the family Brucellaceae, class Alphaproteobacteria, and phylum Proteobacteria. Colonies are smooth, small, round, convex, and non-pigmented; on Brucella agar, small colorless punctate colonies appear within 48 to 72 h at 37 °C. Although the organisms are aerobes, a CO2 atmosphere may enhance growth.
Brucella abortus strain Ba Col-B012 was obtained from a female Holstein with an episode of abortion. The sample was taken from vaginal fluids with a swab, and isolation was done on trypticase soy agar and brain infusion agar supplemented with 5% horse serum; the media were incubated at 37 °C for 72 to 96 h in a 5% CO2 atmosphere. Small transparent colonies with regular edges were obtained. Isolates were non-motile and positive for the urease and oxidase tests and for the agglutination test with polyclonal anti-Brucella abortus antibody (Difco). A summary of the classification and general features of B. abortus strain Ba Col-B012 is presented in Table 1.
Genome sequencing and information
Genome project history
B. abortus strain Ba Col-B012 was isolated as part of a monitoring program to identify the principal infectious agents related to infertility and abortion in cattle in the southern part of Colombia. The main objective of sequencing B. abortus genomes is to explore the genomic constituents of the local isolates and to identify virulence factors, polymorphic regions, and immunogenic proteins that can eventually be used in the development of vaccines and new serological and molecular tests. A summary of the project information is shown in Table 2.
Growth conditions and genomic DNA preparation
Brucella abortus strain Ba Col-B012 was grown on trypticase soy agar and brain infusion agar supplemented with 5% horse serum and incubated at 37 °C for 72 h. Genomic DNA was extracted with the CTAB-phenol-chloroform method coupled to ethanol precipitation. DNA was quantified using the dsDNA HS (High Sensitivity) kit on a Qubit™ (Life Technologies); a DNA concentration greater than 30 ng/μl was obtained. DNA quality and purity were determined by spectrophotometry (Nanodrop® 2000, Thermo Fisher Scientific), giving 260/280 and 260/230 ratios of 2.
Genome sequencing and assembly
Whole-genome sequencing of B. abortus strain Ba Col-B012 was performed on the Illumina HiScan SQ (Molecular Biology Lab, Corpoica). Libraries were generated using the Sure Select Strand Agilent Sample Preparation. Once the DNA concentration was determined, library amplification was done with the TruSeq PE Cluster Kit v3 (Illumina) on a cBot (Illumina). For de novo assembly, we used 3,956,238 paired-end Illumina reads (150 bp) and the Newbler v 2.0.01.14 software. The assembly resulted in 233 contigs with a total genome length of 3,227,565 bp and 50× average coverage.
Gene prediction was conducted with GeneMarkS+ and PRODIGAL, and annotation was done automatically using the NCBI Prokaryotic Genome Annotation Pipeline. The annotation was corrected manually using data from different databases (Swiss-Prot and RAST). We used LipoP v 1.0 to find genes with signal peptides and transmembrane helices.
The genome statistics are provided in Table 3. The N50 contig size is 22,624 bp, the maximum contig size is 106,301 bp, and the G + C content is 57.28 mol%. These values are similar to those reported for the B. abortus genomes NC_006932.1, NZ_CP007709.1 and NZ_CP007705.1 at NCBI. Using our annotation pipeline, it was possible to identify 3227 predicted genes, of which 3018 were putatively protein-encoding, 166 were pseudogenes, 42 were tRNAs and 1 was an ncRNA. A function could be assigned to the majority of the protein-encoding genes (78.12%). The distribution of these genes into COG functional categories is shown in Table 4. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession LODQ00000000. The version described in this paper is version LODQ01000000.
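The N50 and G + C statistics reported above can be recomputed from any set of contigs. The following is a minimal Python sketch, not the pipeline used in this study. The function names `n50` and `gc_percent` are illustrative, and ambiguous bases (for example N) are assumed to count toward the total length but not toward G + C.

```python
def n50(lengths):
    """Return the N50 of a list of contig lengths.

    N50 is the length of the shortest contig in the smallest set of
    longest contigs that together cover at least half of the assembly.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no contigs given")


def gc_percent(sequences):
    """Return the G + C content (percent) over all given sequences."""
    total = sum(len(seq) for seq in sequences)
    if total == 0:
        raise ValueError("no sequence data given")
    gc = sum(seq.upper().count("G") + seq.upper().count("C")
             for seq in sequences)
    return 100.0 * gc / total
```

For a real assembly, `lengths` would hold the 233 contig lengths and `sequences` the contig sequences read from the deposited FASTA file.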
Insights from genome sequences
Genomes used in this study
A total of 28 B. abortus genomes were downloaded from the NCBI database of complete and draft bacterial genomes. Although many more genomes are available in the database, only those with an identified biovar were used for further analyses. The genomes and their GenBank accession numbers are listed in Table 5. The genes used in the analysis were predicted from the genomes using PRODIGAL with the default settings.
Genomic differences between B. abortus Ba Col-B012 and the type strain B. abortus 2308
The comparative genomic analysis of B. abortus strain Ba Col-B012 and the type strain, B. abortus 2308, shows that the two genomes share 3015 genes, most of which are identical (2862 genes with 100% nucleotide identity). Within this set there are around 12 divergent genes, with nucleotide identities ranging from 77 to 94% (Additional file 1: Table S1). These include an ABC transporter permease, a benzoate transporter, an alpha/beta hydrolase, a 5-hydroxymethyluracil DNA glycosylase, several hypothetical proteins and a hemolysin D gene (HlyD). Hemolysin D is part of the membrane transporter for HlyA, a pore-forming toxin that affects the host membrane. We also identified 16 genes present in strain Ba Col-B012 that were not found in the type strain (Additional file 1: Table S2). Most genes in this set encode hypothetical proteins, transporters and transcriptional regulators. These differences show that strain Ba Col-B012 differs from the type strain 2308. To elucidate whether these differences are related to the biovar classification, a comparative genomic analysis with more strains is presented in the next section.
The evolutionary distance and phylogenetic relationship of B. abortus strain Ba Col-B012
A phylogenomic approach was used to establish the evolutionary relationship of B. abortus strain Ba Col-B012 and to evaluate whether biovars are congruent with true genetic groupings. The phylogenetic analysis was done by concatenating the alignments of orthologous genes shared by all strains. To identify a set of orthologous genes, an in-house PERL script that incorporates the reciprocal best match approach was used. In brief, the predicted genes of strain Ba Col-B012 were searched using the blastn algorithm against the genomic sequences of each of the remaining genomes. The best match for each query gene (requiring more than 70% identity and alignment coverage) was extracted and searched against the complete gene complement of strain Ba Col-B012 to identify reciprocal best matches. The reciprocal best match genes were denoted as orthologues; 3139 orthologous genes were shared among all strains, and of these 2169 were identical among all strains (100% nucleotide identity). Average nucleotide identity (ANI) was quantified using the nucleotide identity of orthologues between strain Ba Col-B012 and the other genomes. ANI is a measurement of genomic divergence that is used in modern taxonomy as the gold standard to delimit new species [19, 20]. The ANI values between Ba Col-B012 and the rest of the strains were higher than 99.6% (Additional file 1). These high identities reflect the close evolutionary relationship among B. abortus strains, which makes the identification of biovar variants difficult. Despite the close relationship among all genomes, strain Ba Col-B012 showed its closest affiliation with biovar 4 strains (99.88%).
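The reciprocal best match step and the ANI calculation described above can be illustrated with a short Python sketch. This is not the in-house PERL script: it assumes the blastn results have already been parsed into tuples of `(query, subject, identity, coverage, bitscore)`, that the 70% cutoff applies to both identity and coverage, that the best match is the one with the highest bit score, and that ANI is the unweighted mean identity of the orthologue pairs. All function names are hypothetical.

```python
def best_hits(hits, min_identity=70.0, min_coverage=70.0):
    """Keep, for each query, the highest-scoring hit above both cutoffs.

    hits: iterable of (query, subject, identity, coverage, bitscore).
    Returns {query: (subject, identity, bitscore)}.
    """
    best = {}
    for query, subject, identity, coverage, bitscore in hits:
        if identity <= min_identity or coverage <= min_coverage:
            continue
        if query not in best or bitscore > best[query][2]:
            best[query] = (subject, identity, bitscore)
    return best


def reciprocal_best_hits(forward_hits, reverse_hits):
    """Return {query: (subject, identity)} for reciprocal best matches."""
    forward = best_hits(forward_hits)
    reverse = best_hits(reverse_hits)
    pairs = {}
    for query, (subject, identity, _) in forward.items():
        if subject in reverse and reverse[subject][0] == query:
            pairs[query] = (subject, identity)
    return pairs


def average_nucleotide_identity(pairs):
    """Unweighted mean identity (percent) over orthologue pairs."""
    if not pairs:
        raise ValueError("no orthologue pairs given")
    return sum(identity for _, identity in pairs.values()) / len(pairs)
```

Here `forward_hits` would hold the Ba Col-B012 genes searched against another genome, and `reverse_hits` the extracted matches searched back against the Ba Col-B012 gene set.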
To corroborate the affiliation of Ba Col-B012 with biovar 4, the phylogenetic relationship of the shared polymorphic genes, around 2961 genes, was inferred using the Neighbor Joining algorithm with the Jukes-Cantor distance and 1000 bootstrap replicates (Fig. 2). As shown by the ANI analyses, strain Ba Col-B012 was most closely related to the biovar 4 strains, clustering in the same clade with a bootstrap value of 100. This represents the first confirmed report of a biovar 4 strain in Colombia and may suggest a possible transfer from Ecuador, the country that borders the Nariño region and where biovar 4 has been reported.
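The Jukes-Cantor distance used for the tree corrects the observed proportion of differing sites, p, for multiple substitutions at the same site: d = -(3/4) ln(1 - 4p/3). A minimal Python sketch for two aligned sequences follows. It assumes that sites with a gap or an ambiguous base in either sequence are skipped, which may differ from the settings used for Fig. 2, and the function names are illustrative.

```python
import math


def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap or a non-ACGT symbol are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def jukes_cantor_distance(seq_a, seq_b):
    """Jukes-Cantor corrected distance: d = -(3/4) * ln(1 - 4p/3)."""
    p = p_distance(seq_a, seq_b)
    if p >= 0.75:
        raise ValueError("sequences too divergent for the Jukes-Cantor model")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)
```

At the identities reported here (above 99.6%), p is below 0.004 and the corrected distance is almost equal to p, so the correction has little effect on the tree.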
Use of polymorphic regions in the identification of B. abortus biovar 4 and their potential for diagnosis and vaccination
Current identification of biovars is based on standard microbiological methods and molecular approaches such as MLVA analysis. MLVA is a highly discriminatory method that is particularly useful in epidemiological studies and in the identification of genetic variability among strains. However, this methodology is not always conclusive. To complement the current methods of diagnosis with PCR-based amplification and sequencing, we identified orthologous regions that could be used to differentiate biovar 4 genomes from others. We found around 42 genes with polymorphisms that differentiate biovar 4 genomes from the rest. Most genes have only one single nucleotide polymorphism (SNP), and almost half of the SNPs in this set are non-synonymous. Of all evaluated genes, only one hypothetical gene has two polymorphisms that are synonymous (set 12). We also found two genes that have insertion-deletions and three genes that are shorter than their biovar 1 counterparts due to the presence of an early stop codon (see Table 6 for a description of genes and differences). All gene sets described in the analysis are provided in the Additional files section.
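Whether a SNP is synonymous, non-synonymous, or introduces an early stop codon can be checked by translating the affected codon before and after the change. The Python sketch below illustrates the idea; it is not the procedure used to build Table 6. It assumes an in-frame coding sequence on the coding strand, a 0-based SNP position, and the standard codon-to-amino-acid assignments. The name `classify_snp` and the label strings are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def classify_snp(cds, position, alt_base):
    """Classify a single-base change in an in-frame coding sequence.

    cds: coding sequence (A/C/G/T) starting at the first codon position.
    position: 0-based index of the changed base within cds.
    alt_base: the base observed in the variant genome.
    Returns "synonymous", "non-synonymous", or "nonsense" (new stop codon).
    """
    cds = cds.upper()
    alt_base = alt_base.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    if not 0 <= position < usable:
        raise ValueError("position outside the complete codons of the CDS")
    offset = position % 3
    start = position - offset
    ref_codon = cds[start:start + 3]
    alt_codon = ref_codon[:offset] + alt_base + ref_codon[offset + 1:]
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    if ref_aa == alt_aa:
        return "synonymous"
    if alt_aa == "*":
        return "nonsense"
    return "non-synonymous"
```

For example, in the toy sequence `ATGGCTTGGTAA`, changing the last base of the second codon (GCT to GCC) leaves alanine unchanged, while changing the last base of the third codon (TGG to TGA) creates a stop codon.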
To design primers for genetic markers of biovar 4, we focused on orthologues amplifiable by PCR (<400 bp) that have large INDELs or synonymous polymorphisms; this choice is intended to ensure that the observed changes are not under selection. We identified six genes that met these criteria: a hypothetical protein with similarity to the BA14K family domain (gene set 8), a hypothetical protein (gene set 12), DNA-3-methyladenine glycosylase (gene set 13), tyrosine--tRNA ligase (gene set 19), glutamine synthetase (gene set 30), and an ABC transporter permease (gene set 40). Based on these genes, we designed sets of primers that amplify the polymorphic regions and can therefore be used for identification. Table 7 summarizes the designed primers and their predicted PCR conditions.
Comparative genome analysis of B. abortus strains is a powerful tool for the identification of allele variants and polymorphisms that modulate virulence. Interestingly, among the identified polymorphic genes, two have been associated with pathogenicity and immune response: a hypothetical protein with similarity to the BA14K family domain (Table 6, gene set 8) and a gene coding for subunit B of the exonuclease ABC (Table 6, gene set 7). The BA14K domain has been shown to induce strong immunoreactivity in mice, with a Th1 response and induction of IL-12 secretion. Changes in subunit B of exonuclease ABC, in turn, have been associated with minor virulence changes between attenuated and virulent Brucella strains. It is also worth mentioning that several other genes identified as polymorphic might also display immunogenic reactivity, as the proteins they encode are located in the membrane at the interface with the environment; for instance, several transporters in B. abortus have been used to produce in vivo-induced antigens. These genes are potential targets for future vaccination and diagnosis.
The genome of B. abortus Ba Col-B012 contributes to a better understanding of the distribution and origin of zoonotic pathogens in Colombia and South America. A better representation of biovar genomes can be used to elucidate the correspondence between evolutionary relationships and phenotypic characteristics. The phylogenomic relationship between strain Ba Col-B012 and the examined genomes shows that biovar 4 strains form a distinctive clade with high bootstrap support. This pattern is not observed for other biovars; for example, strain 90–0737 and strain B10–0018, which cluster in the same clade, are classified into different biovar groups. The clear clustering of biovar 4 genomes reflects a common ancestor of the group and suggests the existence of allele differences that might be associated with the phenotypic and pathogenic characteristics of the group. Finally, the identification of genomic regions distinctive to biovar 4 allowed us to design sets of primers that, coupled with sequencing, could be incorporated into current methods of identification to distinguish biovar 4 strains from others. The B. abortus Ba Col-B012 genome provides important insights to improve the diagnosis and the epidemiology of this disease and represents the first report of biovar 4 in Colombia.
AAI: Average Amino Acid Identity
MLVA: Multiple-Locus Variable number tandem repeat Analysis
Garin-Bastuji B. Brucelloses bovine, ovine et caprine: Contrôle et prevention. Le Point vétérinaire: revue d'enseignement post-universitaire et de formation permanente. 1993;25:15–22.
Minharro S, Silva Mol J, Dorneles E, Pauletti R, Neubauer H, Melzer F, Poester F, Dasso M, Pinheiro E, Soares Filho P, Santos R, Heinemann M, Lage A. Biotyping and genotyping (MLVA16) of Brucella abortus isolated from cattle in Brazil, 1977 to 2008. PLoS One. 2013; https://doi.org/10.1371/journal.pone.0081152.
Rodriguez-Hidalgo R, Contreras-Zamora J, Benitez-Ortiz W, Guerrero-Viracocha K, Salcan-Guaman H, Minda E, Ron Garrido L. Circulating strains of Brucella abortus in cattle in Santo Domingo de Los Tsáchilas Province – Ecuador. Frontiers in Public Health. 2015;3:45.
Rivera DY, Rueda OE, Calderon CP, Marino OC, Gall D, Nielsen K. Comparative evaluation of the indirect enzyme-linked immunosorbant assay in milk for the detection of cattle infected with Brucella abortus, in herds located in the province of Cundinamarca, Colombia. Revue Scientifique et Technique (International Office of Epizootics). 2003;22(3):1065–75.
Griffiths IB, Gallego MI, De Leon LS. Levels of some reproductive diseases in the dairy cattle of Colombia. Trop Anim Health Prod. 1984;16(4):219–23.
González Cardona HG, Patiño Burbano RE. Principales agentes infectocontagiosos del aborto e infertilidad en el ganado lechero de Nariño y Alto Putumayo. 1999. http://bibliotecadigital.agronet.gov.co/bitstream/11348/3879/1/20061127144049_Agentes%20aborto%20infertilidad%20ganado%20lechero.pdf. Accessed 12 November 2016.
Ocampo-Sosa A, Agüero-Balbín J, García-Lobo J. Development of a new PCR assay to identify Brucella abortus biovars 5, 6 and 9 and the new subgroup 3b of biovar 3. Vet Microbiol. 2005;110:41–51.
Bricker BJ, Ewalt DR, Halling SM. Brucella 'HOOF-Prints': strain typing by multi-locus analysis of variable number tandem repeats (VNTRs). BMC Microbiol. 2003;3(15):1–13.
Ausubel FM, Brent R, Kingston RE, Moore DD, Seidman JG, Smith JA, Struhl K. Current Protocols in Molecular Biology. Vol I. Greene Publishing Associates and John Wiley & Sons; 1997. p. 2.4.2–2.4.3.
Besemer J, Lomsadze A, Borodovsky M. GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res. 2001;29:2607–18.
Hyatt D, Chen G, LoCascio P, Land M, Larimer F, Hauser L. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119.
Gasteiger E, Jung E, Bairoch A. SWISS-PROT: connecting biomolecular knowledge via a protein database. Current Issues Mol Biol. 2001;3:47–55.
Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, et al. The RAST server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75. https://doi.org/10.1186/1471-2164-9-75.
Juncker AS, Willenbrock H, Von Heijne G, Brunak S, Nielsen H, Krogh A. Prediction of lipoprotein signal peptides in gram-negative bacteria. Protein Sci. 2003;12:1652–62.
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, et al. The COG database: an updated version includes eukaryotes. BMC Bioinformatics. 2003;4:41.
Lenders MH, Beer T, Smits SH, Schmitt L. In vivo quantification of the secretion rates of the hemolysin A type I secretion system. Sci Rep. 2016;6.
Konstantinidis K, Serres M, Romine M, Rodrigues J, Auchtung J, McCue L, Lipton M, Obraztsova A, Giometti C, Nealson K, Fredrickson J, Tiedje J. Comparative systems biology across an evolutionary gradient within the Shewanella genus. Proc Natl Acad Sci U S A. 2009;106:15909–14.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.
Konstantinidis K, Tiedje J. Genomic insights that advance the species definition for prokaryotes. Proc Natl Acad Sci U S A. 2005;102:2567–72.
Richter M, Rossello-Mora R. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci U S A. 2009;106:19126–31.
Tiller R, De B, Boshra M, Huynh L, Van Ert M, Wagner D, Klena J, Mohsen T, El-Shafie S, Keim P, Hoffmaster A, Wilkins P, Pimentel G. Comparison of two multiple-locus variable-number tandem-repeat analysis methods for molecular strain typing of human Brucella melitensis isolates from the middle east. J Clin Microbiol. 2009;47:2226–31.
Chirhart-Gilleland RL, Kovach ME, Elzer PH, Jennings SR, Roop RM. Identification and characterization of a 14-kilodalton Brucella abortus protein reactive with antibodies from naturally and experimentally infected hosts and T lymphocytes from experimentally infected BALB/c mice. Infect Immun. 1998;66:4000–3.
Crasta O, Folkerts O, Fei Z, Mane S, Evans C, Martino-Catt S, Bricker B, Yu G, Du L, Sobral B. Genome sequence of Brucella abortus vaccine strain S19 compared to virulent strains yields candidate virulence genes. PLoS One. 2008;3:e2193.
Lowry J, Isaak D, Leonhardt J, Vernati G, Pate J, Andrews G. Vaccination with Brucella abortus recombinant in vivo-induced antigens reduces bacterial load and promotes clearance in a mouse model for infection. PLoS One. 2011;6:e17425.
Garrity GM, Bell JA, Lilburn T. Phylum XIV. Proteobacteria phyl. nov. In: Garrity GM, Brenner D, Krieg N, Staley JA, editors. Bergey’s manual of systematic bacteriology. New York: Springer; 2005.
Euzéby J. Validation list no. 107. List of new names and new combinations previously effectively, but not validly, published. Int J Syst Evol Microbiol. 2006;56:1–6.
Garrity GM, Bell JA, Lilburn T. Class I. Alphaproteobacteria class. nov. In: Garrity GM, Brenner D, Krieg N, Staley JA, editors. Bergey’s manual of systematic bacteriology. New York: Springer; 2005.
Kuykendall LD. Order VI. Rhizobiales ord. nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT, editors. Bergey’s Manual of Systematic Bacteriology. 2nd ed. New York: Springer-Verlag; 2005. p. 324.
Breed RS, Murray EGD, Smith NR. Family V Brucellaceae, nom. nov. Bergey's Manual of Determinative Bacteriology. 1957;394–423.
Skerman VBD, McGowan V, Sneath PHA. Approved lists of bacterial names. Int J Syst Bacteriol. 1980;30:225–420.
Meyer KF, Shaw EB. A comparison of the morphologic, cultural and biochemical characteristics of B. abortus and B. melitensis from cattle. Studies on the genus Brucella nov. gen. J Infect Dis. 1920;27:173–84.
López-Merino A, Monnet DL, Hernández I, Sánchez NL, Boeufgras JM, Sandoval H, Freney J. Identification of Brucella abortus, B. canis, B. melitensis, and B. suis by carbon substrate assimilation tests. Vet Microbiol. 2001;80:359.
Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987;4:406–25.
Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985;39:783–91.
Tamura K, Nei M, Kumar S. Prospects for inferring very large phylogenies by using the neighbor-joining method. Proc Natl Acad Sci U S A. 2004;101:11030–5.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis Version 6.0. Mol Biol Evol. 2013;30:2725–9.
We thank Yolanda Gomez Vargas and Johan Bernal Morales for their contributions to the DNA extraction, the preparation of the genomic libraries, and the sequencing.
This study was funded by the Colombian Ministry of Agriculture (Ministerio de Agricultura y Desarrollo Rural de Colombia).
The authors declare that they have no competing interests.
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.