Non contiguous-finished genome sequence and description of Bacillus timonensis sp. nov.
Standards in Genomic Sciences volume 6, pages 346–355 (2012)
Bacillus timonensis strain 10403188T sp. nov. is the type strain of a proposed new species within the genus Bacillus. This strain, whose genome is described here, was isolated from the fecal flora of a healthy patient. B. timonensis is an aerobic Gram-negative rod shaped bacterium. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,632,049 bp long genome (1 chromosome but no plasmid) contains 4,610 protein-coding and 74 RNA genes, including 5 rRNA genes.
Bacillus timonensis strain 10403188T (= CSUR P162 = DSM 25372) is designated as the type strain of B. timonensis, a new Gram-negative aerobic, indole-positive bacillus that was isolated from the stool of a healthy Senegalese patient as part of a “culturomics” study aiming at cultivating individually all species within human feces.
To date, DNA-DNA hybridization and G+C content determination  remain the gold standard methods for the definition of bacterial species, despite the development of 16S rRNA PCR and sequencing which have deeply changed bacterial taxonomy . Over recent years, high throughput genome sequencing provided a wealth of genetic information . In an effort to include genomic data in bacterial taxonomy we recently used a polyphasic approach  that includes genomic data, MALDI-TOF spectrum and main phenotypic characteristics to describe new bacterial species [5,6].
Here we present a summary classification and a set of features for B. timonensis sp. nov. strain 10403188T together with the description of the complete genomic sequencing and annotation. These characteristics support the circumscription of the species B. timonensis.
The genus Bacillus (Cohn 1872) was created in 1872 . To date, this genus, mostly comprised of Gram-positive, motile, and spore-forming bacteria, is made of 256 species and 7 subspecies with validly published names . Members of the genus Bacillus are ubiquitous bacteria isolated from various environments including soil, fresh and sea water, food, and occasionally from humans in whom they are either pathogens, such as B. anthracis and B. cereus, or opportunists in immunocompromised patients . Apart from anthrax, caused by B. anthracis , and toxi-infections caused by B. cereus, Bacillus species may be involved in a variety of aspecific human infections, including cutaneous, ocular, central nervous system or bone infections, pneumonia, endocarditis and bacteremia .
Classification and features
A stool sample was collected from a healthy 16-year-old male Senegalese volunteer patient living in Dielmo (a rural village in the Guinean-Sudanian zone in Senegal), who was included in a research protocol. The patient gave an informed and signed consent, and the agreement of the National Ethics Committee of Senegal and the local ethics committee of the IFR48 (Marseille, France) was obtained under agreements 09-022 and 11-017). The fecal specimen was preserved at −80°C after collection and sent to Marseille. Strain 10403188 (Table 1) was isolated in June 2011 by cultivation on 5% sheep blood-enriched Brain Heart Infusion agar with (Becton Dickinson, Heidelberg, Germany). This strain exhibited a 98.2% nucleotide sequence similarity with Bacillus humi, the phylogenetically closest validated Bacillus species (Figure 1). This value was lower than the 98.7% 16S rRNA gene sequence threshold recommended by Stackebrandt and Ebers to delineate a new species without carrying out DNA-DNA hybridization .
Different growth temperatures (25, 30, 37, 45°C) were tested. Growth occurred at all tested temperatures, but optimal growth occurred between 30 and 37°C. Colonies were 3 mm in diameter on blood-enriched BHI agar. Growth of the strain was tested under anaerobic and microaerophilic conditions using GENbag anaer and GENbag microaer systems, respectively (BioMérieux), and in aerobic conditions, with or without 5% CO2. Growth was achieved in aerobic (with and without CO2) and microaerophilic conditions. No growth was observed in anaerobic conditions. Gram staining showed Gram negative bacilli (Figure 2). A motility test was positive. Cells grown on agar are sporulated and have a mean diameter of 0.66 µm (Figure 3).
Strain 10403188T exhibited oxidase activity but not catalase activity, and was positive for indole. Using API 50CH, a positive reaction was obtained for L-arabinose, D-lactose, D-melibiose, D-trehalose, D-saccharose, and D-turanose fermentation. A weak reaction was obtained for aesculin. Other tests were negative. Using API-ZYM, positive reactions were obtained for esterase, α-chimotrypsine, β-glucorinidase, and α- and β-glucosinidase. B. timonensis was susceptible to penicillin G, amoxicillin, vancomycin, gentamicin, erythromycin, doxycyclin, rifampicin, and ciprofloxacin but resistant to trimethoprim/sulfamethoxazole.
By comparison with B. humi, B. timonensis differed in Gram staining, in culture atmosphere, as B. humi was able to grow anaerobically, in catalase activity, in spore forming capacity, in indole production, and in carbohydrate metabolism, notably for arbutin, salicin, L-arabinose, melibiose, turanose, and trehalose .
Matrix-assisted laser-desorption/ionization time-of-flight (MALDI-TOF) MS protein analysis was carried out as previously described . Briefly, a pipette tip was used to pick one isolated bacterial colony from a culture agar plate, and to spread it as a thin film on a MTP 384 MALDI-TOF target plate (Bruker Daltonics, Leipzig, Germany). Four distinct deposits were done for strain 10403188 from four isolated colonies. Each smear was overlaid with 2µL of matrix solution (saturated solution of alpha-cyano-4-hydroxycinnamic acid) in 50% acetonitrile, 2.5% tri-fluoracetic-acid, and allowed to dry for five minutes. Measurements were performed with a Microflex spectrometer (Bruker). Spectra were recorded in the positive linear mode for the mass range of 2,000 to 20,000 Da (parameter settings: ion source 1 (IS1), 20 kV; IS2, 18.5 kV; lens, 7 kV). A spectrum was obtained after 675 shots at a variable laser power. The time of acquisition was between 30 seconds and 1 minute per spot. The four 10403188 spectra were imported into the MALDI BioTyper software (version 2.0, Bruker) and analyzed by standard pattern matching (with default parameter settings) against the main spectra of 3,769 bacteria including 129 spectra from 98 Bacillus species, notably B. humi, used as reference data, in the BioTyper database. The method of identification included the m/z from 3,000 to 15,000 Da. For every spectrum, 100 peaks at most were taken into account and compared with spectra in the database. A score enabled the presumptive identification and discrimination of the tested species from those in the database: a score > 2 with a validated species enabled the identification at the species level, a score > 1.7 but < 2 enabled the identification at the genus level; and a score < 1.7 did not enable any identification. For strain 10403188T, the obtained score was 1.2, thus suggesting that our isolate was not a member of a known species. We incremented our database with the spectrum from strain 10403188 (Figure 4). The spectrum was made available online in our free-access URMS database .
Genome sequencing information
Genome project history
The organism was selected for sequencing on the basis of its phylogenetic position and 16S rRNA similarity to other members of the genus Bacillus, and is part of a “culturomics” study of the human digestive flora aiming at isolating all bacterial species within human feces. It was the 60th genome of a Bacillus species and the first genome of Bacillus timonensis sp. nov. A summary of the project information is shown in Table 2. The Genbank accession number is CAET00000000 and consists of 146 contigs.
Growth conditions and DNA isolation
B. timonensis sp. nov. strain 10403188T, CSUR P162, DSM 25372, was grown aerobically on 5% sheep blood-enriched BHI agar at 37°. Four petri dishes were spread and growth from the plates was resuspended in 3x500µl of TE buffer and stored at 80°C. Then, 500µl of this suspension were thawed, centrifuged 3 minutes at 10,000 rpm and resuspended in 3x100µL of G2 buffer (EZ1 DNA Tissue kit, Qiagen). A first mechanical lysis was performed by glass powder on the Fastprep-24 device (Sample Preparation system, MP Biomedicals, USA) using 2x20 seconds cycles. DNA was then treated with 2.5µg/µL lysozyme (30 minutes at 37°C) and extracted using the BioRobot EZ1 Advanced XL (Qiagen). The DNA was then concentrated and purified using the Qiamp kit (Qiagen). The yield and the concentration was measured by the Quant-it Picogreen kit (Invitrogen) on the Genios Tecan fluorometer at 50ng/µl.
Genome sequencing and assembly
DNA (5 µg) was mechanically fragmented on a Hydroshear device (Digilab, Holliston, MA, USA) with an enrichment size at 3–4kb. The DNA fragmentation was visualized through the Agilent 2100 BioAnalyzer on a DNA labchip 7500 with an optimal size of 3.345kb. The library was constructed according to the 454 GS FLX Titanium paired-end protocol. Circularization and nebulization were performed and generated a pattern with an optimum at 492 bp. After PCR amplification through 15 cycles followed by double size selection, the single stranded paired end library was then quantified on the Quant-it Ribogreen kit (Invitrogen) on the Genios Tecan fluorometer at 339 pg/µL. The library concentration equivalence was calculated as 12,6E+08 molecules/µL. The library was stored at −20°C until further use.
The shotgun library was clonally amplified with 3cpb and the paired-end library was amplified with lower cpb (1 cpb) in 4 emPCR reactions with the GS Titanium SV emPCR Kit (Lib-L) v2 (Roche). The yields of the emPCR was 5.97% for the shotgun and 15.92% for the paired end as expected by the range of 5 to 20% from the Roche procedure.
Approximately 790,000 beads for a 1/4 region and 340,000 beads for a 1/8 region were loaded on the GS Titanium PicoTiterPlate PTP Kit 70×75 and sequenced with the GS FLX Titanium Sequencing Kit XLR70 (Roche). The run was performed overnight and then analyzed on the cluster through the gsRunBrowser and Newbler assembler (Roche). For the shotgun sequencing, 112,962 passed filter wells were obtained and generated 34.48Mb with a length average of 322 bp. For the shotgun sequencing, 213,882 passed filter wells were obtained and generated 50.6 Mb with a length average of 236 bp. The passed filter sequences were assembled Using Newbler with 90% identity and 40bp as overlap. The final assembly identified 11 scaffolds and 89 contigs (>1500bp) generating a genome size of 4.6 Mb.
Open Reading Frames (ORFs) were predicted using Prodigal  with default parameters but the predicted ORFs were excluded if they were spanning a sequencing gap region. The predicted bacterial protein sequences were searched against the GenBank database  and the Clusters of Orthologous Groups (COG) databases using BLASTP. The tRNAScanSE tool  was used to find tRNA genes, whereas ribosomal RNAs were found by using RNAmmer  and BLASTn against the GenBank database. ORFans were identified if their BLASTP E-value was lower than 1e-03 for alignment length greater than 80 amino acids. If alignment lengths were smaller than 80 amino acids, we used an E-value of 1e-05. Such parameter thresholds have already been used in previous works to define ORFans.
To estimate the mean level of nucleotide sequence similarity at the genome level between Bacillus species, we compared the ORFs only using BLASTN and the following parameters: a query coverage of ≥ 70% and a minimum nucleotide length of 100 bp.
The genome is 4,632,049 bp long (1 chromosome, but no plasmid) with a 37.30% GC content (Figure 5 and Table 3). Of the 4,684 predicted genes, 4,610 were protein-coding genes and 74 were RNAs. A total of 3,399 genes (75.56%) were assigned a putative function. Three hundred forty genes were identified as ORFans (7.4%). The remaining genes were annotated as hypothetical proteins. The properties and the statistics of the genome are summarized in Tables 3. The distribution of genes into COGs functional categories is presented in Table 4.
Comparison with the genomes from other Bacillus species
Genome sequences are currently available for more than 25 validly named Bacillus species. Here we compared the genome sequence of B. timonensis strain 10403188T with that of B. licheniformis strain ATCC 14580, the most closely related phylogenetic neighbor for which the genome sequence is available. The draft genome sequence of B. timonensis is larger than B. licheniformis (4.6 Mb and 4.2 Mb, respectively) but its G+C content is lower (37.30 and 46.19%, respectively). B. timonensis has more predicted genes than B. licheniformis (4,684 and 4,356, respectively), and more genes assigned to COGs (3,399 and 3,130, respectively). However, the distribution of genes into COG categories (Table 4) was highly similar in both genomes. In addition, B. timonensis shared a mean 86.10% (range 76.4-93%) sequence similarity with B. licheniformis at the genome level.
Although the degree of 16S rRNA similarity was elevated (98.2%) between strain 10403188 and B. humi strain DSM 16318, both strains exhibited several phenotypic and genomic differences, and we formally propose the creation of Bacillus timonensis sp. nov. that contains the strain 10403188T. This strain has been found in Senegal.
Description of Bacillus timonensis sp. nov.
Bacillus timonensis (tim.on.en′sis. L. gen. masc. n. timonensis, of Timone, the name of the hospital where strain 10403188T was cultivated.) Isolated from stool from an asymptomatic Senegalese patient. B. timonensis is an aerobic Gram-negative bacterium. Grows on axenic medium at 37°C in an aerobic atmosphere. Colonies were 3 mm in diameter on blood-enriched BHI agar. Cells grown on agar are sporulated and have a mean diameter of 0.66 µm. A positive reaction was obtained for L-arabinose, D-lactose, D-melibiose, D-trehalose, D-saccharose, and D-turanose fermentation. Positive reactions were obtained for oxidase, esterase, α-chimotrypsine, β-glucorinidase, and α- and β-glucosinidase activity. No catalase activity was exhibited. Positive for indole. By comparison with B. humi, B. timonensis differs in Gram staining, in culture atmosphere, as B. humi grows anaerobically, in catalase activity, in spore forming capacity, in indole production, and in carbohydrate metabolism, notably for arbutin, salicin, L-arabinose, melibiose, turanose, and trehalose. B. timonensis is susceptible to penicillin G, amoxicillin, vancomycin, gentamicin, erythromycin, doxycyclin, rifampicin, and ciprofloxacin but resistant to trimethoprim/sulfamethoxazole. Motile. The G+C content of the genome is 37.30%. The 16S rRNA and genome sequences are deposited in GenBank under accession numbers JF824810 and CAET00000000, respectively. The type strain 10403188T (= CSUR P162 = DSM 25372) was isolated from the fecal flora of a healthy patient from Senegal.
Rossello-Mora R. DNA-DNA Reassociation Methods Applied to Microbial Taxonomy and Their Critical Evaluation. In: Stackebrandt E (ed), Molecular Identification, Systematics, and population Structure of Prokaryotes. Springer, Berlin, 2006, p. 23–50.
Stackebrandt E, Ebers J. Taxonomic parameters revisited: tarnished gold standards. Microbiol Today 2006; 33:152–155.
Welker M, Moore ER. Applications of whole-cell matrix-assisted laser-desorption/ionization time-of-flight mass spectrometry in systematic microbiology. Syst Appl Microbiol 2011; 34:2–11. PubMed http://dx.doi.org/10.1016/j.syapm.2010.11.013
Tindall BJ, Rossello-Mora R, Busse HJ, Ludwig W, Kämpfer P. Notes on the characterization of prokaryote strains for taxonomic purposes. Int J Syst Evol Microbiol 2010; 60:249–266. PubMed http://dx.doi.org/10.1099/ijs.0.016949-0
Lagier JC, El Karkouri K, Nguyen TT, Armougom F, Raoult D, Fournier PE. Non-contiguous finished genome sequence and description of Anaerococcus senegalensis sp. nov. Stand Genomic Sci 2012; 6:116–125. PubMed http://dx.doi.org/10.4056/sigs.2415480
Mishra AK, Lagier JC, Robert C, Raoult D, Fournier PE. 2012. Non-contiguous finished genome sequence and description of Clostridium senegalense sp. nov. Stand.Genomic.Sci. In press 5. Cohn F. Untersuchungen über Bakterien. Beitrage zur Biologie der Pflanzen Heft 1872; 1:127–224.
Jernigan JA, Stephens DS, Ashford DA, Omenaca C, Topiel MS, Galbraith M, Tapper M, Fisk TL, Zaki S, Popovic T, et al. Bioterrorism-related inhalational anthrax: the first 10 cases reported in the United States. Emerg Infect Dis 2001; 7:933–944. PubMed http://dx.doi.org/10.3201/eid0706.010604
Bottone EJ. Bacillus cereus, a volatile human pathogen. Clin Microbiol Rev 2010; 23:382–398. PubMed http://dx.doi.org/10.1128/CMR.00073-09
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archae, Bacteria, and Eukarya. Proc Natl Acad Sci USA 1990; 87:4576–4579. PubMed http://dx.doi.org/10.1073/pnas.87.12.4576
Gibbons NE, Murray RGE. Proposals Concerning the Higher Taxa of Bacteria. Int J Syst Bacteriol 1978; 28:1–6. http://dx.doi.org/10.1099/00207713-28-1-1
Garrity GM, Holt JG. The Road Map to the Manual. In: Garrity GM, Boone DR, Castenholz RW (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 1, Springer, New York, 2001, p. 119–169.
Murray RGE. The Higher Taxa, or, a Place for Everything…? In: Holt JG (ed), Bergey’s Manual of Systematic Bacteriology, First Edition, Volume 1, The Williams and Wilkins Co., Baltimore, 1984, p. 3134.
List Editor. List of new names and new combinations previously effectively, but not validly, published. List no. 132. Int J Syst Evol Microbiol 2010; 60:469–472. http://dx.doi.org/10.1099/ijs.0.022855-0
Ludwig W, Schleifer KH, Whitman WB. Class I. Bacilli class nov. In: De Vos P, Garrity G, Jones D, Krieg NR, Ludwig W, Rainey FA, Schleifer KH, Whitman WB (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 3, Springer-Verlag, New York, 2009, p. 19–20.
Skerman VBD, Sneath PHA. Approved list of bacterial names. Int J Syst Bact 1980; 30:225–420. http://dx.doi.org/10.1099/00207713-30-1-225
Prevot AR. Dictionnaire des bactéries pathogens. In: Hauduroy P, Ehringer G, Guillot G, Magrou J, Prevot AR, Rosset, Urbain A (eds). Paris, Masson, 1953, p.1–692.
Fischer A. Untersuchungen über bakterien. Jahrbücher für Wissenschaftliche Botanik 1895; 27:1–163.
Gibson T, Gordon RE. Genus I. Bacillus Cohn 1872, 174; Nom. gen. cons. Nomencl. Comm. Intern. Soc. Microbiol. 1937, 28; Opin. A. Jud. Comm. 1955, 39. In: Buchanan RE, Gibbons NE (eds), Bergey’s Manual of Determinative Bacteriology, Eighth Edition, The Williams and Wilkins Co., Baltimore, 1974, p. 529–550.
Mathews WC, Caperna J, Toerner JG, Barber RE, Morgenstern H. Neutropenia is a risk factor for Gram-negative Bacillus bacteremia in human immunodeficiency virus-infected patients: results of a nested case-control study. Am J Epidemiol 1998; 148:1175–1183. PubMed http://dx.doi.org/10.1093/oxfordjournals.aje.a009606
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000; 25:25–29. PubMed http://dx.doi.org/10.1038/75556
Stackebrandt E, Ebers J. Taxonomic parameters revisited: tarnished gold standards. Microbiol Today 2006; 33:152–155.
Heyrman J, Rodriguez-Diaz M, Devos J, Felske A, Logan NA, De Vos P. Bacillus arenosi sp. nov., Bacillus arvi sp. nov., Bacillus humi sp. nov., isolated from soil. Int J Syst Evol Microbiol 2005; 55:111–117. PubMed http://dx.doi.org/10.1099/ijs.0.63240-0
Seng P, Drancourt M, Gouriet F, La SB, Fournier PE, Rolain JM, Raoult D. Ongoing revolution in bacteriology: routine identification of bacteria by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Clin Infect Dis 2009; 49:543–551. PubMed http://dx.doi.org/10.1086/600885
Benson DA, Karsch-Mizrachi I, Clark K, Lipman DJ, Ostell J, Sayers EW. GenBank. Nucleic Acids Res 2012; 40:D48–D53. PubMed http://dx.doi.org/10.1093/nar/gkr1202
Lowe TM, Eddy SR. t-RNAscan-SE: a program for imroved detection of transfer RNA gene in genomic sequence. Nucleic Acids Res 1997; 25:955–964. PubMed
Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 2007; 35:3100–3108. PubMed http://dx.doi.org/10.1093/nar/gkm160
About this article
Cite this article
Kokcha, S., Mishra, A.K., Lagier, JC. et al. Non contiguous-finished genome sequence and description of Bacillus timonensis sp. nov.. Stand in Genomic Sci 6, 346–355 (2012). https://doi.org/10.4056/sigs.2776064
- Bacillus timonensis