- Open Access
Non-contiguous finished genome sequence and description of Anaerococcus senegalensis sp. nov.
Standards in Genomic Sciences volume 6, pages116–125 (2012)
Anaerococcus senegalensis strain JC48T sp. nov. is the type strain of A. senegalensis sp. nov. a new species within the genus Anaerococcus. This strain whose genome is described here was isolated from the fecal flora of a healthy patient. A. senegalensis is an obligate anaerobic coccus. Here we describe the features of this organism together with the complete genome sequence and annotation. The 1,790,835 bp long genome (1 chromosome but no plasmid) contains 1,721 protein-coding and 53 RNA genes including 5 rRNA genes
Anaerococcus senegalensis strain JC48T (= CSUR P156 = DSM25366) is the type strain of A. senegalensis sp. nov. This bacterium is a Gram-positive, anaerobic, indole-negative coccus that was isolated from the stool of a healthy Senegalese patient as part of a “culturomics” study aimed at cultivating individually all species within human feces.
Defining bacterial species is a matter of debate. This is notably due to the elevated cost and poor reproducibility and inter-laboratory comparability of the “gold standard” DNA-DNA hybridization and G+C content determination . In contrast, the development of PCR and sequencing methods is now widely available and cost-effective, which profoundly changes the way Archaea, Bacteria and are classified. Using 16S rRNA sequences with internationally-validated cutoff values enabled the taxonomic classification or reclassification of hundreds of taxa . More recently, high throughput genome sequencing and mass spectrometric analyses of bacteria gave unprecedented access to a wealth of genetic and proteomic information . As a consequence, we propose to use a polyphasic approach  to describe new bacterial taxa that includes their genome sequence, MALDI-TOF spectrum and main phenotypic characteristics (habitat, Gram-stain reaction, culture and metabolic characteristics, and when applicable, pathogenicity).
The genus Anaerococcus (Ezaki et al. 2001) was created in 2001 . To date, this genus, comprised of saccharolytic and butyrate-producing anaerobic and non-motile Gram-positive cocci, contains seven species including A. hydrogenalis (Ezaki et al. 1990) Ezaki et al. 2001, A. lactolyticus (Li et al. 1992) Ezaki et al. 2001, A. murdochii (Song et al. 2010), A. octavius (Murdoch et al. 1997) Ezaki et al. 2001, A. prevotii (Foubert and Douglas 1948) Ezaki et al. 2001, A. tetradius (Ezaki et al. 1983) Ezaki et al. 2001, and A. vaginalis (Li et al. 1992) Ezaki et al. 2001. Members of the genus Anaerococcus have mainly been isolated from the human vagina, but have also occasionally been identified in the nasal cavity, on the skin, and various infectious processes including ovarian, peritoneal, sacral, digital and cervical abscesses, vaginoses, bacteremias, foot ulcers, a sternal wound, and a knee arthritis [5–9]. In addition, uncultured bacteria with 16S rRNA sequences highly similar to members of the Anaerococcus genus have been detected in metagenomes from the human skin flora . However, to the best of our knowledge, our report is the first to describe the isolation of a member of the genus Anaerococcus from the normal fecal flora.
Here we present a summary classification and a set of features for A. senegalensis sp. nov. strain JC48T together with the description of the complete genomic sequencing and annotation. These characteristics support the circumscription of the species A. senegalensis.
Classification and features
A stool sample was collected from a healthy 16-year-old male Senegalese volunteer patient living in Dielmo (a rural village in the Guinean-Sudanian zone in Senegal), who was included in a research protocol. The patient gave an informed and signed consent, and the agreement of the National Ethics Committee of Senegal and the local ethics committee of the IFR48 (Marseille, France) were obtained under agreement 09-022). The fecal specimen was preserved at −80°C after collection and sent to Marseille. Strain JC48 (Table 1) was isolated in June 2011 by anaerobic cultivation on 5% sheep blood-enriched Columbia agar (BioMerieux, Marcy l’Etoile, France). This strain exhibited two distinct 16S rRNA sequences, with a 97.8% nucleotide sequence similarity with A. vaginalis, the phylogenetically closest validated Anaerococcus species (Figure 1). This value was lower than the 98.7% 16S rRNA gene sequence threshold recommended by Stackebrandt and Ebers to delineate a new species without carrying out DNA-DNA hybridization . By comparison to the GenBank database  strain JC48 also exhibited nucleotide sequence similarities greater than 99% with uncultured bacterial clones detected in a metagenomic study of the human skin flora . These bacteria are most likely classified within the same species as strain JC48 (Figure 1).
Different growth temperatures (25, 30, 37, 45°C) were tested; no growth occurred at 25°C and 45°C, growth occurred between 30 and 37°C, and optimal growth was observed at 37°C. Colonies were 0.5 mm to 1 mm in diameter on blood-enriched Columbia agar and Brain Heart Infusion (BHI) agar. Growth of the strain was tested under anaerobic and microaerophilic conditions using GENbag anaer and GENbag microaer systems, respectively (BioMérieux), and in the presence of air, with or without 5% CO2, and in aerobic conditions. Optimal growth was achieved anaerobically. Weak growth was observed in microaerophilic conditions and with 5% CO2.. No growth was observed in aerobic conditions. Gram staining showed Gram positive cocci. A motility test was negative. Cells grown on agar are Gram-positive (Figure 2) and have a mean diameter of 0.87 µm, and are mostly grouped in pairs, short chains or small clumps (Figure 3).
Strain 48T exhibited catalase activity but not oxidase activity. Using API Rapid ID 32A, a positive reaction was obtained for urease, arginine dihydrolase, indole production, β glucuronidase, mannose fermentation, alkaline phosphatase, arginine arylamidase, leucyl glycine arylamidase, histidine arylamidase. A weak reaction was obtained for pyroglutamyl arylamidase. A. senegalensis is susceptible to penicillin G, imipeneme, amoxicillin + clavulanic acid, vancomycin, clindamycin and metronidazole. By comparison with A. vaginalis, strain 48T differed in urease and pyroglutamyl arylamidase production .
Matrix-assisted laser-desorption/ionization time-of-flight (MALDI-TOF) MS protein analysis was carried out as previously described . Briefly, a pipette tip was used to pick one isolated bacterial colony from a culture agar plate, and to spread it as a thin film on a MTP 384 MALDI-TOF target plate (Bruker Daltonik, Leipzig, Germany). Four distinct deposits were done for strain JC48 from four isolated colonies. Each smear was overlaid with 2 µL of matrix solution (saturated solution of alpha-cyano-4-hydroxycinnamic acid) in 50% acetonitrile, 2.5% tri-fluoracetic-acid, and allowed to dry for five minutes. Measurements were performed with a Microflex spectrometer (Bruker). Spectra were recorded in the positive linear mode for the mass range of 2,000 to 20,000 Da (parameter settings: ion source 1 (IS1), 20 kV; IS2, 18.5 kV; lens, 7 kV). A spectrum was obtained after 675 shots at a variable laser power. The time of acquisition was between 30 seconds and 1 minute per spot. The four JC48 spectra were imported into the MALDI BioTyper software (version 2.0, Bruker) and analyzed by standard pattern matching (with default parameter settings) against the main spectra of 2,843 bacteria, including spectra from the seven validated Anaerococcus species used as reference data, in the BioTyper database. The method of identification included the m/z from 3,000 to 15,000 Da. For every spectrum, 100 peaks at most were taken into account and compared with the spectra in the database. A score enabled the identification, or not, from the tested species: a score ≥ 2 with a validated species enabled the identification at the species level; a score ≥ 1.7 but < 2 enabled the identification at the genus level; and a score < 1.7 did not enable any identification. For strain JC48, the obtained score was 1.4, thus suggesting that our isolate was not a member of a known species. We incremented our database with the spectrum from strain JC48 (Figure 4).
Genome sequencing information
Genome project history
The organism was selected for sequencing on the basis of its phylogenetic position and 16S rRNA similarity to other members of the genus Anaerococcus, and is part of a “culturomics” study of the human digestive flora aiming at isolating all bacterial species within human feces. It was the second genome of an Anaerococcus species and the first genome of Anaerococcus senegalensis sp. nov. A summary of the project information is shown in Table 2. The Genbank accession number is PRJE70539 and consists of 39 contigs. Table 2 shows the project information and its association with MIGS version 2.0 compliance .
Growth conditions and DNA isolation
A. senegalensis sp. nov. strain JC48T, CSUR P156, was grown anaerobically on 5% sheep blood-enriched Columbia agar at 37°C. Four petri dishes were spread and resuspended in 3×100µl of G2 buffer (EZ1 DNA Tissue kit, Qiagen). A first mechanical lysis was performed by glass powder on the Fastprep-24 device (Sample Preparation system; MP Biomedicals, USA) using 2×20 seconds cycles. DNA was then treated with 2.5µg/µL lysozyme (30 minutes at 37°C) and extracted through the BioRobot EZ 1 Advanced XL (Qiagen). The DNA was then concentrated and purified on a Qiamp kit (Qiagen). The yield and the concentration was measured by the Quant-it Picogreen kit (Invitrogen) on the Genios_Tecan fluorometer at 50ng/µl.
Genome sequencing and assembly
DNA (5 µg) was mechanically fragmented on a Hydroshear device (Digilab, Holliston, MA, USA) with an enrichment size at 3–4 kb. The DNA fragmentation was visualized through the Agilent 2100 BioAnalyzer on a DNA labchip 7500 with an optimal size of 3.785 kb. The library was constructed according to the 454 GS FLX Titanium paired end protocol. Circularization and nebulization were performed and generated a pattern with an optimal at 614 bp. After PCR amplification through 15 cycles followed by double size selection, the single stranded paired end library was then quantified on the Quant-it Ribogreen kit (Invitrogen) on the Genios Tecan fluorometer at 96 pg/µL. The library concentration equivalence was calculated as 2,87E+08 molecules/µL. The library was stored at −20°C until further use.
The library was clonally amplified with 0.5 cpb and 1 cpb respectively in 2×8 emPCR reactions with the GS Titanium SV emPCR Kit (Lib-L) v2 (Roche). The yields of the emPCR were lower than expected at 3.92%, compared to the range of 5 to 20% from the Roche procedure.
Approximately 790,000 beads were loaded on the GS Titanium PicoTiterPlate PTP Kit 70×75 and sequenced with the GS FLX Titanium Sequencing Kit XLR70 (Roche). The run was performed overnight and then analyzed on the cluster through the gsRunBrowser and Newbler assembler (Roche). A total of 310,172 passed filter wells were obtained and generated 99 Mb with a length average of 320 bp. The passed filter sequences were assembled using Newbler with 90% identity and 40bp as overlap. The final assembly identified 4 scaffolds and 39 contigs (>100bp).
Open Reading Frames (ORFs) were predicted using Prodigal  with default parameters but the predicted ORFs were excluded if they were spanning a sequencing GAP region. The predicted bacterial protein sequences were searched against the GenBank database and the Clusters of Orthologous Groups (COG) database using BLASTP. The tRNAScanSE tool  was used to find tRNA genes, whereas ribosomal RNAs were found by using RNAmmer  and BLASTn against the GenBank. ORFans were identified if their BLASTP E-value was lower than 1e-03 for alignment length greater than 80 amino acids. If alignment lengths were smaller than 80 amino acids, we used an E-value of 1e-05. Such parameter thresholds have already been used in previous works to define ORFans. To estimate the mean level of nucleotide sequence similarity at the genome level between Anaerococcus species, we compared the ORFs only using BLASTN at a query coverage of ≥ 70% and a minimum nucleotide length of 100 bp.
The genome is 1,790,835 bp long (one chromosome, no plasmid) with a 28.56% G + C content (Table 3). Of the 1,774 predicted genes, 1,721 were protein-coding genes, and 53 were RNAs. Two distinct copies of 16S rRNA, differing by two point mutations, were identified. A total of 1,296 genes (73.0%) were assigned a putative function. Fifty-one genes were identified as ORFans (3%). The remaining genes were annotated as hypothetical proteins. The distribution of genes into COGs functional categories is presented in Table 4. The properties and the statistics of the genome are summarized in Tables 3 and 4.
Comparison with Anaerococcus prevotii
To date, the genome from A. prevotii strain PC1T is the only genome from Anaerococcus species that has been sequenced . By comparison with A. prevotii, A. senegalensis exhibited a lower G + C content (35.64% vs 28.56%, respectively) and a smaller number of genes (1,913 vs 1,774) and genes with peptide signals (337 vs 142). In contrast, A. senegalensis had higher ratios of genes per Mb (957 vs 990) and genes assigned to COGs (74.28% vs 79. 26%). However, the distribution of genes into COG categories (Table 4) was highly similar in both genomes.
In addition, A. senegalensis shared mean nucleotide sequence similarities at the genome level of 76.6% (range 62.5-100%) and 75.4% (range 62.7-100%) with A. prevotii strains PC1T and ACS065-V-Col13 (GenBank accession number AEXM00000000), respectively.
On the basis of phenotypic, phylogenetic and genomic analyses, we formally propose the creation of Anaerococcus senegalensis sp. nov. that contains the strain JC48T. This bacterium has been found in Senegal.
Description of Anaerococcus senegalensis sp. nov.
Anaerococcus senegalensis (se.ne.gal.e′n.sis. L. gen. masc. n. senegalensis, pertaining to, or originating from Senegal, the country from which the specimen was isolated).
Colonies are 0.5 mm to 1 mm in diameter on blood-enriched Columbia agar and Brain Heart Infusion (BHI) agar. Cells are coccoid with a mean diameter of 0.87 µm, occurring mostly in pairs, short chains or small clumps. Optimal growth is achieved anaerobically. Weak growth is observed in microaerophilic conditions and with 5% CO2. No growth is observed in aerobic conditions. Growth occurs between 30–37°C, with optimal growth observed at 37°C, in BHI medium + 5% NaCl. Cells stain Gram-positive, are non-endospore-forming, and non-motile. Catalase, urease, arginine dihydrolase, β glucuronidase, alkaline phosphatase, arginine arylamidase, leucyl glycine arylamidase, and histidine arylamidase activity are present. Mannose fermentation and indole production are also present. A weak reaction is obtained for pyroglutamyl arylamidase. Oxidase activity is absent. Cells are susceptible to penicillin G, imipeneme, amoxicillin + clavulanic acid, vancomycin, clindamycin and metronidazole. The G + C content of the genome is 28.56%.
The type strain JC48T (= CSUR P156 = DSM 25366) was isolated from the fecal flora of a healthy patient in Senegal.
European Molecular Biology Laboratory
National Center for Biotechnology Information (Bethesda, MD, USA)
Ribosomal Database Project (East Lansing, MI, USA)
Rossello-Mora R. DNA-DNA reassociation methods applied to microbial taxonomy and their critical evaluation. In: Stackebrandt E (ed.), Molecular Identification, Systematics, and population Structure of Prokaryotes. Springer, Berlin, 2006, p. 23–50.
Stackebrandt E, Ebers J. Taxonomic parameters revisited: tarnished gold standards. Microbiol Today 2006; 33:152–155.
Welker M, Moore ER. Applications of whole-cell matrix-assisted laser-desorption/ionization time-of-flight mass spectrometry in systematic microbiology. Syst Appl Microbiol 2011; 34:2–11. PubMed http://dx.doi.org/10.1016/j.syapm.2010.11.013
Tindall BJ, Rossello-Mora R, Busse HJ, Ludwig W, Kämpfer P. Notes on the characterization of prokaryote strains for taxonomic purposes. Int J Syst Evol Microbiol 2010; 60:249–266. PubMed http://dx.doi.org/10.1099/ijs.0.016949-0
Ezaki T, Kawamura Y, Li N, Li ZY, Zhao L, Shu S. Proposal of the genera Anaerococcus gen. nov., Peptoniphilus gen. nov. and Gallicola gen. nov. for members of the genus Peptostreptococcus. Int J Syst Evol Microbiol 2001; 51:1521–1528. PubMed
Jain S, Bui V, Spencer C, Yee L. Septic arthritis in a native joint due to Anaerococcus prevotii. J Clin Pathol 2007; 61:775–776. PubMed http://dx.doi.org/10.1136/jcp.2007.053421
Song Y, Liu C, Finegold SM. Peptoniphilus gorbachii sp. nov., Peptoniphilus olsenii sp. nov., and Anaerococcus murdochii sp. nov. isolated from clinical specimens of human origin. J Clin Microbiol 2007; 45:1746–1752. PubMed http://dx.doi.org/10.1128/JCM.00213-07
La Scola B, Fournier PE, Raoult D. Burden of emerging anaerobes in the MALDI-TOF and 16S rRNA gene sequencing era. Anaerobe 2011; 17:106–112. PubMed http://dx.doi.org/10.1016/j.anaerobe.2011.05.010
Pépin J, Deslandes S, Giroux G, Sobéla F, Khonde N, Diakité S, Demeule S, Labbé AC, Carrier N, Frost E. The complex vaginal flora of west african women with bacterial vaginosis. PLoS ONE 2011; 6:e25082. PubMed http://dx.doi.org/10.1371/journal.pone.0025082
Grice EA, Kong HH, Conlan S, Deming CB, Davis J, Young AC, NISC Comparative Sequencing Program, Bouffard GG, Blakesley RW, Murray PR, Green ED, et al. Topographical and temporal diversity of the human skin microbiome. Science 2009; 324:1190–1192. PubMed http://dx.doi.org/10.1126/science.1171700
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eukarya. Proc Natl Acad Sci USA 1990; 87:4576–4579. PubMed http://dx.doi.org/10.1073/pnas.87.12.4576
Gibbons NE, Murray RGE. Proposals Concerning the Higher Taxa of Bacteria. Int J Syst Bacteriol 1978; 28:1–6. http://dx.doi.org/10.1099/00207713-28-1-1
Garrity GM, Holt JG. The Road Map to the Manual. In: Garrity GM, Boone DR, Castenholz RW (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 1, Springer, New York, 2001, p. 119–169.
Murray RGE. The Higher Taxa, or, a Place for Everything…? In: Holt JG (ed), Bergey’s Manual of Systematic Bacteriology, First Edition, Volume 1, The Williams and Wilkins Co., Baltimore, 1984, p. 31–34.
List Editor. List of new names and new combinations previously effectively, but not validly, published. List no. 132. Int J Syst Evol Microbiol 2010; 60:469–472. http://dx.doi.org/10.1099/ijs.0.022855-0
Rainey FA. Class II. Clostridia class nov. In: De Vos P, Garrity G, Jones D, Krieg NR, Ludwig W, Rainey FA, Schleifer KH, Whitman WB (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 3, Springer-Verlag, New York, 2009, p. 736.
Skerman VBD, Sneath PHA. Approved list of bacterial names. Int J Syst Bacteriol 1980; 30:225–420. http://dx.doi.org/10.1099/00207713-30-1-225
Prevot AR. In: Hauduroy P, Ehringer G, Guillot G, et al. (eds.), Dictionnaire des bactéries pathogènes, Second Edition, Masson, Paris, 1953, p. 1–692.
Garrity GM, Holt J. Taxonomic outline of the Archaea and Bacteria. In: Garrity GM, Boone DR, Castenholz RW (eds.), Bergey’s Manual of Systematic Bacteriology, Second Edition, Springer-Verlag, New York, 2003, p.155–166.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000; 25:25–29. PubMed http://dx.doi.org/10.1038/75556
Seng P, Drancourta M, Gouriet F, La Scola B, Fournier PE, Jean Rolain M, Raoult D. Ongoing revolution in bacteriology: routine identification of bacteria by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Clin Infect Dis 2009; 49:543–551. PubMed http://dx.doi.org/10.1086/600885
Lowe TM, Eddy SR. t-RNAscan-SE: a program for improved detection of transfer RNA gene in genomic sequence. Nucleic Acids Res 1997; 25:955–964. PubMed http://dx.doi.org/10.1093/nar/25.5.955
Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW. Complete genome sequence of Anaerococcus prevotii type strain (PC1). Nucleic Acids Res 2007; 35:3100–3108. PubMed http://dx.doi.org/10.1093/nar/gkm160
Labutti K, Pukall R, Steenblock K, Glavina Del Rio T, Tice H, Copeland A, Cheng JF, Lucas S, Chen F, Nolan M. Complete genome sequence of type strain (PC1). Stand Genomic Sci 2009; 1:159–165. PubMed http://dx.doi.org/10.4056/sigs.24194
GenBank database. http://www.ncbi.nlm.nih.gov/genbank