- Extended genome report
- Open Access
Non contiguous-finished genome sequence and description of Microbacterium gorillae sp. nov.
Standards in Genomic Sciencesvolume 11, Article number: 32 (2016)
Strain G3T (CSUR P207 = DSM 26203) was isolated from the fecal sample of a wild gorilla (Gorilla gorilla subsp gorilla) from Cameroon. It is a Gram-positive, facultative anaerobic short rod. This strain exhibits a 16S rRNA sequence similarity of 98.2 % with Microbacterium thalassium, the closest validly published Microbacterium species and member of the family Microbacteriaceae. Moreover, it shows a low MALDI-TOF-MS score (1.1 to 1.3) that does not allow any identification. Thus, it is likely that this strain represents a new species. Here we describe the phenotypic features of this organism, the complete genome sequence and annotation. The 3,692,770 bp long genome (one chromosome but no plasmid) contains 3,505 protein-coding and 61 RNA genes, including 4 rRNA genes. In addition, digital DNA-DNA hybridization values for the genome of the strain G3T against the closest Microbacterium genomes range between 19.7 to 20.5, once again confirming its new status as a new species. On the basis of these polyphasic data, consisting of phenotypic and genomic analyses, we propose the creation of Microbacterium gorillae sp. nov. that contains the strain G3T.
Strain G3T (= CSUR P207 = DSM 26203) is the type strain of Microbacterium gorillae sp. nov. This bacterium is a Gram-positive, non-spore-forming, indole-negative, facultative anaerobic rod shaped bacillus. It was isolated from the feces of western lowland gorilla in Cameroon as part of a culturomics study to describe the bacterial communities of the gorilla gut . By applying a large variety of culture conditions, culturomics allowed previously the isolation of numerous new bacterial species from gorilla fecal samples .
Furthermore, since the creation of the genus Microbacterium by Orla-Jensenin (1919)  to date, 91 bacterial species belonging to this genus have been validly published . These species are Gram-positive and non-endospore-forming bacteria. Many studies have described Microbactertium species in diverse origins including human clinical specimens, soil, sea sediments, plants and hairspray [4–7].
In this report, we present a summary classification, phenotypic features for M. gorillae sp. nov. strain G3T, together with the description of the complete genome sequence and annotation. These characteristics support the circumscription of the species M. gorillae .
Classification and features
Information about the fecal sample collection and conservation are described previously . Strain G3T (Table 1) was isolated in January 2012 as part of a culturomics study  by cultivation on Columbia agar supplemented with sheep blood (BioMérieux, Craponne, France).
When compared to sequences available in GenBank, the 16S rRNA gene sequence of M. gorillae strain G3T (GenBank accession number JX650056) exhibited an identity of 98.2 % with Microbacterium thalassium , the closest validly published Microbacterium species. This value was equal to the percentage of 16S rRNA gene sequence threshold recommended by Meier-Kolthoff et al. for class Actinobacteria to delineate a new species without carrying out DNA-DNA hybridization with maximum error probability of 0.1 % . Figure 1 presents the 16S rRNA based tree for the strain G3T and other Microbacterium species.
Different growth temperatures (20, 25, 30, 37, 45 °C) were tested. Growth occurred between 25 °C and 37 °C, but the optimal growth was observed at 25 °C, 24 h after inoculation. No growth occurred at 20 and 45 °C. Colonies were 0.8 mm in diameter, appear as gray color on Columbia agar supplemented with sheep blood. Growth of the strain was tested under anaerobic and microaerophilic conditions using GENbag anaer and GENbag microaer systems, respectively (BioMérieux), and under aerobic conditions, with or without 5 % CO2. Growth was achieved under aerobic (with and without CO2), microaerophilic and anaerobic conditions. Gram staining showed Gram positive short bacilli (Fig. 2, left panel). A motility test with API M medium (BioMérieux) produced a negative result. Cells grown on agar do not sporulate and the rods have a mean length of 1 μm and a mean width of 0.5 μm. Both the length and the diameter were determined by negative staining transmission electron microscopy (Fig. 2, right panel).
Strain G3T exhibited catalase activity but not oxidase activity using ID color catalase and oxidase reagent, respectively (BioMérieux). In assays with API 50CH system (BioMérieux), strain G3T produced acid from esculin, D-cellobiose, D-maltose, D-lactose, D-mannose, D-mannitol, D-saccharose, D-trehalose and gentiobiose. By contrast, acid production was not observed for glycerol, erythritol, D-arabinose, L-arabinose, D-ribose, D-xylose, L-xylose, D-adonitol, methyl-αD-xylopyranoside, D-galactose, D-glucose, L-fructose, L-sorbose, L-rhamnose, dulcitol, inositol, D-sorbitol, methyl-αD-mannopyranoside, Methyl-αD-glucopyranoside, xylitol, D-tagatose, D-turanose, D-lyxose, D-fucose, L-fucose, D-arabitol, L-arabitol, potassium gluconate, potassium 2-cetogluconate, potassium 5-cetogluconate, D-melezitose, D-raffinose, Glycogen, N-acetylglucosamin, amygdalin, arbutin, salicin and hydrolysis of starch. Using APIZYM, positive enzyme activities were observed for esterase (C4), esterase lipase (C8), leucine aramidase, phosphatase acid, naphtol-AS-BI-phosphohydrolase, α-mannosidase, α- glucosidase and N-acetyl-β-glucosaminidase. Negative results for lipase (C14), phosphatase alcalin, valine arylamidase, cystine arylamidase, trypsin, α-chymotrypsin, α-galactosidase, β – galactosidase, β-glucosidase, β-glucuronidase, β-glucosidase, and α-fucosidase.
M. gorillae is susceptible to amoxicillin (25 μg), erythromycin (15UI), doxycyclin (30UI), rifampicin (30 μg), vancomycin (50 μg), amoxicillin-clavulanic acid (20 μg + 10 μg), trimethoprim-sulfamethoxazole (1.25 μg / 23.75 μg) and imipenem (10 μg) but resistant to ciprofloxacin (5 μg) and gentamycin (15 μg).
Extended feature descriptions
Matrix-assisted laser-desorption/ionization time-of-flight (MALDI-TOF) MS protein analysis was carried out as previously described  using a Microflex spectrometer (Bruker Daltonics, Leipzig, Germany). Twelve distinct deposits were done for strain G3T from 12 isolated colonies. Two microliters of matrix solution (saturated solution of alpha-cyano-4-hydroxycinnamic acid) in 50 % acetronitrile and 2.5 % trifluoroacetic-acid were distributed on each smear and submitted at air drying for five minutes. Then, the spectra from the 12 different colonies were imported into the MALDI BioTyper software (version 2.0, Bruker) and analyzed by standard pattern matching (with default parameter settings) against 5,626 bacterial spectra including 43 spectra from 33 Microbacterium species, used as reference data, in the BioTyper database. Briefly, a score ≥ 2 with a species with a validly published name provided allows the identification at the species level, a score ≥ 1.7 but < 2 allows the identification at the genus level; and a score < 1.7 does not allow any identification. For strain G3T, no good score was obtained, suggesting that our isolate was not a member of any known species. We incremented our database with the spectrum from strain G3T (Additional file 2: Figure S1). The gel view highlighted spectrum differences with other Microbacterium species (Additional file 3: Figure S2).
Genome sequencing information
Genome project history
According to phenotypic characteristics of this strain and MALDI-TOF result and because of the low16S rRNA similarity to other members of the genus Microbacterium , it is likely that the strain represents a new species and thus it was chosen for genome sequencing. It was the 20th genome of a Microbacterium species (Genomes Online Database) and the first genome of Microbacterium gorillae sp. nov. A summary of the project information is shown in Table 2. The GenBank accession number is CDAR00000000 and consists of 14 contigs. Table 2 shows the project information and its association with MIGS version 2.0 compliance .
Growth conditions and genomic DNA preparation
Microbacterium gorillae sp.nov strain G3T (= CSUR P207 = DSM 26203) was grown aerobically on 5 % sheep blood-enriched Columbia agar (BioMérieux) at 25 °C. Bacteria grown on four Petri dishes were resuspended in 3x500μl of TE buffer and stored at 80 °C. Then, 500 μl of this suspension were thawed, centrifuged 3 min at 10,000 rpm and resuspended in 3x100μL of G2 buffer (EZ1 DNA Tissue kit, Qiagen). A first mechanical lysis was performed by glass powder on the Fastprep-24 device (Sample Preparation system, MP Biomedicals, USA) using 2x20 s cycles. DNA was then treated with 2.5 μg/μL lysozyme (30 min at 37 °C) and extracted using the BioRobot EZ1 Advanced XL (Qiagen). The DNA was then concentrated and purified using the Qiamp kit (Qiagen). The yield and the concentration was measured by the Quant-it Picogreen kit (Invitrogen) on the Genios Tecan fluorometer at 50 ng/μl.
Genome sequencing and assembly
Genomic DNA of M. gorillae was sequenced on the MiSeq Technology (Illumina Inc, San Diego, CA, USA) with the 2 applications: paired end and mate paired. The gDNA was barcoded in order to be mixed with 11 others projects with the Nextera Mate Pair sample prep kit (Illumina) and with 17 others projects with the Nextera XT DNA sample prep kit (Illumina).
gDNA was quantified by a Qubit assay with the high sensitivity kit (Life technologies, Carlsbad, CA, USA) to 46.7 ng/μlTo prepare the paired end library, dilution was performed to require 1 ng of each genome as input. The « tagmentation » step fragmented and tagged the DNA. Then limited cycle PCR amplification (12 cycles) completed the tag adapters and introduced dual-index barcodes. After purification on AMPure XP beads (Beckman Coulter Inc, Fullerton, CA, USA), the libraries were then normalized on specific beads according to the Nextera XT protocol (Illumina). Normalized libraries were pooled for sequencing on the MiSeq. The pooled single strand library was loaded onto the reagent cartridge and then onto the instrument along with the flow cell. Automated cluster generation and paired end sequencing with dual index reads were performed in a single 39-h run in 2x250-bp.
Total information of 7.6 Gb was obtained from a 931 K/mm2 cluster density with a cluster passing quality control filters of 82.8 % (17,658,000 clusters). Within this run, the index representation for M. gorillae was determined to 5.11 %. The 732,922 paired end reads were trimmed and filtered by Trimmomatic tool using the recommended parameters for Illumina sequence data .
Two mate pair libraries were prepared with 1 and 1.5 μg of genomic DNA using the Nextera mate pair Illumina guide. The genomic DNA sample was simultaneously fragmented and tagged with a mate pair junction adapter. The pattern of the fragmentation was validated on an Agilent 2100 BioAnalyzer (Agilent Technologies Inc, Santa Clara, CA, USA) with a DNA 7500 labchip. The DNA fragments ranged from 1 kb to 11 kb in size with the majority of fragments at 8.8 and 9.4 kb of size. No size selection was performed and 45 ng for the 1st library and 600 ng for the second library of tagmented fragments were circularized. The circularized DNA was mechanically sheared to small fragments with the majority at 400 and 380 bp on the Covaris device S2 in microtubes (Covaris, Woburn, MA, USA). The library profile was visualized on a High Sensitivity Bioanalyzer LabChip (Agilent Technologies Inc, Santa Clara, CA, USA) and the final concentration library was measured at 0.65 and 0.59 nmol/l respectively. The libraries were normalized at 2nM and pooled. After a denaturation step and dilution at 15 pM, the pool of libraries was loaded onto the reagent cartridge and then onto the instrument along with the flow cell. Automated cluster generation and sequencing run were performed in a single 39-h run in a 2x251-bp. The first libray was loaded three times on a flowcell and the second once. Within these runs, the index representation for M. gorillae was determined as an average at 3.51 %. The 1,881,286 paired reads were filtered according to the read qualities. The global paired end and mate pair libraries lead to 2,614,208 paired reads which were trimmed by Trimmomatic  then assembled by Spades software using the recommended options “--careful” and “-k 127” to fix the kmer size to 127 . The final assembly identified 14 scaffolds generating a genome size of 3.69 Mb which corresponds to genome coverage of 213X.
Open Reading Frames (ORFs) were predicted using Prodigal  with default parameters but the predicted ORFs were excluded if they spanned a sequencing gap region. The predicted bacterial protein sequences were searched against the GenBank database  and the Clusters of Orthologous Groups (COG) databases using BLASTP. The tRNAScanSE tool  was used to find tRNA genes, whereas ribosomal RNAs were found using RNAmmer  and BLASTn against the GenBank database. Lipoprotein signal peptides and the number of transmembrane helices were predicted using SignalP  and TMHMM  respectively. ORFans were identified if their BLASTP E-value was lower than 1e-03 for alignment length greater than 80 amino acids. If alignment lengths were smaller than 80 amino acids, we used an E-value of 1e-05. Such parameter thresholds have already been used in previous works to define ORFans. Artemis  was used for data management and DNA Plotter  for visualization of genomic features. The Mauve alignment tool (version 2.3.1) was used for multiple genomic sequence alignment . To estimate the mean level of nucleotide sequence similarity at the genome level between M. gorillae sp. nov. strain G3T and other members of the genus Microbacterium , we used the MAGI home-made software to calculate the average genomic identity of gene sequences (AGIOS) among compared genomes . Briefly, this software combines the Proteinortho software  for detecting orthologous proteins in pairwise genomic comparisons, then retrieves the corresponding genes and determines the mean percentage of nucleotide sequence identity among orthologous ORFs using the Needleman-Wunsch global alignment algorithm. Finally, we used Genome-to-Genome Distance Calculator (GGDC) web server available at (http://ggdc.dsmz.de) to estimate of the overall similarity among the compared genomes and to replace the wet-lab DNA-DNA hybridization (DDH) by a digital DDH (dDDH) [32, 33]. GGDC 2.0 BLAST+ was chosen as alignment method and the recommended formula 2 was taken into account to interpret the results.
The genome of M. gorillae strain G3T is 3,692,770 bp-long with a 69.3 % G+C content (Table 3, Fig. 3). Of the 3,566 predicted genes, 3,505 were protein-coding genes and 61 were RNA genes, including 4 complete rRNA operons (Additional file 4). A total of 2,412 genes (68.82 %) were assigned a putative function. A total of 6.33 % were identified as Pseudo-genes. The remaining genes were annotated as hypothetical proteins. The properties and the statistics of the genome are summarized in Table 3. The distribution of genes into COGs functional categories is presented in Table 4 and Additional file 4.
Insights from the genome sequence
Here, we compared the genome sequences of M. gorillae strain G3T (CDAR00000000) with those of Microbacterium barkeri strain 2011-R4 (AKVP00000000), Microbacterium maritypicum strain MF109 (ATAO00000000), Microbacterium indicum strain DSM 19969 (AULR00000000), Microbacterium laevaniformans strain OR221 (AJGR00000000), Microbacterium luticocti strain DSM 19459 (AULS00000000), Microbacterium paraoxydans strain 77MFTsu3.2 (AQYI00000000), Microbacterium testaceum strain StLB037 (AP012052) and Microbacterium yannicii strain PS01 (CAJF00000000). The draft genome of M. gorillae has a larger size than those of M. indicum , M. luticocti , M. laevaniformans , M. paraoxydans and M. barkeri , (3.69 vs 2.81, 3.11, 3.43, 3.48 and 3.64 Mb respectively) but is smaller than those of M. maritypicum , M. testaceum and M. yannicii (3.69 vs 4.0, 3.98 and 3.95 Mb respectively). The G+C content of M. gorillae is higher than those of M. laevaniformans and M. maritypicum (69.3 vs 68.0 and 68.2 % respectively) but lower than those of M. indicum , M. luticocti , M. testaceum , M. yannicii , M. paraoxydans and M. barkeri (69.3 vs 71.4, 70.7, 70.3, 69.5, 69.5, 69.2 %, respectively). The gene content of M. gorillae is lower than those of M. maritypicum and M. testaceum , (3,505 vs 3,856 and 3,676 genes respectively) but higher than those of, M. paraoxydens, M. yannicii , M. laevaniformans , M. barkeri , M. luticocti and M. indicum (3,312, 3,279. 3,249, 3,099, 2,355, 2,183 genes respectively) (Table 5). However the distribution of genes into COG categories was similar in all compared genomes (Additional file 5: Figure S3). In addition, M. gorillae shares 1,593, 1,658, 1,269, 1,396, 1,390, 1,416, 1,498 and 1,497 orthologous genes with M. barkeri , M. maritypicum , M. indicum , M. laevaniformans , M. luticocti , M. paraoxydans , M. testaceum and M. yannici i respectively (Table 5). Among compared genomes except M. gorillae, AGIOS values range from 75.51 % between M. indicum and M. maritypicum to 85.33 % between M. maritypicum and M. barkeri . When M. gorillae was compared to other species, AGIOS values range from 75.22 % with M. maritypicum to 76.41 % with M. luticocti (Table 5). dDDH estimation of the strain G3T against the compared genomes ranged between 19.70 to 20.50. These values are very low and below the cutoff of 70 %, thus confirming again the new species status of the strain G3T.
On the basis of phenotypic characteristics, phylogenetic position, genomic analyses (taxonogenomics) and GGDC results, we formally propose the creation of Microbacterium gorillae sp. nov. that contains the strain G3T. This strain has been isolated from a gorilla stool sample collected from Cameroon.
Taxonomic and nomenclatural proposals
Description of Microbacterium gorillae sp. nov.
Microbacterium gorillae (go.ril’lae. NL neut. gen gorilla, pertaining to a gorilla from which the stool sample was obtained).
Cells stain Gram-positive, are small rod, non-endospore-forming, non-motile and have a diameter of 0.5 μm and a length of 1 μm. Colonies are gray and 2 mm in diameter on blood-enriched Columbia agar. Growth occurs between 25 and 37 °C, with optimal growth observed at 25 °C.
Strain G3T exhibited catalase activity but not oxidase activity. Strain produces acid from esculin, D-cellobiose, D-maltose, D-lactose, D-mannose, D-mannitol, D-saccharose, D-trehalose and gentiobiose but not from glycerol, erythritol, D-arabinose, L-arabinose, D-ribose, D-xylose, L-xylose, D-adonitol, methyl-αD-xylopyranoside, D-galactose, D-glucose, L-fructose, L-sorbose, L-rhamnose, dulcitol, inositol, D-sorbitol, methyl-αD-mannopyranoside, Methyl-αD-glucopyranoside, xylitol, D-tagatose, D-turanose, D-lyxose, D-fucose, L-fucose, D-arabitol, L-arabitol, potassium gluconate, potassium 2-cetogluconate, potassium 5-cetogluconate, D-melezitose, D-raffinose, Glycogen, N-acetylglucosamin, amygdalin, arbutin, salicin and hydrolysis of starch.
Positive enzyme activities were observed for esterase (C4), esterase lipase (C8), leucine aramidase, phosphatase acid, naphtol-AS-BI-phosphohydrolase, α-mannosidase, α- glucosidase and N-acetyl-β-glucosaminidase. Negative results for lipase (C14), phosphatase alcalin, valine arylamidase, cystine arylamidase, trypsin, α-chymotrypsin, α-galactosidase, β – galactosidase, β-glucosidase, β-glucuronidase, β-glucosidase, and α-fucosidase.
M. gorillae is susceptible to amoxicillin, erythromycin, doxycyclin, rifampicin, vancomycin, amoxicillin-clavulanic acid, trimethoprim-sulfamethoxazole and imipenem but resistant to ciprofloxacin and gentamycin.
The G+C content of the genome is 69.3 %. The 16S rRNA and genome sequences are deposited in GenBank under accession numbers JX650056 and CDAR00000000, respectively. The type strain G3T (= CSUR P207 = DSM 26203) was isolated from the fecal sample of a western lowland gorilla from Cameroon.
Collection de souches de l’Unité des Rickettsies
Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes
Deutsche Sammlung von Mikroorganismen
- MALDI-TOF MS:
Matrix-assisted laser-desorption/ionization time-of-flight mass spectrometry
- TE buffer:
Genome-to-Genome Distance Calculator
digital DNA-DNA hybridization
Bittar F, Keita MB, Lagier JC, Peeters M, Delaporte E, Raoult D. Gorilla gorilla gorilla gut: a potential reservoir of pathogenic bacteria as revealed using culturomics and molecular tools. Sci Rep. 2014;4:7174.
Skerman VBD, McGowan V, Sneath PHA. Approved Lists of Bacterial Names. Int J Syst Bacteriol. 1980;30:225–420.
Abstract for the genus Microbacterium. NamesforLife, LLC. Retrieved June 26, 2015. (doi:10.1601/tx.6034).
Gneiding K, Frodl R, Funke G. Identities of Microbacterium spp. Encountered in Human Clinical Specimens. J Clin Microbiol. 2008;46:3646.
Anand S, Bala K, Saxena A, Schumann P, Lal R. Microbacterium amylolyticum sp. nov., isolated from soil from an industrial waste site. Stand Int J Syst Evol Microbiol. 2012;62:2114–20.
Bakir MA, Kudo T, Benno Y. Microbacterium hatanonis sp. nov., isolated as a contaminant of hairspray. Int J Syst Evol Microbiol. 2008;58(Pt 3):654–8.
Alves A, Riesco R, Correia A, Trujillo ME. Microbacterium proteolyticum sp. nov. isolated from roots of Halimione portulacoides. Int J Syst Evol Microbiol. 2015;65(Pt 6):1794–8.
Sentausa E, Fournier PE. Advantages and limitations of genomics in prokaryotic taxonomy. Clin Microbiol Infect. 2013;19(9):790–5.
Meier-Kolthoff JP, Göker M, Spröer C, Klenk HP. When should a DDH experiment be mandatory in microbial taxonomy? Arch Microbiol. 2013;6:413–8.
Zhang Y, Ren H, Zhang G. Microbacterium hydrothermale sp. nov., an actinobacterium isolated from hydrothermal sediment. Int J Syst Evol Microbiol. 2014;64:3508–12.
Shivaji S, Bhadra B, Rao RS, Chaturvedi P, Pindi PK, Raghukumar C. Microbacterium indicum sp. nov., isolated from a deep-sea sediment sample from the Chagos Trench, Indian Ocean. Int J Syst Evol Microbiol. 2007;57:1819–22.
Vaz-Moreira I, Lopes AR, Falsen E, Schumann P, Nunes OC, Manaia CM. Microbacterium luticocti sp. nov., isolated from sewage sludge compost. Int J Syst Evol Microbiol. 2008;58:1700–4.
Laffineur K, Avesani V, Cornu G, Charlier J, Janssens M, Wauters G, et al. Bacteremia due to a novel Microbacterium species in a patient with leukemia and description of Microbacterium paraoxydans sp. nov. J Clin Microbiol. 2003;41(5):2242–6.
Schippers A, Bosecker K, Sproer C, Schumann P. Microbacterium oleivorans sp. nov. and Microbacterium hydrocarbonoxydans sp. nov., novel crude-oil-degrading Gram-positive bacteria. Int J Syst Evol Microbiol. 2005;55(Pt 2):655–60.
Sharma P, Diene SM, Thibeaut S, Bittar F, Roux V, Gomez C, et al. Phenotypic and genotypic properties of Microbacterium yannicii, a recently described multidrug resistant bacterium isolated from a lung transplanted patient with cystic fibrosis in France. BMC Microbiol. 2013;13:97.
Takeuchi M, Hatano K. Proposal of six new species in the genus Microbacterium and transfer of “Flavobacterium marinotypicum” ZoBell and Upham to the genus Microbacterium as Microbacterium maritypicum comb. nov. Int J Syst Bacteriol. 1998;48 Pt 3:973–82.
Seng P, Drancourt M, Gouriet F, et al. Ongoing revolution in bacteriology: routine identification of bacteria by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Clin Infect Dis. 2009;49:543–51.
Field D, Garrity G, Gray T, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26:541–7.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.
Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–77.
GenBank database. http://www.ncbi.nlm.nih.gov/genbank.
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–64.
Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35:3100–8.
Bendtsen JD, Nielsen H, von Heijne G, Brunak S. Improved prediction of signal peptides: SignalP 3.0. J Mol Biol. 2004;340:783–95.
Krogh A, Larsson B, von Heijne G, Sonnhammer EL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001;305:567–80.
Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream MA, Barrell B. Artemis: sequence visualization and annotation. Bioinformatics. 2000;16:944–5.
Carver T, Thomson N, Bleasby A, Berriman M, Parkhill J. DNAPlotter: circular and linear interactive genome visualization. Bioinformatics. 2009;25:119–20.
Darling AC, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14:1394–403.
Ramasamy D, Mishra AK, Lagier JC, et al. A polyphasic strategy incorporating genomic data for the taxonomic description of novel bacterial species. Int J Syst Evol Microbiol. 2014;64:384–91.
Lechner M, Findeiss S, Steiner L, Marz M, Stadler PF, Prohaska SJ. Proteinortho: detection of (co-)orthologs in large-scale analysis. BMC Bioinformatics. 2011;12:124.
Auch AF, von Jan M, Klenk HP, Göker M. Digital DNA-DNA hybridization for microbial species delineation by means of genome-to-genome sequence comparison. Stand Genomic Sci. 2010;2:117–34.
Meier-Kolthoff JP, Auch AF, Klenk HP, Göker M. Genome sequence-based species delimitation with confidence intervals and improved distance functions. BMC Bioinformatics. 2013;14:60.
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archae, Bacteria, and Eukarya. Proc Natl Acad Sci U S A. 1990;87:4576–9.
Stackebrandt E, Rainey FA, Ward-Rainey NL. Proposal for a new hierarchic classification system, Actinobacteria classis nov. Int J Syst Bacteriol. 1997;47:479–91.
Validation List no. 53 in IJSEM. Validation of the Publication of New Names and New Combinations Previously Effectively Published Outside the IJSB. Int J Syst Evol Microbiol. 1995;25:418–9.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium Nat Genet. 2000;25:25–9.
Castresana J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000;17:540–52.
Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–90.
Fadi Bittar was supported by a Chair of Excellence IRD provided by the Institut de Recherche pour le Développement / Méditerranée-Infection foundation. Mamadou Bhoye Keita was funded by the Méditerranée-Infection foundation. The authors thank Xegen company for automating the genome annotation process.
The authors declare that they have no competing interests.
LH wrote the manuscript and analyzed the data. MBK performed laboratory experiments and helped to draft the manuscript. CM performed the sequencing and helped to draft the manuscript. JR and AL performed bioinformatics analysis and helped to draft the manuscript. DR, PF, JMR and FB conceived the study, participated in its design and coordination and helped to draft the manuscript. All authors read and approved the final manuscript.
Differential phenotypic characteristics between Microbacterium gorillae sp. nov. strain G3T and others Microbacterium strains. (DOCX 14 kb)
Reference mass spectrum from M. gorillae strain G3T. Spectra from 12 individual colonies were compared and a reference spectrum was generated. (PPTX 44 kb)
Gel view comparing Microbacterium gorillae strain G3T spectra with other members of the genus Microbacterium. The gel view displays the raw spectra of all loaded spectrum files arranged in a pseudo-gel like look. The x-axis records the m/z value. The left y-axis displays the running spectrum number originating from subsequent spectra loading. The peak intensity is expressed by a gray-scale scheme code. The color bar and the right y-axis indicate the relation between the color a peak is displayed with and the peak intensity in arbitrary units. Displayed species are indicated on the right. (PPTX 76 kb)
Annotation results. (RAR 1566 kb)
Distribution of functional classes of predicted genes of M. gorillae strain G3T with 8 members of Microbacterium genus. (PPTX 63 kb)