Skip to main content


Draft genome sequence of Bacillus amyloliquefaciens HB-26

Article metrics

  • 787 Accesses

  • 2 Citations


Bacillus amyloliquefaciens HB-26, a Gram-positive bacterium was isolated from soil in China. SDS-PAGE analysis showed this strain secreted six major protein bands of 65, 60, 55, 34, 25 and 20 kDa. A bioassay of this strain reveals that it shows specific activity against P. brassicae and nematode. Here we describe the features of this organism, together with the draft genome sequence and annotation. The 3,989,358 bp long genome (39 contigs) contains 4,001 protein-coding genes and 80 RNA genes.


Bacillus amyloliquefaciens (B. amyloliquefaciens) is a species of bacterium in the genus Bacillus with high affinity of Bacillus subtilis. In the growth process, B. amyloliquefaciens can produce numerous antimicrobial or, more generally, bioactive metabolites with well-established activity in vitro such as surfactin, iturin and fengycin [1,2]. The production of all of these antibiotic compounds highlights B. amyloliquefaciens as a good candidate for the development of biocontrol agents [3,4].

Strain HB-26 belongs to the species B. amyloliquefaciens. The type strain of the species produces much bioactive metabolites showing specific activity against Plasmodiophora brassicae which could cause Clubroot, one of the most serious diseases of brassica crops worldwide [57]. Heavy infection by this pathogen of Chinese cabbage, cabbage, broccoli, turnip, oilseed rape, and other crucifers can lead to severe economic losses [811]. The root systems of infected plants show gall formation, which inhibits nutrient and water transport, stunts plant growth, and increases susceptibility to wilting [12,13]. Otherwise, bioassay results showed strain HB-26 also had some root-knot nematicidal activity.

Here, we present a summary classification and a set of features for B. amyloliquefaciens HB-26, together with the description of the genomic sequencing and annotation in order to improve the understanding of the molecular basis for its ability to inhibit Plasmodiophora brassicae and nematode.

Classification and features

Strain HB-26 colonies were milky white and matte with a wrinkled surface. Microscopy observations indicated that it was a Bacillus species (Figure 1A, Figure 1B and Table 1). SDS-PAGE analysis showed this strain secreted six major protein bands of 65, 60, 55, 34, 25 and 20 kDa (Figure 1C).

Figure 1.

General characteristics of B. amyloliquefaciens HB-26. (A) The colonial morphology pictures of strain HB-26. (B) Phase contrast micrograph of HB-26. (C) SDS-PAGE analysis of proteins of HB-26. Lane M, protein molecular weight marker; Lane 1, proteins of strain HB-26.

Table 1. Classification and general features of B. amyloliquefaciens HB-26

A representative genomic 16S rDNA sequence of strain HB-26 was searched against GenBank database using BLAST [29]. Sequences showing more than 99% sequence identity to 16S rDNA of HB-26 were selected for phylogentic analysis, and 15 sequences were aligned with ClustalW algorithm. The tree was reconstructed by neighbor-Joining by using Kimura 2-parameter for distance calculation. The phylogenetic tree was assessed by bootstrapped for 1,000 times, and the consensus tree was shown in Figure 2.

Figure 2.

Neighbor-Joining Phylogenetic tree was generated using MEGA 4 based on 16S rRNA sequences. The strains and their corresponding GenBank accession numbers for 16S rDNA sequences are: A: B. amyloliquefaciens ML581 (KC692179.1); B: B. amyloliquefaciens JM-21 (KC752450.1); C: Bacillus strain HB-26 (HM138476); D: B. vallismortis WA3-7 (JF496475.1); E: B. sp.BYK1448 (HF549161.1); F: B. subtilis 2B (KF112078.1); G: B. methylotrophicus GZGL8 (JN999861.1); H: B. vallismortis D20 (KC441761.1); I: B. tequilensis L10 (JN700126.1); J: B. sp. C4(2013) (KC310834.1); K: B. subtilis WBZ (KC460988.1); L: B. Amyloliquefaciens CA81 (KF040978.1); M: B. sp. SWB30 (JX861886.1); N: B. methylotrophicus Ns7-22 (HQ831412.1); O: B. subtilis 26A (KC295415.1). The phylogenetic tree was constructed by using the neighbor-joining method within the MEGA software [30].

Genome sequencing information

Genome project history

This Bacillus strain was selected for sequencing due to its specific activity against Plasmodiophora brassicae and nematode. The complete high quality draft genome sequence is deposited in GenBank. The Beijing Genomics Institute (BGI) performed the sequencing and the NCBI staffs used the Prokaryotic Genome Annotation Pipeline (PGAAP) to complete the annotation. A summary of the project is given in Table 2.

Table 2. Genome sequencing project information

Growth conditions and DNA isolation

B. amyloliquefaciens HB-26 was grown in 50 mL Luria-Broth for 6 h at 28°C. DNA was isolated by incubating the cells with lysozyme (10 mg/mL) in 2 mL TE (50 mM Tris base, 10 mM EDTA, 20% sucrose, pH8.0) at 4°C for 6 h. 4 mL of 2% SDS were added and the mixture was incubated at 55°C for 30 min; 2 mL 5M NaCl were added, and the mixture was incubated at 4°C for 10 min. DNA was purified by organic extraction and ethanol precipitation.

Genome sequencing and assembly

The genome of B. amyloliquefaciens HB-26 was sequenced using Illumina Hiseq 2000 platform (with a combination of a 251-bp paired-end reads sequencing from a 700-bp genomic library). Reads with average quality scores below Q30 or more than 3 unidentified nucleotides were eliminated. 2,605,589 paired-end reads (achieving 192 fold coverage [0.94 Gb]) was de novo assembled using SOAPdenovo 1.05 version [9]. The assembly consists of 39 contigs arranged in 39 scaffolds with a total size of 3,989,358 bp (including chromosome and plasmids).

Genome annotation

Genome annotation was completed using the Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP). Briefly, Protein-coding genes were predicted using a combination of GeneMark and Glimmer [3133]. Ribosomal RNAs were predicted by sequence similarity searching using BLAST against an RNA sequence database and/or using Infernal and Rfam models [34,35]. Transfer RNAs were predicted using tRNAscan-SE [36]. In order to detect missing genes, a complete six-frame translation of the nucleotide sequence was done and predicted proteins (generated above) were masked. All predictions were then searched using BLAST against all proteins from complete microbial genomes. Annotation was based on comparison to protein clusters and on the BLAST results. Conserved domain Database and Cluster of Orthologous Group information is then added to the annotation.

Genome properties

The draft assembly of the genome consists of 39 contigs in 39 scaffolds, with an overall 47.37% G+C content. Of the 4,114 genes predicted, 4,001 were protein-coding genes, and 80 RNAs were also identified. The majority of the protein-coding genes (54.06%) were assigned a putative function while the remaining ones were annotated as hypothetical proteins. The distribution of genes into COGs functional categories is presented in Table 3, Table 4 and Figure 3.

Figure 3.

Graphical circular map of the Bacillus amyloliquefaciens HB-26 genome. From the outside to the center: genes on forward strand (color by COG categories), genes on reverse strand (color by COG categories), GC content, GC skew. The map was generated with the CGviewer server (Stothard Rearch Group:

Table 3. Genome Statistics
Table 4. Number of genes associated with the general COG functional categories


  1. 1.

    Vilas-Bôas GT, Peruca AP, Arantes OM. Biology and taxonomy of Bacillus cereus, Bacillus anthracis, and Bacillus thuringiensis. Can J Microbiol 2007; 53:673–687. PubMed

  2. 2.

    Chowdhury SP, Dietel K, Rändler M, Schmid M, Junge H, Borriss R, Hartmann A, Grosch R. Effects of Bacillus amyloliquefaciens FZB42 on Lettuce Growth and Health under Pathogen Pressure and Its Impact on the Rhizosphere Bacterial Community. PLoS ONE 2013; 8:e68818. PubMed

  3. 3.

    Helgason E, Caugant DA, Lecadet MM, Chen Y, Mahillon J, Lovgren A, Hegna I, Kvaloy K, Kolsto AB. Genetic diversity of Bacillus cereus/B. thuringiensis isolates from natural sources. Curr Microbiol 1998; 37:80–87. PubMed

  4. 4.

    Arguelles-Arias A, Ongena M, Halimi B, Lara Y, Brans A, Joris B, Fickers P. Bacillus amyloliquefaciens GA1 as a source of potent antibiotics and other secondary metabolites for biocontrol of plant pathogens. Microb Cell Fact 2009; 8:63. PubMed

  5. 5.

    Helgason E, Okstad OA, Caugant DA, Johansen HA, Fouet A, Mock M, Hegna I, Kolsto AB. Bacillus anthracis, Bacillus cereus, and Bacillus thuringiensisone species on the basis of genetic evidence. Appl Environ Microbiol 2000; 66:2627–2630. PubMed

  6. 6.

    Ticknor LO, Kolsto AB, Hill KK, Keim P, Laker MT, Tonks M, Jackson PJ. Fluorescent Amplified Fragment Length Polymorphism Analysis of Norwegian Bacillus cereus and Bacillus thuringiensis Soil Isolates. Appl Environ Microbiol 2001; 67:4863–4873. PubMed

  7. 7.

    Nagaoka T, Doullah MA, Matsumoto S, Kawasaki S, Ishikawa T, Hori H, Okazaki K. Identification of QTLs that control clubroot resistance in Brassica oleracea and comparative analysis of clubroot resistance genes between B. rapa and B. oleracea. Theor Appl Genet 2010; 120:1335–1346. PubMed

  8. 8.

    Rocherieux J, Glory P, Giboulot A, Boury S, Barbeyron G, Thomas G, Manzanares-Dauleux MJ. Isolate-specific and broad-spectrum QTLs are involved in the control of clubroot in Brassica oleracea. Theor Appl Genet 2004; 108:1555–1563. PubMed

  9. 9.

    Chen XH, Koumoutsi A, Scholz R, Eisenreich A, Schneider K, Heinemeyer I, Morgenstern B, Voss B, Hess WR, Reva O, et al. Comparative analysis of the complete genome sequence of the plant growth-promoting bacterium Bacillus amyloliquefaciens FZB42. Nat Biotechnol 2007; 25:1007–1014. PubMed

  10. 10.

    Choi K, Yi Y, Lee S, Kang K, Lee E, Hong S, Young J, Park Y, Choi GJ, Kim BJ, Lim Y. Microorganisms against Plasmodiophora brassicae. J Microbiol Biotechnol 2007; 17:873–877. PubMed

  11. 11.

    Helgason E, Tourasse NJ, Meisal R, Caugant DA, Kolsto AB. Multilocus sequence typing scheme for bacteria of the Bacillus cereus group. Appl Environ Microbiol 2004; 70:191–201. PubMed

  12. 12.

    Schnepf E, Crickmore N, Van Rie J, Lereclus D, Baum J, Feitelson J, Zeigler DR, Dean DH. Bacillus thuringiensis and its pesticidal crystal proteins. Microbiol Mol Biol Rev 1998; 62:775–806. PubMed

  13. 13.

    Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000; 25:25–29. PubMed

  14. 14.

    Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci USA 1990; 87:4576–4579. PubMed

  15. 15.

    Gibbons NE, Murray RGE. Proposals Concerning the Higher Taxa of Bacteria. Int J Syst Bacteriol 1978; 28:1–6.

  16. 16.

    Garrity GM, Holt JG. The Road Map to the Manual. In: Garrity GM, Boone DR, Castenholz RW (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 1, Springer, New York, 2001, p. 119–169.

  17. 17.

    Murray RGE. The Higher Taxa, or, a Place for Everything…? In: Holt JG (ed), Bergey’s Manual of Systematic Bacteriology, First Edition, Volume 1, The Williams and Wilkins Co., Baltimore, 1984, p. 31–34.

  18. 18.

    List of new names and new combinations previously effectively, but not validly, published. List no. 132. Int J Syst Evol Microbiol 2010; 60:469–472.

  19. 19.

    Ludwig W, Schleifer KH, Whitman WB. Class I. Bacilli class nov. In: De Vos P, Garrity G, Jones D, Krieg NR, Ludwig W, Rainey FA, Schleifer KH, Whitman WB (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 3, Springer-Verlag, New York, 2009, p. 19–20.

  20. 20.

    Skerman VBD, McGowan V, Sneath PHA. Approved Lists of Bacterial Names. Int J Syst Bacteriol 1980; 30:225–420.

  21. 21.

    Prévot AR. In: Hauderoy P, Ehringer G, Guillot G, Magrou. J., Prévot AR, Rosset D, Urbain A (eds), Dictionnaire des Bactéries Pathogènes, Second Edition, Masson et Cie, Paris, 1953, p. 1–692.

  22. 22.

    Fischer A. Untersuchungen über bakterien. Jahrbücher für Wissenschaftliche Botanik 1895; 27:1–163.

  23. 23.

    Cohn F. Untersuchungen über Bakterien. Beitr Biol Pflanz 1872; 1:127–224.

  24. 24.

    Gibson T, Gordon RE. Genus I. Bacillus Cohn 1872, 174; Nom. gen. cons. Nomencl. Comm. Intern. Soc. Microbiol. 1937, 28; Opin. A. Jud. Comm. 1955, 39. In: Buchanan RE, Gibbons NE (eds), Bergey’s Manual of Determinative Bacteriology, Eighth Edition, The Williams and Wilkins Co., Baltimore, 1974, p. 529–550.

  25. 25.

    Priest FG, Goodfellow M, Shute LA, Berkeley RCW. Bacillus amyloliquefaciens sp. nov., nom. rev. Int J Syst Bacteriol 1987; 37:69–71.

  26. 26.

    Wang LT, Lee FL, Tai CJ, Kuo HP. Bacillus velezensis is a later heterotypic synonym of Bacillus amyloliquefaciens. Int J Syst Evol Microbiol 2008; 58:671–675. PubMed

  27. 27.

    Fukomoto J. Studies on the production of bacterial amylase. I. Isolation of bacteria secreting potent amylase and their distribution. Nippon Nogeikagaku Kaishi 1943; 19:487–503.

  28. 28.

    Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol 1990; 215:403–410. PubMed

  29. 29.

    Li R, Zhu H, Ruan J, Qian W, Fang X, Shi Z, Li Y, Li S, Shan G, Kristiansen K, et al. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res 2010; 20:265–272. PubMed

  30. 30.

    Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 2011; 28:2731–2739. PubMed

  31. 31.

    Besemer J, Lomsadze A, Borodovsky M. GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res 2001; 29:2607–2618. PubMed

  32. 32.

    Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. Improved microbial gene identification with GLIMMER. Nucleic Acids Res 1999; 27:4636–4641. PubMed

  33. 33.

    Lukashin AV, Borodovsky M. GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res 1998; 26:1107–1115. PubMed

  34. 34.

    Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR. Rfam: an RNA family database. Nucleic Acids Res 2003; 31:439–441. PubMed

  35. 35.

    Eddy SR. A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure. BMC Bioinformatics 2002; 3:18. PubMed

  36. 36.

    Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 1997; 25:955–964. PubMed

Download references


This work was financially supported by the National Science and Technology Support Program (2008BADA5B03), the National 863 High Technology Research Program of China (2011AA10A201, 2011AA10A203), China 948 Program of Ministry of Agriculture (2011-G25), the National Science and Technology Support Program (2011BAB06B004-02), Hubei Province Development Plan (YJN0077) and the Science and Technology Support Program of Academy of Agricultural Sciences of Hubei Province (2012NKYJJ21).

Author information

Correspondence to Zi-Wen Yang.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark


  • Bacillus amyloliquefaciens HB-26
  • next-generation sequencing
  • Plasmodiophora brassicae