- Short genome report
- Open Access
Complete genome sequence of the sand-sediment actinobacterium Nocardioides dokdonensis FR1436T
Standards in Genomic Sciencesvolume 12, Article number: 44 (2017)
Nocardioides dokdonensis, belonging to the class Actinobacteria, was first isolated from sand sediment of a beach in Dokdo, Korea, in 2005. In this study, we determined the genome sequence of FR1436, the type strain of N. dokdonensis, and analyzed its gene contents. The genome sequence is the second complete one in the genus Nocardioides after that of Nocardioides sp. JS614. It is composed of a 4,376,707-bp chromosome with a G + C content of 72.26%. From the genome sequence, 4,104 CDSs, three rRNA operons, 51 tRNAs, and one tmRNA were predicted, and 71.38% of the genes were assigned putative functions. Through the sequence analysis, dozens of genes involved in steroid metabolism, especially its degradation, were detected. Most of the identified genes were located in large gene clusters, which showed high similarities with the gene clusters in Pimelobacter simplex VKM Ac-2033D. Genomic features of N. dokdonensis associated with steroid catabolism indicate that it could be used for research and application of steroids in science and industry.
Bacteria in the genus Nocardioides were first isolated from soil in 1976  and currently more than 90 validly published Nocardioides species are available from diverse terrestrial and aquatic environments such as soil, wastewater, plant roots, groundwater, beach sand, and marine sediment [2,3,4,5,6,7,8,9,10]. Originally, the genus was classified as a member of the order Actinomycetales in the phylum Actinobacteria , but recently was reclassified to the order Propionibacteriales . Actinobacteria , also called Gram-positive high G + C bacteria, contain diverse bacterial groups that are capable of a variety of secondary metabolism including biosynthesis of antibiotics and degradation of harmful compounds [12, 13]. The genus Nocardioides is also known to utilize several kinds of non-degradable materials such as alkane compounds , atrazine , phenanthrene , trinitrophenol , and vinyl chloride . Despite almost 100 species with validly published names and their useful features associated with secondary metabolism, only draft genome sequences are publically available for the genus besides that of Nocardioides sp. JS614.
N. dokdonensis was isolated from beach sand in Dokdo, a volcanic island located in the East Sea of Korea, in 2005 . The East Sea is called a “mini-ocean” due to its oceanological properties  and is known to have a high microbial diversity . To reveal distinguishing genomic features of Nocardioides species, we determined and analyzed the genome sequence of N. dokdonensis FR1436T.
Classification and features
Nocardioides dokdonensis FR1436T, a Gram-positive, non-motile, and strictly aerobic bacterium, was isolated from sand sediment of the Dokdo island in Korea . The strain grows at the temperature range of 4 to 30 °C (optimum, 25 °C), pH range of 5.0 to 10.0 (optimum, 7.0), and NaCl concentration of 0 to 7% (w/v) (optimum, 0 to 3) . Its colony size is about 1.0–2.0 mm on TSA medium after incubation for 3 days at 25 °C. Cells are 1.2–1.8 μm long and 0.6–0.9 μm wide in size  (Fig. 1). FR1436 can utilize adonitol, glycerol, melezitose, melibiose, ribose, sodium acetate, sodium citrate, sodium propionate, and sodium pyruvate as a sole carbon source . Minimum information about the genome sequence (MIGS) for FR1436 is described in Table 1.
Phylogenetically, N. dokdonensis belongs to the family Nocardioidaceae of the order Propionibacteriales , and a phylogenetic tree based on the 16S rRNA genes of the type strains in the genus Nocardioides shows that N. dokdonensis FR1436 forms a sister clade with N. lianchengensis (Fig. 2), which was isolated from soil, and shares common ancestor with N. marinisabuli , N. basaltis , and N. salaries.
Genome sequencing information
Genome project history
As part of the project that investigates the genomic and metabolic features of bacterial isolates in and around Dokdo, the genome sequencing and analysis of N. dokdonensis FR1436 were performed at the Laboratory of Microbial Genomics and Systems/Synthetic Biology at Yonsei University. The complete genome sequence of N. dokdonensis FR1436T (= KCTC 19309 T = JCM 14815 T) has been deposited in GenBank under the accession number CP015079. The Bioproject accession number is PRJNA191956. A summary of the genome project is provided in Table 2.
Growth conditions and genomic DNA preparation
N. dokdonensis FR1436 was streaked on trypticase soy agar medium (Difco, 236,950) and incubated at 25 °C for 3 days. A single colony was inoculated in trypticase soy broth and incubated at 25 °C for 2 days. Cells in the exponential phase were harvested and genomic DNA was extracted using Wizard Genomic DNA Purification Kit (Promega, USA) according to the manufacturer’s protocol.
Genome sequencing and assembly
Genome sequencing of N. dokdonensis FR1436 was performed using the PacBio RS II System (Macrogen, Inc., Republic of Korea). A 20-kb library and C4-P6 chemistry were used for the genome sequencing. A total of 200,435 continuous long reads and 1,551,246,448 base pairs were generated after genome sequencing and quality trimming of the sequencing reads. De novo assembly was conducted with SMRTpipe HGAP and scaffolding and gap filling were performed with SMRTpipe AHA. Finally, consensus sequences were generated with SMRTpipe Quiver.
Structural gene prediction and functional annotation were conducted using the Prokka program . Additionally, we performed a functional assignment of the predicted protein-coding sequences using blastp against Pfam, Uniref90, KEGG, COG, and GenBank NR databases for more accurate annotation. tRNAscan-SE  and RNAmmer  were used for prediction of transfer RNAs and ribosomal RNAs, respectively. Assignment of the Clusters of Orthologous Groups was conducted with RPS-BLAST against COG database with an e-value cutoff of less than 1e-02. Clustered regularly interspaced short palindromic repeats were predicted with CRISPR Finder . Proteins containing signal peptide and transmembrane helices were predicted using SignalP  and TMHMM , respectively. Secondary metabolite biosynthetic genes were predicted using AntiSMASH program .
N. dokdonensis FR1436 has a single chromosome of 4,376,707 bp in length, and consists of 72.26% of G + C content (Fig. 3 and Table 3). The genome has 4165 genes that are comprised of 4104 CDSs, three rRNA operons, 51 tRNAs, and one tmRNA. Results from the analysis of KEGG pathways indicated that, in the genome of FR1436, all of the genes involved in glycolysis, gluconeogenesis, and citrate cycle are present and well conserved. Among the predicted genes, 71.38% of the genes were assigned putative functions and 2832 CDSs was functionally assigned to the COG categories (Table 4). Also in the genome, ten putative CRISPR repeats were predicted using the CRISPRFinder program, but there were no CRISPR-associated proteins next to the predicted repeat sequences. Two gene clusters, possibly associated with secondary metabolism, were predicted using the AntiSMASH program. One cluster (accession numbers ANH38050 to ANH38087) has genes associated with the phenylacetate catabolic pathway  and another cluster (accession numbers ANH40163 to ANH40204) has genes of type 3 polyketide synthases.
Insights from the genome sequence
In the genome of N. dokdonensis FR1436, dozens of steroid-degrading genes were detected (Additional file 1). Major functions of steroids, essential biomolecules in living organisms, include maintaining membrane fluidity as a component of the cell membrane and controlling cell metabolism as signaling molecules . Moreover, steroid medicines are used for treatment of a number of diseases from inflammation to cancer . The molecular backbone of steroids is composed of three cyclohexanes and one cyclopentane. To the backbone, diverse side chains are attached to endow them with diverse functions . Catabolic pathways of steroid degradation or modification have been analyzed in depth for some genera in the order Corynebacteriales [33,34,35]. In Nocardioidaceae , several large gene clusters, which have potential binding sites of the transcriptional regulator associated with steroid catabolism in their promoters, were predicted in the genome of Pimelobacter simplex VKM Ac-2033D . In the genome of FR1436, gene cluster A, which is known to be involved in degrading steroid rings A/B, and gene cluster B, which is involved in degrading side chains, were detected (Fig. 4). However, in FR1436, cluster A is separated into two large gene clusters and an additional mce gene cluster, which is involved in steroid uptake , was detected (Additional file 1). In VKM Ac-2033D, cluster A is located approximately 350-kb downstream of cluster B, whereas in FR1436, cluster A is located 6 kb downstream. Moreover, two kstR and 11 kstR2 genes, which encode the TetR family of transcriptional regulators and are reported to regulate cholesterol metabolism in mycobacteria , were detected (Additional file 1). Besides the genes in clusters A and B, genes encoding 3-beta-hydroxysteroid dehydrogenase (ANH36717 and ANH37882), 3-alpha-hydroxysteroid dehydrogenase (ANH37023 and ANH37488), and steroid delta-isomerase (ANH36955) were also detected in the genome of FR1436. Additionally, all genes involved in degradation of cholesterol to HIP-CoA were identified (Fig. 5). These results indicate that the genus Nocardioides can be useful for research and utilization of steroid metabolism.
Steroids are important biomolecules in living organisms and carry out diverse roles as components of the cell membrane to signaling molecules . Moreover, steroids are being used to treat various diseases from inflammation to cancer . These indicate that research on modification of steroid compounds has infinite possibilities to improve human health. To date, studies on bacterial steroid metabolism have been mainly focused on the order Corynebacteriales [33,34,35]. Recently, genome analysis of the genus Nocardioides in the order Propionibacteriales revealed several kinds of gene clusters associated with steroid degradation . In this study, we determined the complete genome sequence of N. dokdonensis FR1436 and analyzed the genome sequence to detect the presence of genes related to steroid metabolism. In the genome of FR1436, dozens of genes associated with steroid catabolism were detected in large gene clusters. These results demonstrate that bacteria in the genus Nocardioides can be used as promising candidates for steroid research and related fields of industry.
Basic Local Alignment Search Tool
Clusters of Orthologous Groups
Clustered regularly interspaced short palindromic repeat
Kyoto Encyclopedia of Genes and Genomes
Minimum information about a genome sequence
Reversed position specific-BLAST
UniProt reference clusters
Prauser H. Nocardioides, a new genus of the order Actinomycetales. Int J Syst Evol Microbiol. 1976;26:58–65.
Lim JM, Kim SJ, Hamada M, Ahn JH, Weon HY, Suzuki K, Ahn TY, Kwon SW. Nocardioides daecheongensis sp. nov., isolated from soil. Int J Syst Evol Microbiol. 2014;64(Pt 12):4109–14.
Deng S, Chang X, Zhang Y, Ren L, Jiang F, Qu Z, Peng F. Nocardioides antarcticus sp. nov., isolated from marine sediment. Int J Syst Evol Microbiol. 2015;65(8):2615–21.
Singh H, Yin CS. Nocardioides flava sp. nov., isolated from rhizosphere of poppy plant, Republic of Korea. Arch Microbiol. 2016;198(3):279–85.
Han JH, Kim TS, Joung Y, Kim MN, Shin KS, Bae T, Kim SB. Nocardioides endophyticus sp. nov. and Nocardioides conyzicola sp. nov., isolated from herbaceous plant roots. Int J Syst Evol Microbiol. 2013;63(Pt 12):4730–4.
Cui Y, Woo SG, Lee J, Sinha S, Kang MS, Jin L, Kim KK, Park J, Lee M, Lee ST. Nocardioides daeguensis sp. nov., a nitrate-reducing bacterium isolated from activated sludge of an industrial wastewater treatment plant. Int J Syst Evol Microbiol. 2013;63(Pt 10):3727–32.
Yoon JH, Kang SJ, Park S, Kim W, Oh TK. Nocardioides caeni sp. nov., isolated from wastewater. Int J Syst Evol Microbiol. 2009;59(Pt 11):2794–7.
Kim KH, Roh SW, Chang HW, Nam YD, Yoon JH, Jeon CO, Oh HM, Bae JW. Nocardioides basaltis sp. nov., isolated from black beach sand. Int J Syst Evol Microbiol. 2009;59(Pt 1):42–7.
Kubota M, Kawahara K, Sekiya K, Uchida T, Hattori Y, Futamata H, Hiraishi A. Nocardioides aromaticivorans sp. nov., a dibenzofuran-degrading bacterium isolated from dioxin-polluted environments. Syst Appl Microbiol. 2005;28(2):165–74.
Yoon JH, Kim IG, Kang KH, Oh TK, Park YH. Nocardioides aquiterrae sp. nov., isolated from groundwater in Korea. Int J Syst Evol Microbiol. 2004;54(Pt 1):71–5.
Zhi XY, Li WJ, Stackebrandt E. An update of the structure and 16S rRNA gene sequence-based definition of higher ranks of the class Actinobacteria, with the proposal of two new suborders and four new families and emended descriptions of the existing higher taxa. Int J Syst Evol Micr. 2009;59:589–608.
Barka EA, Vatsa P, Sanchez L, Gaveau-Vaillant N, Jacquard C, Klenk HP, Clement C, Ouhdouch Y, van Wezel GP. Taxonomy, physiology, and natural products of Actinobacteria. Microbiol Mol Biol Rev. 2016;80(1):1–43.
Park HJ, Kim ES. An inducible Streptomyces gene cluster involved in aromatic compound metabolism. FEMS Microbiol Lett. 2003;226(1):151–7.
Hamamura N, Yeager CM, Arp DJ. Two distinct monooxygenases for alkane oxidation in Nocardioides sp. strain CF8. Appl Environ Microbiol. 2001;67(11):4992–8.
Topp E, Mulbry WM, Zhu H, Nour SM, Cuppels D. Characterization of S-triazine herbicide metabolism by a Nocardioides sp isolated from agricultural soils. Appl Environ Microb. 2000;66(8):3134–41.
Iwabuchi T, Inomata-Yamauchi Y, Katsuta A, Harayama S. Isolation and characterization of marine Nocardioides capable of growing and degrading phenanthrene at 42 degrees C. J Mar Biotechnol. 1998;6(2):86–90.
Rajan J, Valli K, Perkins RE, Sariaslani FS, Barns SM, Reysenbach AL, Rehm S, Ehringer M, Pace NR. Mineralization of 2,4,6-trinitrophenol (picric acid): characterization and phylogenetic identification of microbial strains. J Ind Microbiol. 1996;16(5):319–24.
Coleman NV, Mattes TE, Gossett JM, Spain JC. Phylogenetic and kinetic diversity of aerobic vinyl chloride-assimilating bacteria from contaminated sites. Appl Environ Microb. 2002;68(12):6162–71.
Park SC, Baik KS, Kim MS, Chun J, Seong CN. Nocardioides dokdonensis sp nov., an actinomycete isolated from sand sediment. Int J Syst Evol Micr. 2008;58:2619–23.
Choi Y. Open-ocean convection in the Japan (east) sea. La mer. 1996;34:259–72.
Kim YE, Yoon H, Kim M, Nam YJ, Kim H, Seo Y, Lee GM, Ja Kim Y, Kong WS, Kim JG, Seu YB. Metagenomic analysis of bacterial communities on Dokdo Island. J Gen Appl Microbiol. 2014;60(2):65–74.
Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9.
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25(5):955–64.
Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35(9):3100–8.
Grissa I, Vergnaud G, Pourcel C. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 2007;35(Web Server issue):W52–7.
Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011;8(10):785–6.
Sonnhammer EL, von Heijne G, Krogh A. A hidden Markov model for predicting transmembrane helices in protein sequences. Proc Int Conf Intell Syst Mol Biol. 1998;6:175–82.
Weber T, Blin K, Duddela S, Krug D, Kim HU, Bruccoleri R, Lee SY, Fischbach MA, Muller R, Wohlleben W, et al. antiSMASH 3.0-a comprehensive resource for the genome mining of biosynthetic gene clusters. Nucleic Acids Res. 2015;43(W1):W237–43.
Teufel R, Mascaraque V, Ismail W, Voss M, Perera J, Eisenreich W, Haehnel W, Fuchs G. Bacterial phenylalanine and phenylacetate catabolic pathway revealed. Proc Natl Acad Sci U S A. 2010;107(32):14390–5.
Bloch K. Sterol molecule - structure, biosynthesis, and function. Steroids. 1992;57(8):378–83.
Bai C, Schmidt A, Freedman LP. Steroid hormone receptors and drug discovery: therapeutic opportunities and assay designs. Assay Drug Dev Technol. 2003;1(6):843–52.
Hanukoglu I. Steroidogenic enzymes: structure, function, and role in regulation of steroid hormone biosynthesis. J Steroid Biochem Mol Biol. 1992;43(8):779–804.
Van der Geize R, Yam K, Heuser T, Wilbrink MH, Hara H, Anderton MC, Sim E, Dijkhuizen L, Davies JE, Mohn WW, Eltis LD. A gene cluster encoding cholesterol catabolism in a soil actinomycete provides insight into Mycobacterium tuberculosis survival in macrophages. Proc Natl Acad Sci U S A. 2007;104(6):1947–52.
Uhia I, Galan B, Kendall SL, Stoker NG, Garcia JL. Cholesterol metabolism in Mycobacterium smegmatis. Environ Microbiol Rep. 2012;4(2):168–82.
Wilbrink MH, Petrusma M, Dijkhuizen L, van der Geize R. FadD19 of Rhodococcus rhodochrous DSM43269, a steroid-coenzyme a ligase essential for degradation of C-24 branched sterol side chains. Appl Environ Microb. 2011;77(13):4455–64.
Shtratnikova VY, Schelkunov MI, Fokina VV, Pekov YA, Ivashina T, Donova MV. Genome-wide bioinformatics analysis of steroid metabolism-associated genes in Nocardioides simplex VKM ac-2033D. Curr Genet. 2016;62(3):643–56.
Mohn WW, Wilbrink MH, Casabon I, Stewart GR, Liu J, van der Geize R, Eltis LD. Gene cluster encoding cholate catabolism in Rhodococcus spp. J Bacteriol. 2012;194(24):6712–9.
Kendall SL, Burgess P, Balhana R, Withers M, ten Bokum A, Lott JS, Gao C, Uhia-Castro I, Stoker NG. Cholesterol utilization in mycobacteria is controlled by two TetR-type transcriptional regulators: kstR and kstR2. Microbiol-Sgm. 2010;156:1362–71.
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ, Angiuoli SV, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26(5):541–7.
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms - proposal for the domains Archaea, bacteria, and Eucarya. P Natl Acad Sci USA. 1990;87(12):4576–9.
Garrity G, Holt J. The road map to the manual. In: Garrity GM, Boone DR, Castenholz RW, editors. Bergey's manual of systematic bacteriology. Volume 1. Second ed. New York: Springer; 2001. p. 119–69.
Stackebrandt E, Rainey FA, WardRainey NL. Proposal for a new hierarchic classification system, Actinobacteria classis nov. Int J Syst Bacteriol. 1997;47(2):479–91.
Yoon JH, Lee ST, Park YH. Genetic analyses of the genus Nocardioides and related taxa based on 16S-23S rDNA internally transcribed spacer sequences. Int J Syst Bacteriol. 1998;48:641–50.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25–9.
We recognize Ju Yeon Song for her involvement during the early stages of the project.
This work was financially supported by the National Research Foundation (NRF-2014M3C9A3068822 and NRF-2011-0017670) of the Ministry of Science, ICT and Future Planning, Republic of Korea.
The authors declare that they have no competing interests.
Genes associated with steroid metabolism. (XLSX 14 kb)