- Short genome report
- Open Access
Complete genome sequence of new bacteriophage phiE142, which causes simultaneously lysis of multidrug-resistant Escherichia coli O157:H7 and Salmonella enterica
Standards in Genomic Sciencesvolume 11, Article number: 89 (2016)
The emergence of antibiotic-resistant foodborne bacteria is a global health problem that requires immediate attention. Bacteriophages are a promising biotechnological alternative approach against bacterial pathogens. However, a detailed analysis of phage genomes is essential to assess the safety of the phages prior to their use as biocontrol agents. Therefore, here we report the complete genome sequence of bacteriophage phiE142, which is able to lyse Salmonella and multidrug-resistant Escherichia coli O157:H7 strains. Bacteriophage phiE142 belongs to the Myoviridae family due to the presence of long non-flexible tail and icosahedral head. The genome is composed of 121,442 bp and contains 194 ORFs, and 2 tRNAs. Furthermore, the phiE142 genome does not contain any genes coding for food-borne allergens, antibiotics resistance, virulence factors, or associated with lysogenic conversion. The bacteriophage phiE142 is characterized by broad host range and compelling genetic attributes making them potential candidates as a biocontrol agent.
Foodborne diseases are an important cause of morbidity and mortality worldwide, therefore are a serious public health problem . Bacteria cause the majorities of foodborne illnesses; Escherichia coli and Salmonella are among the most common foodborne pathogens that affect millions of people annually . Furthermore, the emergence of antimicrobial resistance E. coli and Salmonella strains makes more difficult its control . Hence, novel control methods for reducing the risk of bacterial food contamination, which are both environmental friendly, are urgently needed.
In this context, bacteriophages have several potential applications in the food industry; these killing-bacteria viruses are alternatives to conventional antimicrobials method for the control of pathogenic bacteria and have great potential in the improvement of food safety [4–6]. Bacteriophages suitable for biocontrol purposes must be genetically sequenced to ensure that are strictly lytic (always lyse infected cells host), does not encode any bacterial virulence factors or proteins with a potential to cause allergenicity [7, 8].
The primary aim of our research group is increase knowledge of phage biodiversity and contribute to the understanding of different types of phages in several regions of Sinaloa, an important agricultural region in Northwestern Mexico. Recently, a new bacteriophage, designated as phiE142, one of phages isolated, exhibits a high potential as a biocontrol agent . However, information about genome of phage phiE142 is still limited; therefore, to further understand the phage biology, the genome was sequenced.
Classification and features
The bacteriophage phiE142 was previously isolated in Food and Environmental Microbiology Laboratory at the Research Center for Food and Development from animal feces samples collected on a farm in Northwestern Mexico. An E. coli strain EC-48 (bacterial used for bacteriophage propagation and titration), was also isolated from the same geographical region two years before the isolation of the phage . Phage phiE142 produced clear plaques of 2 to 3 mm in diameter on the E. coli EC-48 lawn; the plaques were already visible after four to six hours of incubation time at 37 °C.
We analyzed the lytic host range of phage using spot tests assays of different bacterial, including 48 Salmonella strains and 33 E. coli strains (Additional file 1: Table S1). Based upon spot testing results, the phage phiE142 had lytic activity against 76% of the E. coli strains and 29% of Salmonella strains tested. These results indicate that bacteriophage phiE142 has the potential to be evaluated as an alternative strategy to biocontrol of E. coli and Salmonella .
The phiE142 phage was stained with 2% uranyl acetate and examined by transmission electron microscopy (TEM) and classified into its appropriate viral morphotype according to Ackermann’s classification . The analysis suggests that phage phiE142 belongs to the order Caudovirales and family Myoviridae based on the presence of almost isometric head with an average diameter of ∼ 58 nm, long non-flexible contractile tail about 120 nm in length (Fig. 1) . Phage phiE142 has a genome of 121,442 bp, with a coding region of 94.4%, GC content of 37.4%, and the gene density is 1.60. It contains 194 coding sequences ranging from 102 bp to 3,300 bp, with 53 genes on the positive strand and 141 genes on the negative strand. Phylogenetic characteristics of this phage are indicated in Table 1.
The sequence of DNA polymerase has become a commonly-used marker for constructing phylogenetic analysis, therefore the phylogenetic tree was performed based of DNA polymerase deduced amino acid sequences. According to the phylogenetic tree, the phage phiE142 and others eight phages that infect the bacterial family Enterobacteriaceae were clustered in the same group (Figs. 2 and 3). All of these phages are members of the Tevenvirinae subfamily and are strictly lytic (Based on PHACTS program server). Considering the close relationship among these phages, it is likely that phiE142 also belongs to this genus. This result confirms the findings obtained by electron microscopy.
Genome sequencing information
Genome project history
The bacteriophage phiE142 is one of the first genome to be completely sequenced publicly available for a phage infecting E. coli and Salmonella strains isolated from environmental sources in Northwest Mexico. The analysis of more genomes of bacteriophages is necessary to increase our understanding of the genetic diversity of bacteriophages, phage biology, basic molecular mechanisms, and provide a deeper insight into the relationship of phages with their hosts. Furthermore, analysis of phage genomes may reveal novel antimicrobial peptides and enzymes with bactericidal activity. In addition, the genome well understood is an essential requisite to ensure the safety of the phages prior to their use as biocontrol agents. Therefore, the genome project was deposited in the Genomes On Line Database (GOLD). The genome sequence of bacteriophage phiE142 was deposited in GenBank under accession number KU255730. The summary of genome project is available in the Table 2.
Growth conditions and genomic DNA preparation
Standard double-layer agar plate method was used to obtain high-titer stocks of the phage phiE142 , with some modifications. Briefly, 100 μl of phage stock and 1 ml of overnight culture of E. coli strain EC-48 were mixed with 3 ml TSB with 0.4% agarose, spread on TSA plates, and incubated overnight at 37 °C. After, phage was subsequently collected by adding 6 ml of SM buffer (50 mM Tris-HCl, pH 7.5, 0.1 M NaCl, 8 mM MgSO4, 0.01% gelatin) to the surface of each plate and the soft agar was scraped off the surface of the agar plates. Cell debris was removed by subsequent centrifugation at 5,500 × g for 10 min, the supernatant was filtered with 0.22 μm syringe filters, and phage particles were precipitated by centrifugation at 40,000 × g at 4 ° C for 2 h. The phage pellet was suspended in SM buffer and stored at 4°C. Bacteriophage DNA was isolated by the method of proteinase K and phenol–chloroform as previously described , with minor modifications. One milliliter of purified phage suspension was treated with 1 μg/ml of DNaseI and RNaseA (Sigma-Aldrich) at 37 °C for 1 h. Subsequently, sodium dodecyl sulfate (final concentration, 0.5%), EDTA (20 mM, pH 8.0), and proteinase K (final concentration, 25 μg/ml) were added, and the suspension was incubated at 56 °C for 1 h. After proteins were removed by an equal volume of phenol-chloroform (1:1), and DNA was precipitated from the aqueous phase by cold ethanol. Following centrifugation at 15, 000 × g for 15 min at 4 °C, the pellet was washed twice with 70% ethanol, centrifuged at the same conditions. Finally, the dried DNA pellet was suspended in nuclease-free water. Concentration of phage DNA was estimated with a NanoDrop spectrophotometer (Thermo Fisher Scientific, Wilmington, DE) and also the quality of extracted DNA was also tested visually with electrophoresis on a 1% agarose.
Genome sequencing and assembly
High-throughput DNA Sequencing of phage genomic DNA was performed using HiSeq 2000 technology (Illumina) to produce 100 bp paired-end reads, library construction and sequencing were performed according to the manufacturer’s instructions. In total, about 18 million pair reads of 100 bases in length were obtained with a quality filter threshold of Q30. The reads were analyzed and quality checked using FastQC and Geneious software package R8 (Biomatters Ltd., New Zealand) was used to trim raw reads with a low quality score. The de novo assembly was conducted with Velvet (implemented in Geneious, running VelvetOptimiser for selection of k-mer), resulting in one final contig with coverage from approximately 10,000-fold. Additional manual functional annotation and genome map was performed using Geneious software.
Open reading frames (ORFs) were identified using Glimmer 3.02 , GeneMark.hmm , and ORF Finder . The putative functions of the ORFs were analyzed by protein BLASTp searches, with a cut off E value of 10−4. Predicted protein sequences were analyzed against InterProScan , Pfam  and TMHMM Server version 2.0  for conservative domain identification. Signal peptides were predicted using SignalP 4.1. The search of putative tRNA encoding genes was done using ARAGORN  and tRNAscan-SE . The origin of replication was predicted using a GC-skew plot generated by GenSkew . Moreover, all identified ORFs were compared against the virulence factor database  and the ResFinder database . Additionally, the predicted phage protein sequences were searched to identify proteins potentially allergenic using tools from the Food Allergy Research and Resource Programme . The lifestyle of the phages was predicted using the PHACTS program . Whole genome comparisons were carried out using Mauve .
The detailed annotation information for phage genome was summarized in Table 3. The phage has a DNA genome consisting of 121,442 bp with a GC content of 37.4%, which is significantly lower than that of the host E. coli (about 50% GC). Genome analysis of the phage revealed 194 putative open reading frames (94.4% of the genome consists of a coding region), with 26 oriented in a forward orientation and 168 in a reverse orientation, and two tRNA genes were identified. Based on BLAST results, functions were assigned to 95 of the genes; most of the annotated genes (98 genes) were hypothetical proteins, probably due to the enormous diversity of bacteriophages and the insufficient database information about the functional genes of phage. Only one gene product is hypothetical novel proteins (Additional file 2: Table S2). The distribution of the ORFs into COG functional categories is provided in Table 4.
Insights from the genome sequence
The results of BLAST revealed that the genome of phage phiE142 has a high similarity (query coverage, 94%; identity, 97%) with coliphage vB_EcoM_PhAPEC2, which belong to the Tevenvirinae subfamily of the genus T4-like viruses, an observation that is consistent with the analysis of the DNA polymerase. We therefore concluded that phiE142, based on sequence similarity, belong to the Tevenvirinae subfamily. However, some differences in genome organization were observed, because progressive Mauve genome alignment revealed one colinear block that is in the different order in both bacteriophages (Additional file 3: Figure S3). The principle region of genomic dissimilarity was located between 110,000 pb and 121,000 pb, this region includes a set of ORFs found to be associated with phage-host recognition, suggesting specific features of phage evolution.
The phiE142 genome is functionally organized into four modules containing gene clusters for virion morphogenesis, DNA replication/regulation, DNA packaging, and host cell lysis. This modular organization of the genome is typical of bacteriophages.
Thirty-one ORFs were found to encode proteins involved in the morphogenesis of virions. These include the ORFs 1–3, 170, 172, 175–185, and 187–194, which are proposed to be genes encoding the components of the tail fiber and baseplate. Databases homology searches suggested that ORFs encoding capsid protein are 46, 139, 142, and 174. Additionally, the proteins encoded by ORFs 185 and 186 are most similar in its amino acid sequence to neck protein.
Overall, a total of 46 ORFs are associated with processing of the viral DNA. Our analysis of the phage genomes reveals several genes potentially involved in nucleotide metabolism, including ORFs 14–15, 38–39, 47, 64, 70, 96, 100–101, 125, and 171. In addition, genes that encode proteins involved in replication and transcription of its own DNA were identified in ORFs 5, 7, 12–13, 18, 20–21, 24–25, 28–29, 32, 34–35, 37, 49, 56, 59, 61, 66, 71, 73–76, 78, 81, 86, 102, 106, 130, 132, 141, 144, and 173.
Two ORFs exhibit similarity to a gene involved in the host cell lysis, including endolysin and holin. The protein encoded by ORF 143 displays a high degree of identity with the endolysin. This ORF contained one glycohydrolase domain (hydrolyse the beta-1,4-glycosidic bond between N-acetylmuramic acid and N-acetylglucosamine), which indicates that this protein is probably an enzyme that degrades peptidoglycan. While the putative protein of ORF 4 was identified as a holin protein. Unusually, this ORF is not located adjacent to the endolysin ORF, in most genomes bacteriophages, the holin ORF is adjacent or overlaps a ORF encoding an endolysin. The deduced holin encoded by phiE142 phage has one putative transmembrane domain, and thus resembles class III holins.
The phage lifestyle prediction result of PHACTS indicated that the phiE142 is a virulent phage, consistent with the results of genomic analysis, which revealed the absence of genes associated with the establishment and maintenance of lysogenic cycle.
The DNA packaging module includes ORF 60, which encode the putative portal protein. However, it was not possible to identify the terminase subunits.
Our data suggest that phiE142 is a member of T4-like virus genus of the Myoviridae family and the Tevenvirinae subfamily. Interestingly, in silico analyses of phiE142 genome did not exhibit homology to known virulence-associated genes, genes involved in lysogeny nor to antibiotic resistance genes or potential immunoreactive allergens. These results indicate that phage phiE142 exhibits genetics properties suitable for evaluation as a biocontrol agent.
Genomes On Line Database
Open reading frames
Phage Classification Tool Set
Transmission electron microscopy
Tryptic soy agar
Tryptic soy broth
Torgerson PR, de Silva NR, Fèvre EM, Kasuga F, Rokni MB, Zhou X-N, et al. The global burden of foodborne parasitic diseases: An update. Trends Parasitol. 2014;30:20–6. Available at: http://www.ncbi.nlm.nih.gov/pubmed/24314578.
Ahmed A, Shimamoto T. Isolation and molecular characterization of Salmonella enterica, Escherichia coli O157:H7 and Shigella spp. from meat and dairy products in Egypt. Int J Food Microbiol. 2013;168:57–62. Available at: http://www.ncbi.nlm.nih.gov/pubmed/24239976.
Johannessen GS, Eckner KF, Heiberg N, Monshaugen M, Begum M, Økland M, et al. Occurrence of Escherichia coli, Campylobacter, Salmonella and Shiga-Toxin producing E. coli in Norwegian primary strawberry production. Int J Environ Res Public Health. 2015;12:6919–32.
Ghasemi SM, Bouzari M, Emtiazi G. Preliminary characterization of Lactococcus garvieae bacteriophage isolated from wastewater as a potential agent for biological control of lactococcosis in aquaculture. Aquacult Int. 2014;22:1469–80. Available at: http://www.mdpi.com/1660-4601/12/6/6919.
Carlton R, Noordman W, Biswas B, de Meester ED, Loessner M. Bacteriophage P100 for control of Listeria monocytogenes in foods: Genome sequence, bioinformatic analyses, oral toxicity study, and application. Regul Toxicol Pharmacol. 2005;43:301–12. Available at: http://www.ncbi.nlm.nih.gov/pubmed/16188359.
Hudson J, Billington C, Wilson T, On S. Effect of phage and host concentration on the inactivation of Escherichia coli O157: H7 on cooked and raw beef. Food Sci Technol Int. 2013;21:104–9. Available at: http://www.ncbi.nlm.nih.gov/pubmed/24285831.
Hagens S, Loessner MJ. Application of bacteriophages for detection and control of foodborne pathogens. Appl Microbiol Biotechnol. 2007;76:513–9. Available at: http://www.ncbi.nlm.nih.gov/pubmed/17554535.
Hungaro HM, Mendonça RCS, Gouvêa DM, Vanetti MCD, de Oliveira PCL. Use of bacteriophages to reduce in chicken skin in comparison with chemical agents. Food Res Int. 2013;52:75–81. Available at: http://www.sciencedirect.com/science/article/pii/S0963996913001373.
CastrodelCampo N, Amarillas Bueno LA, García Camarena MG, Chaidez Quiroz C, León Félix J, Martínez Rodríguez CI. Presencia de Salmonella y Escherichia coli O157:H7 en la zona centro del estado de Sinaloa y su control biológico mediante el uso de bacteriófagos [abstract no. C39], XIII Congreso Internacional de Inocuidad de Alimentos. 2011. p. 165–8. Available at: http://sistemanodalsinaloa.gob.mx/archivoscomprobatorios/_14_resumeneventoscientificos/1034.pdf.
Amézquita-López B, Quiñones B, Cooley M, León-Félix J, Campo C, Mandrell R, et al. Genotypic analyses of Shiga toxin-producing Escherichia coli O157 and non-O157 recovered from feces of domestic animals on rural farms in Mexico. PLoS One. 2012;7:e51565. Available at: http://www.ncbi.nlm.nih.gov/pubmed/23251577.
Ackermann H-W. Phage classification and characterization. In: Clokie MRJ, Kropinski A, editors. Methods in Molecular Biology. ᅟ: Springer Science + Business Media; 2009. p. 127–40. Available at: http://www.springer.com/us/book/9781588296825.
King AMQ, Adams MJ, Carstens EB, Lefkowitz EJ. Virus taxonomy: classification and nomenclature of viruses: ninth report of the international committee on taxonomy of viruses. San Diego: Elsevier Academic Press; 2012. p. 855–80.
Carey-Smith G, Billington C, Cornelius A, Hudson J, Heinemann J. Isolation and characterization of bacteriophages infecting Salmonella spp. FEMS Microbiol Lett. 2006;258:182–6. Available at: http://www.ncbi.nlm.nih.gov/pubmed/16640570.
Sambrook J, Russell DW. Molecular Cloning: A laboratory manual. 3rd ed. New York: Cold Spring Harbor Laboratory Press; 2001.
Delcher AL, Bratke KA, Powers EC, Salzberg SL. Identifying bacterial genes and endosymbiont DNA with glimmer. Bioinformatics. 2007;23:673–9. Available at: http://www.ncbi.nlm.nih.gov/pubmed/17237039.
Besemer J, Lomsadze A, Borodovsky M. GeneMarkS: A self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res. 2001;29:2607–18. Available at: http://www.ncbi.nlm.nih.gov/pubmed/11410670.
Rombel IT, Sykes KF, Rayner S, Johnston SA. ORF-FINDER: A vector for high-throughput gene identification. Gene. 2002;282:33–41. Available at: http://www.ncbi.nlm.nih.gov/pubmed/11814675.
Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R. InterProScan: Protein domains identifier. Nucleic Acids Res. 2005;33:116–20. Available at: http://nar.oxfordjournals.org/content/33/suppl_2/W116.full.
Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, Heger A, Hetherington K, Holm L, Mistry J, Sonnhammer ELL, Tate J, Punta M. The Pfam protein families database. Nucleic Acids Res. 2014;42:D222–30. Available at: http://nar.oxfordjournals.org/content/42/D1/D222.long.
Krogh A, Larsson B, von Heijne G, Sonnhammer EL. Predicting transmembrane protein topology with a hidden markov model: Application to complete genomes. J Mol Biol. 2001;305:567–80. Available at: http://www.ncbi.nlm.nih.gov/pubmed/11152613.
Laslett D, Canback B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 2004;32:11–6. Available at: http://www.ncbi.nlm.nih.gov/pubmed/14704338.
Lowe TM, Eddy SR. TRNAscan-sE: A program for improved detection of transfer RNA genes in Genomic sequence. Nucleic Acids Res. 1997;25:955–64. Available at: http://nar.oxfordjournals.org/content/25/5/0955.full.
GenSkew – visualization of nucleotide skew in genome sequences. http://mips.gsf.de/services/analysis/genskew.
Chen L, Xiong Z, Sun L, Yang J, Jin Q. VFDB 2012 update: Toward the genetic diversity and molecular evolution of bacterial virulence factors. Nucleic Acids Res. 2011;40:D641-5. Available at: http://www.ncbi.nlm.nih.gov/pubmed/22067448.
Kleinheinz KA, Joensen KG, Larsen MV. Applying the ResFinder and VirulenceFinder web-services for easy identification of acquired antibiotic resistance and E. coli virulence genes in bacteriophage and prophage nucleotide sequences. Bacteriophage. 2014;4:e27943. Available at: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3926868/.
Food Allergy Research and Resource Programme (FARRP). http://www.allergenonline.com.
McNair K, Bailey BA, Edwards RA. PHACTS, a computational approach to classifying the lifestyle of phages. Bioinformatics. 2012;28:614–8. Available at: http://www.ncbi.nlm.nih.gov/pubmed/22238260.
Darling AE, Mau B, Perna NT. Progressive Mauve: Multiple genome alignment with gene gain, loss and rearrangement. PLoS One. 2010;5:e11147. Available at: http://journals.plos.org/plosone/article?id=10.1371%2Fjournal.pone.0011147.
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26:541–7. Available at: http://www.ncbi.nlm.nih.gov/pubmed/18464787.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: Tool for the unification of biology. Nat Genet. 2000;25:25–9. Available at: http://www.ncbi.nlm.nih.gov/pubmed/10802651.
The support from the Fundación Produce Sinaloa is gratefully acknowledged. The authors thank the National Food Safety Research Laboratory (LANIIA) at the Research Center for Food and Development (CIAD, Mexico) for providing laboratory facilities during the research. We thank Dr. Mitzi Estrada Acosta for her assistance with data presentation. The authors would like to acknowledge the technical assistance of QFB Lucía Margarita Rubí Rangel and QFB Jesús Héctor Carrillo Yáñez.
LA analyzed the genome sequence and participated in the sequence alignment and drafted the manuscript. JLF conceived of the study, and participated in its design and coordination and helped to draft the manuscript. CC participated in the design of the study and helped to revise the manuscript. Transmission electron microscopy examinations were done by AGR. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Bacterial strains used in the host range spectrum of the bacteriophage phiE142. Phage was assessed for host range by spot testing. (+) indicate positive sensitivity to phage lysis, and (-) indicate negative sensitivity to phage lysis. (DOCX 41 kb)
Predicted open reading frames (ORFs) of phiE142 and predicted database matches (DOCX 60 kb)
Comparison of genome sequence of bacteriophages phiE142 and vB_EcoM_PhAPEC2. The comparison was carried out with progressive MAUVE. (JPG 177 kb)