Meeting Report: Fungal ITS Workshop (October 2012)

Bates, Scott T.; Ahrendt, Steven; Bik, Holly M.; Bruns, Thomas D.; Caporaso, J. Gregory; Cole, James; Dwan, Michael; Fierer, Noah; Gu, Dai; Houston, Shawn; Knight, Rob; Leff, Jon; Lewis, Christopher; Maestre, Juan P.; McDonald, Daniel; Nilsson, R. Henrik; Porras-Alfaro, Andrea; Robert, Vincent; Schoch, Conrad; Scott, James; Taylor, D. Lee; Parfrey, Laura Wegener; Stajich, Jason E.

doi:10.4056/sigs.3737409

Open access
Published: 15 April 2013

Meeting Report: Fungal ITS Workshop (October 2012)

Scott T. Bates¹,
Steven Ahrendt²,
Holly M. Bik³,
Thomas D. Bruns⁴,
J. Gregory Caporaso^5,6,
James Cole⁷,
Michael Dwan⁸,
Noah Fierer¹,
Dai Gu²,
Shawn Houston⁹,
Rob Knight^10,11,
Jon Leff¹,
Christopher Lewis¹²,
Juan P. Maestre¹³,
Daniel McDonald¹⁴,
R. Henrik Nilsson¹⁵,
Andrea Porras-Alfaro¹⁶,
Vincent Robert¹⁷,
Conrad Schoch¹⁸,
James Scott¹⁹,
D. Lee Taylor²⁰,
Laura Wegener Parfrey¹⁰ &
…
Jason E. Stajich²

Standards in Genomic Sciences volume 8, pages 118–123 (2013)Cite this article

1203 Accesses
29 Citations
6 Altmetric
Metrics details

Abstract

This report summarizes a meeting held in Boulder, CO USA (19–20 October 2012) on fungal community analyses using ultra-high-throughput sequencing of the internal transcribed spacer (ITS) region of the nuclear ribosomal RNA (rRNA) genes. The meeting was organized as a two-day workshop, with the primary goal of supporting collaboration among researchers for improving fungal ITS sequence resources and developing recommendations for standard ITS primers for the research community.

ITS sequence database for fungal community analyses

Sequencing-based techniques have allowed characterization of microbial communities from environmental samples without relying on cultivation. Because of improvements in these techniques and of methods required to analyze the resulting datasets, microbial ecology has been advancing rapidly: microbial diversity can be surveyed to an extent previously unimaginable. The internal transcribed spacer (ITS) and other ribosomal RNA gene sequence regions (“rRNA” in the rest of this document) have been used successfully to profile fungal communities using Sanger [1] and 454 [2] sequencing. The arrival of ultra-high-throughput sequencing platforms [3] promise to offer new insights into the diversity and ecology of fungal communities, yet few studies of fungal communities have employed this technology successfully. Progress has been hampered, in part, by the lack of a high-quality reference database of fungal sequences for the ITS region of the rRNA operon [4], which is now the most widely sequenced DNA region in fungi [5] and is the marker of choice for molecular identification of most fungal taxa [6].

For Bacteria and Archaea, where the ribosomal small subunit (SSU/16S) gene is the primary marker in environmental sequencing, efforts have been made to improve the quality of the public reference sequence datasets, including GreenGenes [7] and RDP [8]. The same is true for more general SSU/large subunit (LSU) rRNA gene sequence databases, such as SILVA, which includes all three domains of life [9]. For fungi, a “hand-curated” LSU sequence reference set is currently available, and work is underway to apply similar methods to improve the ITS database [10]. Because sequencing technologies continue to improve, the number of ITS sequences in primary sequence repositories such as INSDC will steadily increase, and quality control via hand-curation for specialized, publically available rRNA gene sequence databases will not be sustainable.

Primary sequence repositories are already experiencing explosive growth in the number of unidentified environmental fungal ITS sequences [11], yet these sequences will be of limited use in improving our overall understanding of fungal diversity unless they are properly identified and can be placed within a phylogenetic context. When sufficiently closely related sequences exist, environmental sequences can be placed within a phylogenetic context today simply by aligning with related sequences and constructing trees. However, because ITS evolves rapidly, constructing phylogenies that span large taxonomic ranges remains extremely challenging. An even more important problem is that of misidentified sequences (environmental sequences included) currently in public databases. These can lead to erroneous placement of unknowns, even if tree-based approaches are used. Therefore, identifying these errors and re-annotating in an automated fashion is a critical challenge, especially for extremely large datasets where manual phylogenetic analysis is not feasible due to the presence of millions of sequence reads that correspond to “unknown” operational taxonomic units (OTUs). Consequently, clean reference databases as well as automated phylogenetic assignment and analysis methods are critical needs.

Purposes of the Meeting

This meeting was organized in order to:

Facilitate communication, potential data exchange, and collaboration with the aim of improving fungal ITS sequence resources for the research community.
Identify suitable ITS primers for fungal community analyses using ultra-high-throughput sequencing.
Develop strategies for automated (and manual) database curation as well as the naming of environmental sequences and OTUs at various levels of resolution.
Establish a sustainable plan for reference database development and maintenance.

Participants

The meeting participants included researchers representing publicly available databases that contain microbial sequence data (e.g., GenBank, GreenGenes, RDP, SILVA) or fungi-specific resources (e.g., MycoBank, UNITE), as well as researchers currently using ultra-high-throughput sequencing to examine fungal communities or those involved in developing software, such as QIIME [12] and PhyloSift [13], to facilitate such studies.

Activities

The meeting was conducted as a two-day workshop. The first day was devoted primarily to brief presentations by participants outlining their involvement in curating public sequence databases, developing high-throughput sequencing pipelines, or using ultra-high-throughput sequencing to examine fungal diversity in environmental samples (e.g., air or soil). The presentations are available online [14]. The second day focused on discussions related to the assembly of a high-quality reference database of fungal ITS sequences, selecting ITS primers suitable for ultra-high-throughput sequencing, as well as methods to link ITS sequences to the fungal phylogeny for automated curation, quality control, and phylogeny-based community analysis methods.

Conclusions / Outcomes

Ultra-high-throughput sequence processing/analytical pipelines, such as those implemented in QIIME, rely on de-replication of large sequence datasets through clustering for the creation of reference sequence sets that can be used to assist in the recognition of OTUs from environmental samples. The meeting participants largely supported the use of the UNITE database [15,16] as a focal point for the development of high quality fungal ITS reference sequence sets. UNITE currently has implemented several desirable features for this task, including:

A comprehensive set of approximately 300,000 fungal ITS sequences extracted from public databases.
An annotation management system (PlutoF) that allows qualified third-party users to add pertinent metadata (e.g., on ecology or geography), improve the taxonomic resolution, tag problematic entries, or correct misidentifications for sequences in the UNITE database.
Global Key Annotations that permit visualization of sequences clustered at a range of similarity levels (99–97%), with the sequences representing each OTU in the cluster depicted in an alignment.
Cluster centroids selected with a preference toward using sequences that are reliable (e.g., generated from trustworthy sources such as the Assembling the Fungal Tree of Life project or hand-picked by taxonomic experts), taxonomically informative (e.g., identified to the species-level), or that have particular relevance for taxonomy/nomenclature (e.g., sequences from type specimens).
Improvements on the horizon (slated for early 2013) included labeling sequences representing cluster centroids that will allow these unique sequences to be tracked through time as clusters change, as well as options for downloading cluster centroid sequence sets for different sequence similarity levels.

With the availability of these cluster centroids, reference sequence sets for ultra-high-throughput pipelines can be directly generated from the UNITE database in a rapid manner. The meeting allowed for coordination that resulted in the creation of an alpha version of the UNITE reference set to facilitate OTU picking and taxonomic assignment for fungal ITS sequence reads generated in high-/ultra-high-throughput sequencing runs. This reference set is now publically available on the QIIME website [12,17].

UNITE currently provides taxonomic strings based on classification schema culled from fungal taxonomic resources such as Index Fungorum [18] and MycoBank [19], which are comprehensive databases for fungal names that offer expertise and resources for improving the quality and availability of fungal taxonomic information.

Journals publishing novel fungal taxa now typically require authors to register new names in MycoBank, which in turn is encouraging submission of informative DNA sequences, such as the ITS region, associated with new taxa to the public databases. In addition to acting as sequence vouchers for type material, these data also have the potential to inform molecular studies examining environmental samples.

Synergistic collaboration and the flow of information between, and within, online taxonomic resources and the public sequence databases (that have expressed interest in using a global standardized taxonomy) were seen by meeting participants as being highly desirable. The integration of fungal taxonomy and phylogeny was deemed another important consideration. The Assembling the Fungal Tree of Life (AFToL) project made considerable progress toward refining our understanding of the fungal phylogeny, which informed taxonomy for the kingdom [20]. Sequences generated under AFToL represent reliable data that are desirable for cluster centroids in the fungal reference sets. New projects, such as the Open Tree of Life [21], hold potential as phylogeny-based taxonomic resources for reference databases, while others, such as Fungal Barcoding [22], promise to produce additional high-quality sequences across a wide range of fungal groups for improving the reference datasets.

Although the ITS marker allows fungal sequences to be resolved at the genus- or species-level, aligning ITS sequences across a wide taxonomic range is essentially unworkable. Thus, meeting participants also discussed strategies for anchoring fungal ITS sequences to the broader phylogeny (e.g., one constrained to the AFToL backbone) using SSU or LSU sequences having contiguous ITS reads. Given that for fungi SSU (18S) is typically uninformative below the order- or family-level, LSU (26/28S) was seen as an appropriate marker for this task. The large subunit rRNA gene was considered desirable, as it has been extensively used in fungal phylogenetic studies, it allows for accurate placement in the phylogeny both at higher and lower taxonomic ranks (e.g., phylum/class and family/genus, respectively), and many reliable AFToL sequences (spanning both LSU and the ITS region) are currently available for this purpose [10,23]. Other reliable sequence sources for full-length LSU/ITS reads were identified from complete genomes, such as fungal genomes project [24]. With the anchor tree established, fungal ITS reads produced by ultra-high-throughput sequencing can potentially be placed within a phylogenetic context for taxonomic validation, improving the taxonomy associated with unknown environmental sequences, identifying and naming novel environmental groups, as well as for other automated curation tasks. Linking ITS reads to the fungal phylogeny will also allow for phylogenetic metrics of community distances (e.g., UniFrac) [25] to be used in beta-diversity analyses.

Two talks on the first day of the workshop presented preliminary data from ultra-high-throughput sequence surveys of soil fungi targeting the ITS1 region using the primer pairs ITS1-F/ITS2 [see 26]. Due to spliceosomal inserts known to exist toward the 3′ end of SSU rRNA gene that could interfere with priming sites and cause biases against groups of fungi (e.g., Helotiales) where such inserts exits, participants expressed a preference for using primers for the ITS2 region, for which such spliceosomal inserts are not known. Additional reasons for adopting the use of the ITS2 region marker include its close proximity to LSU (e.g., for anchoring to the phylogeny, see above), less variation in read length compared to ITS1, and the availability of data on ITS2 secondary structure [27,28] that can inform sequence alignments. With continually improving read lengths of ultra-high-throughput sequencing platforms, full length ITS/LSU reads may be possible in the near future. Primers targeting the ITS2 region have been identified, and are currently being tested for use in ultra-high-throughput sequencing. Recommendations for fungal ITS2 as well as current versions of the UNITE centroid reference sequence sets will be available on the QIIME website [12] in the coming year.

References

O’Brien HE, Parrent JL, Jackson JA, Moncalvo JM, Vilgalys R. Fungal community analysis by large-scale sequencing of environmental samples. Appl Environ Microbiol 2005; 71:5544–5550. PubMed http://dx.doi.org/10.1128/AEM.7L9.5544-5550.2005
Article PubMed Central PubMed Google Scholar
Jumpponen A, Jones KL. Massively parallel 454 sequencing indicates hyperdiverse fungal communities in temperate Quercus macrocarpa phyllosphere. New Phytol 2009; 184:438–448. PubMed http://dx.doi.org/10.1111/j.1469-8137.2009.02990.x
Article CAS PubMed Google Scholar
Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Huntley J, Fierer N, Betley J, Fraser L, Bauer M, Gormley N, et al. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J 2012; 6:1621–1624. PubMed http://dx.doi.org/10.1038/ismej.2012.8
Article PubMed Central CAS PubMed Google Scholar
Nilsson RH, Tedersoo L, Abarenkov K, Ryberg M, Kristiansson E, Hartmann M, Schoch CL, Nylander JAA, Bergsten J, Porter TM, et al. Five simple guidelines for establishing basic authenticity and reliability of newly generated fungal ITS sequences. MycoKeys 2012; 4:37–63. http://dx.doi.org/10.3897/mycokeys.4.3606
Article Google Scholar
Peay KG, Kennedy PK, Bruns TD. Fungal community ecology: a hybrid beast with a molecular master. Bioscience 2008; 58:799–810. http://dx.doi.org/10.1641/B580907
Article Google Scholar
Schoch CL, Seifert KA, Huhndorf S, Robert V, Spouge JL, Levesque CA, Chen W. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for fungi. Proc Natl Acad Sci USA 2012; 109:6241–6246. PubMed http://dx.doi.org/10.1073/pnas.1117018109
Article PubMed Central CAS PubMed Google Scholar
McDonald D, Price MN, Goodrich J, Nawrocki EP, DeSantis TZ, Probst A, Andersen GL, Knight R, Hugenholtz P. An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea. ISME J 2012; 6:610–618. PubMed http://dx.doi.org/10.1038/ismej.2011.139
Article PubMed Central CAS PubMed Google Scholar
Cole JR, Wang Q, Cardenas E, Fish J, Chai B, Farris RJ, Kulam-Syed-Mohideen AS, McGarrell DM, Marsh T, Garrity GM, Tiedje JM. The Ribosomal Database Project: improved alignments and new tools for rRNA analysis. Nucleic Acids Res 2009; 37:D141–D145. PubMed http://dx.doi.org/10.1093/nar/gkn879
Article PubMed Central CAS PubMed Google Scholar
Pruesse E, Quast C, Knittel K, Fuchs BM, Ludwig W, Peplies J, Glöckner FO. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res 2007; 35:7188–7196. PubMed http://dx.doi.org/10.1093/nar/gkm864
Article PubMed Central CAS PubMed Google Scholar
Liu KL, Kuske CR, Porras-Alfaro A, Eichorst S, Xie G. Accurate, rapid taxonomic classification of fungal large subunit rRNA genes. Appl Environ Microbiol 2012; 78:1523–1533. PubMed http://dx.doi.org/10.1128/AEM.06826-11
Article PubMed Central CAS PubMed Google Scholar
Hibbett DS, Ohman A, Glotzer D, Nuhn M, Kirk P, Nilsson RH. Progress in molecular and morphological taxon discovery in Fungi and options for formal classification of environmental sequences. Fungal Biol Rev 2011; 25:38–47. http://dx.doi.org/10.1016/j.fbr.2011.01.001
Article Google Scholar
QIIME. http://qiime.org
PhyloSift. http://phylosift.wordpress.com
Fungal ITS workshop
Abarenkov K, Nilsson RH, Larsson KH, Alexander IJ, Eberhardt U, Erland S, Høiland K, Kjøller R, Larsson E, Pennanen T, et al. The UNITE database for molecular identification of fungi: recent updates and future perspectives. New Phytol 2010; 186:281–285. PubMed http://dx.doi.org/10.1111/j.14698137.2009.03160.x
Article PubMed Google Scholar
UNITE database. http://unite.ut.ee
http://qiime.org/home_static/dataFiles.html
Index Fungorum. http://www.indexfungorum.org
MycoBank. http://www.mycobank.org
Hibbett DS, Binder M, Bischoff JF, Blackwell M, Cannon PF, Eriksson OE, Huhndorf S, James T, Kirk PM, Lücking R, et al. A higher-level phylogenetic classification of the Fungi. Mycol Res 2007; 111:509–547. PubMed http://dx.doi.org/10.1016/j.mycres.2007.03.004
Article PubMed Google Scholar
Open Tree of Life.http://opentreeoflife.org
Fungal Barcoding.http://www.fungalbarcoding.org
Bruns TD, Vilgalys R, Barnes SM, Gonzalez D, Hibbett DS, Lane DJ, Simon L, Stickel S, Szaro TM, Weisburg WG, Sogin ML. Evolutionary relationships within the fungi: analyses of nuclear small subunit rRNA sequences. Mol Phylogenet Evol 1992; 1:231–241. PubMed http://dx.doi.org/10.1016/1055-7903(92)90020-H
Article CAS PubMed Google Scholar
1000 Fungal Genoms Project. http://1000.fungalgenomes.org
Lozupone C, Hamady M, Knight R. UniFrac: An Online Tool for Comparing Microbial Community Diversity in a Phylogenetic Context. BMC Bioinformatics 2006; 7:371. PubMed http://dx.doi.org/10.1186/1471-2105-7-371
Article PubMed Central PubMed Google Scholar
Bellemain E, Carlsen T, Brochmann C, Coissac E, Taberlet P, Kauserud H. ITS as an environmental DNA barcode for fungi: an in silico approach reveals potential PCR biases. BMC Microbiol 2010; 10:189. PubMed http://dx.doi.org/10.1186/1471-2180-10-189
Article PubMed Central PubMed Google Scholar
Koetschan C, Förster F, Keller A, Schleicher T, Ruderisch B, Schwarz R, Müller T, Wolf M, Schultz J. The ITS2 Database III: sequences and structures for phylogeny. Nucleic Acids Res 2010; 38:D275–D279. PubMed http://dx.doi.org/10.1093/nar/gkp966
Article PubMed Central CAS PubMed Google Scholar
The ITS2 Database. III http://its2.bioapps.biozentrum.uni-wuerzburg.de

Download references

Acknowledgements

The workshop would not have been possible without the intellectual and financial support of Paula Olsiewski and the Alfred P. Sloan Foundation. We thank Jason Stajich, for organizing the meeting, as well as all the workshop participants for their valuable contributions.

Author information

Authors and Affiliations

Cooperative Institute for Research in Environmental Sciences, University of Colorado, Boulder, Colorado, USA
Scott T. Bates, Noah Fierer & Jon Leff
Department of Plant Pathology & Microbiology, University of California Riverside, Riverside, California, USA
Steven Ahrendt, Dai Gu & Jason E. Stajich
UC Davis Genome Center, University of California Davis, Davis, California, USA
Holly M. Bik
Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, USA
Thomas D. Bruns
Department of Computer Science, Northern Arizona University, Flagstaff, Arizona, USA
J. Gregory Caporaso
USA and Argonne National Labs, Argonne, Illinois, USA
J. Gregory Caporaso
Department of Microbiological and Molecular Genetetics, Michigan State University, East Lansing, Michigan, USA
James Cole
Center for Microbiomics and Human Health, TGen North, Flagstaff, Arizona, USA
Michael Dwan
Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, USA
Shawn Houston
Department of Chemistry and Biochemistry, University of Colorado, Boulder, Colorado, USA
Rob Knight & Laura Wegener Parfrey
Howard Hughes Medical Institute, Boulder, Colorado, USA
Rob Knight
Eastern Cereals and Oilseed Research Centre, Agriculture and Agri-food Canada, Ottawa, Ontario, Canada
Christopher Lewis
Department of Civil, Architecture and Environmental Engineering, University of Texas at Austin, Austin, Texas, USA
Juan P. Maestre
Department of Chemistry and Biochemistry, and Biofrontiers Institute, University of Colorado, Boulder, Colorado, USA
Daniel McDonald
Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
R. Henrik Nilsson
Department of Biological Sciences, Western Illinois University, Macomb, Illinois, USA
Andrea Porras-Alfaro
Centraalbureau voor Schimmelcultures- Royal Netherlands Academy of Arts and Sciences, Utrecht, The Netherlands
Vincent Robert
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
Conrad Schoch
Division of Occupational & Environmental Health, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
James Scott
Institute of Arctic Biology, University of Alaska, Fairbanks, Alaska, USA
D. Lee Taylor

Authors

Scott T. Bates
View author publications
You can also search for this author in PubMed Google Scholar
Steven Ahrendt
View author publications
You can also search for this author in PubMed Google Scholar
Holly M. Bik
View author publications
You can also search for this author in PubMed Google Scholar
Thomas D. Bruns
View author publications
You can also search for this author in PubMed Google Scholar
J. Gregory Caporaso
View author publications
You can also search for this author in PubMed Google Scholar
James Cole
View author publications
You can also search for this author in PubMed Google Scholar
Michael Dwan
View author publications
You can also search for this author in PubMed Google Scholar
Noah Fierer
View author publications
You can also search for this author in PubMed Google Scholar
Dai Gu
View author publications
You can also search for this author in PubMed Google Scholar
Shawn Houston
View author publications
You can also search for this author in PubMed Google Scholar
Rob Knight
View author publications
You can also search for this author in PubMed Google Scholar
Jon Leff
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Lewis
View author publications
You can also search for this author in PubMed Google Scholar
Juan P. Maestre
View author publications
You can also search for this author in PubMed Google Scholar
Daniel McDonald
View author publications
You can also search for this author in PubMed Google Scholar
R. Henrik Nilsson
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Porras-Alfaro
View author publications
You can also search for this author in PubMed Google Scholar
Vincent Robert
View author publications
You can also search for this author in PubMed Google Scholar
Conrad Schoch
View author publications
You can also search for this author in PubMed Google Scholar
James Scott
View author publications
You can also search for this author in PubMed Google Scholar
D. Lee Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Laura Wegener Parfrey
View author publications
You can also search for this author in PubMed Google Scholar
Jason E. Stajich
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Bates, S.T., Ahrendt, S., Bik, H.M. et al. Meeting Report: Fungal ITS Workshop (October 2012). Stand in Genomic Sci 8, 118–123 (2013). https://doi.org/10.4056/sigs.3737409

Download citation

Published: 15 April 2013
Issue Date: January 2013
DOI: https://doi.org/10.4056/sigs.3737409

Meeting Report: Fungal ITS Workshop (October 2012)

Abstract

ITS sequence database for fungal community analyses

Purposes of the Meeting

This meeting was organized in order to:

Participants

Activities

Conclusions / Outcomes

References

Acknowledgements

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Environmental Microbiome

Contact us

Meeting Report: Fungal ITS Workshop (October 2012)

Abstract

ITS sequence database for fungal community analyses

Purposes of the Meeting

This meeting was organized in order to:

Participants

Activities

Conclusions / Outcomes

References

Acknowledgements

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Environmental Microbiome

Contact us