Skip to main content

Fine-scale characterization of the soybean rhizosphere microbiome via synthetic long reads and avidity sequencing

Abstract

Background

The rhizosphere microbiome displays structural and functional dynamism driven by plant, microbial, and environmental factors. While such plasticity is a well-evidenced determinant of host health, individual and community-level microbial activity within the rhizosphere remain poorly understood, due in part to the insufficient taxonomic resolution achieved through traditional marker gene amplicon sequencing. This limitation necessitates more advanced approaches (e.g., long-read sequencing) to derive ecological inferences with practical application. To this end, the present study coupled synthetic long-read technology with avidity sequencing to investigate eukaryotic and prokaryotic microbiome dynamics within the soybean (Glycine max) rhizosphere under field conditions.

Results

Synthetic long-read sequencing permitted de novo reconstruction of the entire 18S-ITS1-ITS2 region of the eukaryotic rRNA operon as well as all nine hypervariable regions of the 16S rRNA gene. All full-length, mapped eukaryotic amplicon sequence variants displayed genus-level classification, and 44.77% achieved species-level classification. The resultant eukaryotic microbiome encompassed five kingdoms (19 genera) of protists in addition to fungi – a depth unattainable with conventional short-read methods. In the prokaryotic fraction, every full-length, mapped amplicon sequence variant was resolved at the species level, and 23.13% at the strain level. Thirteen species of Bradyrhizobium were thereby distinguished in the prokaryotic microbiome, with strain-level identification of the two Bradyrhizobium species most reported to nodulate soybean. Moreover, the applied methodology delineated structural and compositional dynamism in response to experimental parameters (i.e., growth stage, cultivar, and biostimulant application), unveiled a saprotroph-rich core microbiome, provided empirical evidence for host selection of mutualistic taxa, and identified key microbial co-occurrence network members likely associated with edaphic and agronomic properties.

Conclusions

This study is the first to combine synthetic long-read technology and avidity sequencing to profile both eukaryotic and prokaryotic fractions of a plant-associated microbiome. Findings herein provide an unparalleled taxonomic resolution of the soybean rhizosphere microbiota and represent significant biological and technological advancements in crop microbiome research.

Background

Microbial symbionts demonstrate the propensity to overcome basal plant immunity, after which time they may occupy host compartments and therewith engage in mutualism, commensalism, and/or parasitism [1]. These interrelationships exist across a context-dependent, temporally plastic continuum [2, 3] and are pivotal for plant health and physiology [4]. This is exemplified in the rhizosphere, defined as the soil region directly influenced by root exudates, where selective pressures imposed by a host encourage the colonization of fitness-promoting microorganisms [5]. Consequently, the assembly, function, and sustenance of the rhizosphere microbiome have become focal points of intensive research endeavors, particularly within food crop systems, due to their intrinsic link to plant health and broader implications for agricultural sustainability [6].

The Fabaceae (Leguminosae) serve as model systems for rhizosphere microbiome research given their capacity to recruit diazotrophic bacteria for atmospheric nitrogen fixation [7]. This is well evidenced by soybean (Glycine max), a global staple crop for which rhizosphere microbiome dynamics have been extensively studied. Mendes et al. [8] found that bacterial assemblages in the soybean rhizosphere were less diverse than in corresponding bulk soils, reflecting preferential selection of microbiota adept in N, Fe, P, and K metabolism. Likewise, Zhang et al. [9] surveyed 51 soybean fields across China, revealing that while soil pH predominantly influenced bacterial communities, eukaryotic assemblages were more responsive to Mg levels. Biotic stressors including Fusarium virguliforme [10], Phytophthora sojae [11], and Heterodera glycines [12] have also been implicated to modulate soybean rhizosphere microbiome structure, as have tillage [13,14,15], biological product/fertilizer application [16], host growth stage [13, 17], host genotype [18], and other edaphic parameters [14, 15]. Nonetheless, one must exercise caution when interpreting such findings, especially when bridging taxonomy with function, given the incomplete representation of soil microbiota in public databases as well as the inherent constraints of common microbiome profiling methodologies (reviewed at-length by Baldrian [19]).

Amplicon sequencing is a primary method for microbiome profiling, as microorganism identification is not restricted by culturing capacity [20] and workflows are resource-efficient in comparison to shotgun metagenomic approaches [21]. Leveraging PCR amplification of marker genes, amplicon sequencing entails primer annealing to conserved regions within rRNA operons and the use of adjacent hypervariable regions for taxonomic classification [22]. In prokaryotes, the 16S rRNA gene is targeted due to its nine hypervariable regions (V1 to V9) interspersed with highly conserved sequences [23]. For eukaryotes, multiple regions within the rRNA operon can be utilized. The 18S rRNA gene (SSU or Small Subunit) serves a similar purpose to the 16S in prokaryotes, providing broad taxonomic identification [24]. For heightened resolution, the Internal Transcribed Spacer (ITS) regions ITS1 and ITS2 are chosen [25]. These regions are located between the SSU and the 5.8S rRNA genes, and between the latter and the Large Subunit (LSU or 28S rRNA gene) in the operon [26]. PCR amplicon sequencing of such regions produces amplicon sequence variant (ASV) read numbers that estimate organism abundances with reasonable precision, though estimate accuracy can vary significantly. Despite improved coverage of marker gene-based approaches [27], traditional short-read sequencing technologies are limited to the interrogation of few hypervariable regions, rendering region-specific bias [28], uncertain/erroneous taxonomic classification [29], and limited classification beyond genus level [19,20,21]. Many in the field recommend using long read-based sequencing strategies to overcome such limitations (i.e., Oxford Nanopore Technology [ONT] and Pacific Biosciences [PacBio]) [30, 31]; yet, ONT has demonstrated inferior accuracy compared to other sequencing platforms [32] and PacBio remains relatively cost-prohibitive [31]. To derive tractable biological inference from microbiome profiling, it is imperative to employ methodologies with enhanced resolution, accuracy, and accessibility.

The LoopSeq platform by Element Biosciences (formerly Loop Genomics) is a synthetic long-read (SLR) sequencing method that addresses many of the defined challenges surrounding amplicon-based microbiome profiling. To this end, each parent DNA molecule in a sample is barcoded with a unique molecular identifier (UMI) which is thereafter distributed intramolecularly across the molecule [33]. Post-fragmentation and sequencing, short reads sharing a UMI are assembled de novo to reconstruct the entire parent molecule sequence. This approach employs a consensus-driven error correction system that renders a higher fraction of error-free reads compared to PacBio circular consensus reads and the cited ONT per-base error rate [33]. LoopSeq additionally minimizes the formation of PCR amplicon chimeras, as chimeric molecules are unlikely to contribute to the consensus unless they dominate the reads for a given UMI [33]. In practice, LoopSeq outperformed V3-V4 short-read sequencing in terms of taxonomic resolution and identification accuracy for human gut microbiota [34], and was superior to V4 and PacBio in accuracy and cost for soybean rhizosphere microbiome profiling (~ 90% per-Mb cost reduction compared to PacBio) [29]. Thus, LoopSeq SLR technology holds transformative potential for comprehensive and precise microbiome analysis across diverse study systems.

In the present study, the LoopSeq SLR platform was used to profile the soybean rhizosphere microbiome under field conditions. The experimental design incorporated two commercially available soybean cultivars with contrasting levels of tolerance to F. virguliforme-induced Sudden Death Syndrome (SDS), the absence/presence of an in-furrow/foliar biostimulant regimen, and four growth stages spanning vegetative and reproductive development (Fig. 1). Following DNA isolation and library preparation, the UMI-tagged fragments were sequenced using avidity chemistry, which independently optimizes DNA template traversal and nucleotide identification, achieving an accuracy surpassing one error per 10,000 bp [35]. The resulting short reads were assembled into SLRs spanning all nine hypervariable regions for the 16S rRNA gene or the entire 18S-ITS1-ITS2 region of the eukaryotic rRNA operon. Complementary to standard microbiome assessment, 24 edaphic properties and five agronomic traits were measured, integrated into phenotype-taxon networks, and used to prioritize taxa with putative ecological relevance (Fig. 1).

Fig. 1
figure 1

Schematic representation of the experimental design. This graphic was created using BioRender (Biorender.com)

Materials and methods

Site description and management

The test site was located at the Arkansas State University Agricultural Teaching and Research Center (35° 50′ 16″ N, 90° 40′ 00″ W). The cropland consisted of a Collins silt loam (Coarse-silty, mixed, active, acid, thermic Aquic Udifluvent) with a 0 to 1% slope and had been used historically for the cultivation of a corn (Zea mays)-soybean rotation in the absence of tillage. A cover crop blend of black oat (Avena strigosa), Austrian winter pea (Pisum sativum L. ssp. sativum var. arvense), and buckwheat (Fagopyrum esculentum) was sown during the fallow period using a Great Plains End Wheel No-Till Compact Drill (Great Plains Manufacturing Incorporated, Salina, KS, USA) with 19.05-cm row spacing. In addition, cattle (Bos taurus) were grazed on the site for 10 d.

At the beginning of the growing season, the cover crop was terminated using 1.46 L ha−1 Roundup PowerMAX 3 (Bayer CropScience, Monheim, Germany) plus 0.37 L ha−1 Verdict (BASF, Ludwigshafen, Germany). Seven d after, the test site was fertilized with P2O5 (51.45 kg ha−1), K2O (99.87 kg ha−1), and S (3.03 kg ha−1). Four passes were made with a John Deere tandem disc (Deere and Company, Moline, IL, USA), and were followed by one pass with a Triple K field cultivator (Kongskilde Agriculture, Albertslund, Denmark). Raised seedbeds were then formed using a 2-row Hipper Roller (Brandt, Springfield, IL, USA) with 76.2-cm row spacing. Approximately 2.34 L ha−1 Command® 3ME (FMC Corporation, Philadelphia, PA, USA) plus 0.37 L ha−1 Verdict® were applied for pre-emergent weed control 2 d prior to the soybean planting date. Soybeans (cultivars described below) were then planted with a total row length of 106.68 m at a seeding rate of 345,947 seeds ha−1 using a John Deere 1705 4-row vacuum planter (Deere and Company). A second fertilizer application was made 1 d after planting and was equivalent to the first. Following plant emergence, 3.51 L ha−1 Warrant (Bayer CropScience) and 2.34 L ha−1 Roundup PowerMAX 3 were applied for the management of yellow nutsedge (Cyperus esculentus L.). Manual weed removal was performed weekly throughout the growing season. Moreover, a channeled (furrowed) surface irrigation system ran along the north side of the test site and was used to deliver one acre-inch water (~ 102,789 L) to the crop. Irrigation began the first week in July and was performed weekly until the third week in September. Weather data for the growing season were obtained from Visual Crossing Corporation (https://www.visualcrossing.com/) and can be found in Additional file 1.

Experimental design

A randomized split-block design comprising eight rows of BASF Credenz soybeans was used in this study. Four rows consisted of the cultivar CZ4979X (maturity group 4.9; SDS-tolerant), while the other four were CZ4810X (maturity group 4.8; SDS-susceptible). Both 4-row sections were divided into 6.096-m plots separated by 7.62-m buffer zones, yielding 8 randomized plots (4 control and 4 treatment) for each section (16 total). At the early vegetative stage (V1 – one set of unfolded trifoliate leaves), the biostimulant IgniteS2 (AgriGro Incorporated, Doniphan, MO, USA) was applied to the base of plants in treatment plots at a rate of 1.17 L ha−1. The biostimulant FoliarBlend (AgriGro Incorporated) was applied to the foliage at mid-vegetative (V3 – third set of unfolded trifoliate leaves) and early reproductive (R3 – full flower inflorescence/reproductive stage) stages at a rate of 1.17 L ha−1. Biostimulant applications were made with a backpack sprayer within the inner 2 rows for each treatment plot. Soybean growth stages were defined by Fehr and Caviness [36].

Soil sampling and DNA isolation

From each plot, a composite sample comprising 10 soil cores was taken at the following growth stages: early vegetative (V1—first unfolded trifoliate, preceding biostimulant application), late vegetative (V6—6th node, preceding anthesis), early reproductive (R2—full flower inflorescence/reproductive stage), and late reproductive (R6—full pod development) (i.e., 640 cores reduced to 64 composite samples). Spatially distributed selective pressures reduce microbial diversity in proximity to the root surface [37, 38], which must be considered during sampling. Therefore, cores were collected with a 2 cm-diameter auger 4 cm from the base of the plant to a depth of 10 cm (Fig. 1). Composite samples were collected in treatment- and cultivar-specific vessels to minimize cross-contamination. In addition, augers were sterilized in a 20% bleach solution (7.4% NaClO) for 5 min and rinsed thoroughly with water between plot samplings. Collection and handling procedures were consistent across all samples. Following collection, each composite sample was homogenized, sieved to 2 mm, and subdivided for downstream physiochemical, enzyme, and microbiome analyses. Subsamples for enzyme and microbiome analyses were placed immediately into sterile 50 mL conical tubes, freeze-dried on solid CO2, and stored at − 80 °C.

DNA was isolated from 250 mg of each sample using the DNeasy PowerLyzer PowerSoil Kit (Cat #12855-100) according to the manufacturer’s protocol (QIAGEN, Hilden, Germany). Following the addition of soil, PowerBead Solution, and Solution C1 to the PowerBead tube, cells were lysed with a Precellys 24 tissue homogenizer with the following program: 3000 RPM × 30 s duration x three cycles with a 20 s delay between cycles. A Qubit fluorometer (Thermo Fisher Scientific, Waltham, MA, USA) paired with a dsDNA high-sensitivity assay kit (Cat #Q32851) was used subsequently to estimate DNA concentration. DNA purity was approximated from 260/230 and 260/280 nm absorbance ratios using a NanoDrop ND-1000 spectrophotometer. A sample without soil served as a ‘kitome’ control during DNA isolation [39]. Additional positive (ZymoBIOMICS™ Microbial Community DNA Standard [Cat #D6305]) and negative (template-free sample) controls were included for barcoding and sequencing steps.

Amplicon sequencing and SLR assembly

Sequencing libraries were prepared at Element Biosciences (San Diego, CA, USA) using the Amplicon LoopSeq for AVITI (Cat #840–00002), Extension LoopSeq for AVITI (Cat #840–00003), and Element Elevate™ Library Circularization Kit (Cat #830–00001) as per manufacturer guidelines. Initial steps involved target enrichment and PCR amplification of the 16S and 18S-ITS rRNA regions using the following primers: 16S Fwd (5′-AGAGTTTGATCMTGGCTCAG-3′), 16S Rev (5′-TACCTTGTTACGACTT-3′), 18S-ITS Fwd (5′-TACCTGGTTGATYCTGCCAGT-3′), and 18S-ITS Rev (5′-GGTTGGTTTCTTTTCCT-3′, 5′-TAAATTACAACTCGGAC-3′, 5′-TCCTCCGCTTWTTGWTWTGC-3′, 5′-CTBTTVCCKCTTCACTCG-3′). Subsequently, UMIs and a LoopSeq index were integrated into each sample, and the barcoded samples were calibrated, amplified, and multiplexed. Library preparation was then performed by distributing each UMI to a random position within the respective parent molecule, fragmenting the barcoded molecules at each UMI position, and adding Element indexes and adapters. The final library was circularized and sequenced on the Element AVITI System (Cat #880-00001). All amplification conditions are provided at https://github.com/brett-hale/Hale_2024_SLR.git. The Bases2Fastq software (v1.4.0) was used to convert the bases files into FASTQ files and de-multiplex the pooled library based on index sequences.

Data processing was performed with the Element Biosciences cloud-based platform, largely following the bioinformatics pipeline established by Callahan et al. [33]. First, adapter sequences were removed from short reads using Trimmomatic (v0.36) [40], and trimmed reads were de-multiplexed based on their LoopSeq Index. Reads within a grouped sample were then binned by UMI and further processed through SPAdes (v3.9) [41], allowing the de novo assembly of SLRs spanning the full length of the defined rRNA operon sequence. The resultant SLRs were further processed and thereafter clustered into ASV bins of 100% sequence homology with the DADA2 R package (v1.28.0) [42] employing the specifications outlined by Callahan et al. [33]. Taxonomic assignments were conducted using BLAST, with a criterion of 97% sequence similarity. Prokaryotic classifications were based on the SILVA SSU database (v138) [43], while eukaryotic identifications used the UNITE database (all eukaryotes v8) [44, 45]. Short-read and SLR summary data are provided in Supplementary Fig. 1, Additional file 2; Additional file 3; and Additional file 4.

Microbiome statistical analysis

The ASV count matrices, taxonomic assignments, and sample metadata were imported into RStudio (v4.2.2) [46] and combined to create ‘phyloseq’ objects with the phyloseq package (v1.42.0) [47]. ASVs mapped to kingdoms ‘Viridiplantae’ and ‘Metazoa’ were removed subsequently from the 18S-ITS object to accentuate true eukaryotic microbiota. Prior to statistical analysis, samples were decontaminated based on ASVs present in the two negative controls using the ‘isContaminant’ function in the decontam package (v1.13) with a 0.5 prevalence probability threshold [48]. Within-sample (α) diversity was then investigated by estimating the Chao1 index [49], Simpson diversity [50], and Shannon diversity [51] using the ‘estimate_richness’ function in phyloseq. Pielou's evenness [52] was assessed with the ‘evenness’ function in the microbiome package (v1.2.1) [53]. Rank-based measures of association between the eukaryotic and prokaryotic Chao1 indices, Shannon diversity, and between the Chao1 index and Shannon diversity were inferred using the 'cor.test' function in the R package stats (v4.2.2), leveraging Spearman’s ρ statistic [54]. The package ggplot2 (v3.4.2) [55] was used for data visualization.

The 16S and 18S-ITS α diversity datasets were divided into the baseline measurement (V1) and growth stages succeeding biostimulant application (V6, R2, and R6). Within each partitioned dataset, the assumption of normality was assessed for Shannon diversity and Chao1 using the Shapiro–Wilk test [56] in the stats package. In instances when the normality assumption was not met (p value < 0.05), data were fitted to five probability distributions using the ‘fitdist’ function in the fitdistrplus package (v1.1–11) [57], and the best-fitting distribution was inferred from the minimum Akaike information criterion (AIC). Distribution fit was supported qualitatively with quantile–quantile plots generated with packages car (v3.0–12) [58] and MASS (v7.3–54) [59]. Generalized linear mixed models (GLMMs) were then implemented with package glmmTMB (v1.1.7) [60]. Treatment, cultivar, growth stage, and first-, second-, and third-order interactions were incorporated as explanatory variables, with field location incorporated as a random effect. Automated model selection was performed with the MuMin (v1.47.5) [61] ‘dredge’ function, with selection constrained to models containing treatment, cultivar, and growth stage. The best-fitting model was selected by minimum AICc (second-order AIC) as recommended by Burnham and Anderson [62]. Model fit was assessed with standardized and deviance residuals leveraging package stats. Additionally, 250 datasets were simulated and used to calculate an empirical cumulative density function, the residuals of which were examined through quantile–quantile and residual-fitted value plots. Binomial and Poisson models were checked explicitly for overdispersion using ‘check_overdispersion’ from the performance package (v0.10.3) [63]. Moreover, fixed effects retained in final models were further evaluated by hierarchical partitioning of marginal R2 values using glmm.hp (v0.1–0) [64] and through power analyses implemented with the ‘powerSim’ function of the SIMR package (v1.0.6) [65]. Marginalized coefficients were extracted for models applying a nonlinear link function with the ‘marginal_effects’ function of margins (v0.3.26) [66], and the mean coefficients (denoted hereafter as “mean estimates”, or “ME”) were visualized with ggplot2.

Following α diversity estimation, unmapped ASVs were removed from the phyloseq objects, and count matrices were normalized by cumulative sum scaling (CSS) with metagenomeSeq (v1.40.0) [67]. Condition-specific (treatment-cultivar-growth stage) compositions were then visualized at the phylum level with ggplot2. Thereafter, compositional dissimilarity was assessed at ASV level for samples preceding and succeeding biostimulant application using Bray–Curtis, Euclidean, and Jaccard distances [68, 69] with the function ‘distance’ in phyloseq. Statistical trends in community structure were inferred by conducting permutational multivariate analysis of variance (PERMANOVA) with the defined dissimilarity matrices. Each PERMANOVA was performed independently with the ‘adonis2’ function in vegan (v2.6–4) [70] specifying treatment, cultivar, and growth stage as well as first- and second-order interactions as explanatory variables, and with 9,999 permutations constrained by field location. Compositional variance attributed to each explanatory variable was inferred from PERMANOVA R2 values. Multivariate homogeneity of group dispersions was then analyzed with the vegan function ‘betadisper’. Analysis of variance (ANOVA) was used subsequently to determine if distances to group centroids varied significantly between fixed effect levels. PERMANOVA and dispersion test results were consistent across dissimilarity matrices (Supplementary Table 1, 2, Additional file 2); therefore, representative ordinations of Bray–Curtis dissimilarity were plotted using Principal Correspondence Analysis (PCoA) and Non-metric Multidimensional Scaling (NMDS) with the ‘ordinate’ phyloseq function and ggplot2. In the post-biostimulant application datasets, the ‘simper’ function from the vegan package was employed to determine the contribution of individual ASVs to the Bray–Curtis dissimilarity between levels of each fixed effect [71]. The top five ASVs for each pairwise comparison were identified, their taxonomy extracted from the phyloseq object, and their contribution percentage visualized using the ComplexHeatmap package (v2.14.0) [72]. Lastly, Bray–Curtis dissimilarity matrices were reconstructed with CSS-normalized counts from both mapped and unmapped full-length ASVs, compositional trends were assessed via PERMANOVA and β dispersion estimation, and parallels were discerned between the full and partial dataset analyses.

The taxonomic resolution achieved with both 16S and 18S-ITS amplicon sequencing enabled the retrieval of functional profiles for most mapped ASVs. Genus-level eukaryotic functions were sourced from the FungalTraits database (v1.2) [73]. Prokaryotic functional profiles were derived at the species level from BacDive, a bacterial/archaeal metadatabase maintained by the German Collection of Microorganisms and Cell Cultures [74]. The ‘retrieve’ function of the R package BacDive (v0.8.0) [75] was employed with parameters set as “query = species list” and “search = taxon”. The resulting metadata was narrowed to terms including “aerobe”, “philic”, “path”, and “gram”, reflecting O2 tolerance, temperature range, pathogenicity, and gram stain, respectively. The collated eukaryotic and prokaryotic data were then combined into a comprehensive functional data frame capturing Structure (eukaryotic fruiting body/prokaryotic gram stain), Growth (eukaryotic growth form/prokaryotic temperature range), Environment (eukaryotic aquatic habitat/prokaryotic O2 tolerance), and Lifestyle/Pathogenicity. This data informed and contextualized taxa highlighted in subsequent analyses.

Community membership was determined at genus and species levels for eukaryotic and prokaryotic communities, respectively, which were the lowest taxonomic classifications to which all full-length, mapped ASVs could be identified. Core taxon analysis was performed with the microbiome package by first converting CSS-normalized counts to relative abundances with the function ‘transform’ and specification ‘compositional’, and by subsequently obtaining taxa with a prevalence ≥ 0.5 with the ‘core_members’ function. The relative abundance of core taxa was visualized with the ComplexHeatmap package. The ASVs unique to a fixed effect level were retrieved using the ‘unique_taxa’ function of the phylosmith package (v1.0.6) [76]. The full dataset was used to identify unique ASVs between cultivars and growth stages. The analysis was repeated for treatment with datasets partitioned into the baseline measurement and samplings succeeding biostimulant application, and ASVs retrieved exclusively from the latter dataset were deemed unique to a level of treatment. This information was used to identify unique and shared taxa across fixed effects and domains, which were visualized with Venn diagrams constructed with ggvenn (v0.1.10) [77] as well as with ComplexHeatmap.

Differentially abundant taxa were identified between fixed effect levels with MaAsLin2 (Microbiome Multivariable Associations with Linear Models) (v1.12.0) [78]. CSS-normalized counts for taxa with a minimum prevalence > 0.1 were fitted with a zero-inflated negative binomial (ZINB) regression model composed of treatment, cultivar, and growth stage as fixed effects and field location as a random effect. Maaslin2 inherently corrects for multiple testing using the Benjamini–Hochberg approach, thus taxa with a q-value < 0.25 were deemed significant as recommended by the package authors [78] and as reported in the literature [79, 80]. Consistent with unique taxon identification, the full dataset was used to identify differentially abundant taxa between levels of cultivar and growth stage, with partitioned datasets deployed for treatment. Regarding the latter, taxa exhibiting statistically significant, consistent directional changes (both positive or negative coefficients) across the baseline and post-treatment datasets were excluded. Conversely, taxa that displayed opposing directional changes between the two datasets (i.e., positive in one and negative in the other), or were present exclusively in the post-treatment dataset, were retained. Log-normalized False Discovery Rate (FDR; − sign[coefficient)*log(q-value]) for differentially abundant taxa were visualized with ComplexHeatmap.

To further assess community membership, eukaryotic and prokaryotic phyloseq objects were combined and conglomerated at the genus level with the ‘merge_phyloseq’ and ‘conglomerate_taxa’ functions, respectively, of the phylosmith package. A global pairwise Spearman co-occurrence network was then constructed by obtaining significant positive and negative associations (ρ >  ± 0.6, p value < 0.05) with the phylosmith ‘co_occurrence’ function. The p-values were corrected for multiple testing using the stats function ‘p.adjust’ specifying FDR correction. Associations with a q-value < 0.05 were visualized with the phylosmith function ‘co_occurrence_network’ with nodes representing genera and edges representing positive and negative associations.

Condition-specific co-occurrence networks (n = 16) were constructed as described for the global networks, and the phylosmith ‘network_layout_ps’ function was used subsequently to create a graph object from each set of co-occurrences. Comparisons of network topology were then performed by calculating centralization degree (the concentration of network centrality), cluster count, connectance (the proportion of possible connections that are present), edge count, node count, and mean degree (the average degree [number of connections] of nodes) with the ‘net_properties’ function in the ggClusterNet R package (v0.1.0) [81]. Giant component size (the size of the largest connected component) and modularity (the extent to which a network can be divided into non-overlapping communities) were determined with the igraph R package (v1.4.2) [82]. For the latter metric, nodes were assigned to communities using the Walktrap algorithm [83] implemented with the ‘cluster_walktrap’ function, and modularity was calculated with the ‘modularity’ function using the community membership vector as input. Moreover, microbial co-occurrence networks inherently exhibit a scale-free topology in which node degrees follow a power-law distribution [84]. To this end, condition-specific graph objects were converted to degree distribution vectors with the igraph ‘degree_distribution’ function, and a power-law distribution was fitted to each vector with the ‘fit_power_law’ igraph function. The Kolmogorov–Smirnov (KS) test statistic was used to quantify the distance between node degree distribution and a power-law distribution. Nodes were prioritized for each network by assigning Kleinberg’s hub centrality scores [85] with the igraph function ‘hub_score’ with logical scaling applied. Nodes with a hub score > 0.2 were considered hubs, which was consistent with prior studies [86]. To further support predicted co-occurrences, global and condition-specific networks were reconstructed with Pearson associations [87] following the described methodology. Unique and shared co-occurrences and nodes between the association methods were visualized with ggvenn Venn diagrams. Additional rank-based Spearman associations were inferred between edge count and node count (network properties used commonly to reflect density) and the remaining topological features for each set of condition-specific networks. Furthermore, Friedman rank sum tests [88] were applied with the stats package to assess the differential ranking of conditions between association methods for each topological feature. Network membership was visualized using ComplexHeatmap.

Significant microbial co-occurrences, identified through global networks of one or both association methods, were combined with edaphic and agronomic parameters (n = 24 and 5, respectively; methodology described hereafter) to construct phenotype-taxon networks using the PhONA (phenotype-OTU network analysis) R package (v0.2) [89]. This package first employs lasso regression to identify taxa predictive of a defined phenotype. From these predictive taxa, PhONA constructs a generalized linear model (GLM) and subsequently integrates the GLM with the user-provided co-occurrence matrix [89]. All sample information was used for edaphic parameter network analysis, while agronomic parameter network analysis was performed with samples from the R6 growth stage. Resulting phenotype-taxon networks were reconstructed with igraph, and network membership was visualized using ComplexHeatmap.

The PhONA package assigns modularity roles to each node by computing connectivity (within-module z-score of edge weights) and a participation coefficient (distribution of a node's links across different modules), methodology consistent with the rnetcarto R package [89, 90]. In this study, the modularity analysis was augmented with Kleinberg's hub centrality (computed as described for co-occurrence networks), providing a more nuanced differentiation of core network nodes based on information dissemination roles. The three metrics were individually normalized to a [0,1] scale and then combined with equal weights to compute a composite centrality score for each node in the phenotype-taxon network. Nodes were prioritized by first filtering to those with a ≥ 0.2 prevalence across all samples in an effort to minimize biased associations while retaining putative specialization [91]. Thereafter, genera were ranked by mean composite score, accentuating those for which a phenotype could indicate a niche specialization in addition to those central across all networks. The top 20 genera were obtained for each parameter type (edaphic and agronomic), and rank-based measures of association were inferred between genera using CSS-normalized counts, as well as between genera and edaphic/agronomic parameters.

Edaphic parameter estimation

Freeze-dried subsamples were sent to Ward Laboratories Inc. (Kearney, NE, USA) for quantitative assessment of β-glucosidase (GB3), N-acetyl-β-glucosaminidase (NAG), phosphodiesterase (PDE), alkaline phosphatase (ALP), acid phosphatase (ACP), and arylsulfatase (ARS). The GB3 assays were based on Moscatelli et al. [92] methods, while NAG assays employed procedures from Deng and Popova [93] and Parham and Deng [94]. Both phosphatase enzymes were analyzed using Nannipieri et al. [95] protocols. The ARS assays followed the methodologies of Tabatabai and Bremner [96] and Klose et al. [97]. Each assay utilized 2 g of soil, and enzymatic activities were quantified using a BioTek Epoch 2 Microplate Spectrophotometer (Agilent Technologies, Santa Clara, CA, USA).

The remaining subsamples were kept in plastic bags at room temperature until shipment to Waypoint Analytical (Memphis, TN, USA) where the following macro- and micronutrient levels were measured following standard Mehlich 3 Extraction procedure [98]: B, Ca, Cu, Fe, K, Mg, Mn, Na, P, S, and Zn. Additionally, soil pH was assessed using the 1:1 soil–water ratio method and buffer pH using the Shoemaker–McLean–Pratt (SMP) procedure [99]. Soil organic matter (SOM) was estimated using the Loss-on-ignition method [100] and N in the form NO3- as defined by Swift and Sparks [101]. Lastly, the percent saturation of Ca, H, K, Mg, and Na were used to estimate cation exchange capacity (CEC) in milliequivalents per 100 g (meq/100 g) of soil. Detailed protocols for all edaphic measurements can be found in Gavlak et al. [102].

Pairwise rank-based measures of association were inferred across all parameters. Furthermore, differences between fixed effect levels for each parameter were assessed using GLMMs as described previously.

Agronomic parameter estimation

When soybean plants reached physiological maturity (R8—95% of pods have reached their full mature color), three plants per plot (n = 48) were selected randomly for measurement of the following characteristics: pods per plant (exclusive to those containing ≥ 1 full seed), root biomass, and aboveground biomass. For biomass measurements, roots were rinsed gently with tap water to remove substrate and plants oven-dried at 60 °C for 72 h. Roots were then cut at the soil line (~ 4 cm above the 1st lateral root) and root and aboveground dry weight determined independently. Plant selection and data collection were performed by researchers blinded to experimental conditions.

One hundred-seed weight and theoretical grain yield were also determined at the R8 growth stage. First, an area of 1.16 m2 was selected randomly from each plot and manually harvested. Collected plants were threshed using a stationary Plot Master Combine (ALMACO, Nevada, IA, USA). Seed moisture was then assessed using a mini GAC® Plus Grain Moisture Tester (Dickey-John Corporation, Auburn, IL, USA) and seed weight calculated at a 13% moisture base. Theoretical grain yield was determined in kg ha−1 with the corrected seed weight.

GLMMs were used to discern differences between levels of treatment, cultivar, and treatment-cultivar interactions. In instances where multiple plants were selected per plot, plant replicates were modeled as nested random effects to account for a lack of independence among observations.

Additional information

Plant lodging was observed around R5 (beginning of seed fill). In addition, Cercospora leaf blight (Cercospora kukuchii) and soybean stem borer (Dectes texanus) damage occurred between R6 and R8 (full seed to maturity).

Results

SLRs enabled a taxonomically resolved assessment of the soybean rhizosphere microbiome

The present study employed SLR technology in tandem with avidity sequencing to explore the composition and structure of the soybean rhizosphere microbiome. For 18S-ITS amplicon sequencing targeting eukaryotes, this method generated a collective 852 M short reads assembled into 1.4 M SLRs averaging a length of 1108.31 ± 663.99 bp (full and partial length) (Supplementary Fig. 1, Additional file 2; Additional file 3). Similarly, the full-length SLRs were 1084.69 ± 671.98 bp. From these, 1,014 denoised ASVs were classified at the genus level using the Ensemble reference database, and 44.77% could be further classified at the species level. The mapped eukaryotic SLRs were 2328.54 ± 213.7 bp. Conversely, the 16S (prokaryotic) dataset comprised 192 M short reads, resulting in 1.4 M SLRs with a mean length of 1362.44 ± 291.46 bp (Supplementary Fig. 1, Additional file 2; Additional file 4). The full-length contigs were 1,476.06 ± 70.51 bp in length. Mapping these to the Silva138 database permitted the identification of 895 ASVs, all classified at the species level, with 207 (23.13%) achieving strain-level classification. The mapped prokaryotic SLR length was consistent with that of all full-length SLRs, being 1477.64 ± 25.15 bp.

The soybean rhizosphere microbiome displayed a diverse taxonomic profile spanning seven kingdoms. The eukaryotic fraction consisted primarily of partly aquatic, saprotrophic fungi that exhibit perithecial fruiting bodies and filamentous mycelial growth forms, while the prokaryotic fraction largely comprised gram-negative, mesophilic, aerobic bacteria. Beyond bacteria and fungi, protist populations from five kingdoms were observed: Alveolata, characterized by membrane-bound sacs (alveoli) beneath the plasma membrane [103]; Apusozoa, flagellated unicellular eukaryotes [104]; Heterolobosa (i.e., Heterolobosea), protists with both amoeboid and flagellated stages [105]; Rhizaria, identified by their thread-like pseudopodia [106]; and Stramenopila, which encompasses diatoms, brown algae, and oomycetes, differentiated by their heterokont flagella [107].

Growth stage and spatial heterogeneity best explained microbiome structure

In assessing microbiome complexity and structure, the within-sample characteristics of ASV richness, diversity, and evenness were determined and compared across treatments, cultivars, and growth stages (Fig. 2A–C). ASV richness was measured with the non-parametric estimator Chao1 given its capacity to project undetected taxa based on the abundance of those rarely observed in a dataset. This extrapolated measure accounts for potential undersampling in high-diversity environments (e.g., soil), providing a more thorough assessment of community richness [49, 108]. Herein, the mean eukaryotic Chao1 index value was 1,033 ± 36, and was increased marginally in growth stage R6 vs R2 (p-value = 0.07, ME = 220.62) and in cultivar CZ4979X vs CZ4810X (p-value = 0.09, ME = 205.38) (Fig. 2C). A cultivar-growth stage interaction was also observed and explained the highest proportion of variance (marginalized R2 = 0.88), with CZ4979X:R6 being significantly less than the reference (p-value = 0.0023, ME = -533.75) (Fig. 2C). The prokaryotic Chao1 index was 85.44 ± 8.76 and demonstrated a significant increase in R6 vs R2 (p-value = 0.0099, ME = 50.28) and significant decreases in CZ4979X vs CZ4810X (p-value = 0.04, ME = -32.24) and in treatment control vs biostimulant (p-value = 0.04, ME = -31.72) (Fig. 2C). Moreover, growth stage explained the highest proportion of variance for prokaryotic Chao1 (marginalized R2 = 0.45). The baseline (V1) sampling showed no significant trends in the eukaryotic or prokaryotic Chao1 indices.

Fig. 2
figure 2

α diversity estimation. A Boxplots of the Chao1 index, Shannon diversity, Simpson diversity, and Pielou's evenness for eukaryotic and prokaryotic datasets. B Rank-based association between eukaryotic and prokaryotic Chao1 (top left), eukaryotic and prokaryotic Shannon diversity (top right), and Shannon diversity and Chao1 (bottom left). C Mean estimates (coefficients) for explanatory variables in α diversity GLMMs. Following removal of the baseline timepoint (V1), GLMMs were implemented to determine the effect of treatment, cultivar, and growth stage (fixed effects) on each response variable. Point size corresponds to hierarchically partitioned R2 values..p ≤ 0.1, *p ≤ 0.05, **p ≤ 0.01, ***p ≤ 0.001

Shannon and Simpson indices were estimated to comprehensively assess α diversity. The Shannon index integrates ASV richness and evenness, with increasing values indicative of greater diversity and uniformity among ASV abundances [51]. Conversely, the Simpson index quantifies ASV dominance, where elevated values denote diminished dominance and heightened diversity [50]. The two indices displayed cooperative, statistically insignificant trends across fixed effect levels in the present study. Eukaryotic α diversity (6.3 ± 0.07 Shannon and 0.99 ± 0.002 Simpson) peaked at the R2 growth stage (evidenced by ME at V6 vs R2 and R2 vs R6), was reduced in CZ4979X vs CZ4810X, and was increased in control vs biostimulant, with growth stage explaining the highest proportion of variance (marginalized R2 = 0.73 and 0.33 for Shannon and Simpson indices, respectively) (Fig. 2C). The prokaryotic α diversity indices (3.82 ± 0.09 Shannon and 0.94 ± 0.007 Simpson) demonstrated an opposing trend, increasing with time and being reduced in control vs biostimulant (Fig. 2C). The direction of change between CZ4979X and CZ4810X was consistent with eukaryotic α diversity, and variance was most attributed to cultivar for both indices (marginalized R2 = 0.41 and 0.58 for Shannon and Simpson, respectively). Notably, statistically significant differences were detected in the baseline between control plots and those to which biostimulants would be applied, with the control plots displaying reduced eukaryotic and prokaryotic α diversity (eukaryotic Shannon p-value = 0.1, ME = -0.19; eukaryotic Simpson p-value = 0.05, ME = -0.001; prokaryotic Shannon p-value = 0.005, ME = -0.41; prokaryotic Simpson p-value = 0.002, ME = -0.04).

While α diversity incorporates both richness and evenness, relying solely on composite diversity indices might obscure their individual contributions to ecosystem function [109]. Pielou's evenness was therefore employed to quantify the count distribution across ASVs. The index was consistent between fixed effect levels and domains, with a mean of 0.91 ± 0.007 for eukaryotes, 0.90 ± 0.009 for prokaryotes, and no statistically significant trends observed (Fig. 2A, C). This observation was supported by a strong association between Chao1 richness and Shannon diversity (ρ = 0.95, p-value = 4.58e−63) (Fig. 2B). Eukaryotic evenness was influenced predominantly by growth stage (marginalized R2 = 0.98), while no patterns were present for the prokaryotic dataset. Consistent with Shannon and Simpson diversity, the baseline control plots showed reduced prokaryotic evenness in comparison to biostimulant plots (p-value = 0.0004, ME = -0.089).

Compositional dissimilarity was first assessed using CSS-normalized counts from full-length, mapped ASVs. The phylum-level composition was first visualized across experimental conditions, revealing that the eukaryotic rhizosphere microbiome was largely composed of Ascomycota and the prokaryotic microbiome dominated by Proteobacteria (Fig. 3A, C). Dissimilarity matrices were constructed subsequently leveraging Bray–Curtis, Euclidean, and Jaccard distances, and compositional trends were inferred between fixed effect levels with PERMANOVA and β dispersion estimation. Findings were consistent across the matrices (Supplementary Table 1, 2, Additional file 2); therefore, representative ordinations were generated with Bray–Curtis dissimilarity (Fig. 3B, D). Notably, growth stage had the most significant impact on eukaryotic microbiome composition (Bray–Curtis PERMANOVA R2 = 0.08, F-value = 2.0, p-value = 0.0001) (Fig. 3E; Supplementary Table 1, Additional file 2), yet ANOVA suggested heterogeneous β dispersion across growth stage levels (F-value = 6.32, p-value = 0.003) (Supplementary Table 1, Additional file 2). Both PCoA and NMDS ordinations implicated a strong spatial effect on eukaryotic microbiome composition as well (Fig. 3B). Prokaryotic microbiome composition showed neither strong statistical nor qualitative trends, albeit a treatment-growth stage interaction explained the highest proportion of variance (Bray–Curtis PERMANOVA R2 = 0.06, F-value = 1.31, p-value = 0.08) (Fig. 3D, F; Supplementary Table 2, Additional file 2). Lastly, while no compositional trends emerged from the baseline measurements, β dispersion did vary between treatment levels for Euclidean dissimilarity (ANOVA F-value = 4.48, p-value = 0.05) (Supplementary Fig. 2; Supplementary Table 3, Additional file 2).

Fig. 3
figure 3

Microbiome composition and β diversity. A Relative abundance of eukaryotic phyla for each experimental condition. B PCoA (top row) and NMDS (bottom row) ordinations of eukaryotic Bray–Curtis dissimilarity. Compositional dissimilarity was calculated independently at ASV level using Bray–Curtis, Euclidean, and Jaccard distances, each of which yielded results consistent with those presented. Point size reflects Shannon diversity. C Relative abundance of prokaryotic phyla for each sample and experimental condition. D PCoA (top row) and NMDS (bottom row) ordinations of prokaryotic Bray–Curtis dissimilarity. Point size reflects Shannon diversity. E Variance explained by treatment, cultivar, growth stage, and interactions thereof on eukaryotic community composition as determined by PERMANOVA with Bray–Curtis dissimilarity. F Variance explained for prokaryotic community composition. G Heatmap of ASVs most influential for pairwise dissimilarity between fixed effect levels. The lowest taxonomic classification for each ASV is displayed below the corresponding column

In like manner, Bray–Curtis dissimilarity matrices were reconstructed with CSS-normalized counts from complete datasets (both mapped and unmapped ASVs), and compositional trends were assessed. Consistent with the prior analysis, growth stage explained the most variance in eukaryotic compositional dissimilarity (PERMANOVA R2 = 0.05, F-value = 1.10, p-value = 0.003), and β dispersion was not significantly heterogeneous among effect levels (Supplementary Table 4, Additional file 2). In addition, treatment had a statistically significant effect on composition (PERMANOVA F-value = 1.10, p-value = 0.02), while cultivar demonstrated a marginally significant effect (PERMANOVA F-value = 1.03, p-value = 0.07) (Supplementary Table 4, Additional file 2). Regarding prokaryotic compositional dissimilarity, growth stage explained the most variance (PERMANOVA R2 = 0.05, F-value = 1.19, p-value = 0.02) rather than a treatment-growth stage interaction as observed with the reduced dataset, yet demonstrated heterogeneous β dispersion (F-value = 3.25, p-value = 0.05) (Supplementary Table 4, Additional file 2). The treatment-growth stage interaction significantly influenced prokaryotic compositional dissimilarity (PERMANOVA F-value = 1.11, p-value = 0.08) (Supplementary Table 4, Additional file 2). No statistical significance was observed in baseline measurements of eukaryotic or prokaryotic β diversity (Supplementary Table 5, Additional file 2).

Complementary to PERMANOVA, the Bray–Curtis indices derived from mapped ASVs were decomposed with the Similarity Percentage method [71] to discern ASVs most influential for pairwise similarity/dissimilarity between fixed effect levels. The five ASVs to which dissimilarity was most attributed were identified for each comparison, and their lowest taxonomic classification retrieved. Interestingly, Cyathus stercoreus and an Acremonium species were identified for all five fixed effect comparisons with the eukaryotic Bray–Curtis matrix, followed by a Nectria species and Phallus rugulosus in four comparisons, and Neocosmospora falciformis in three. Plectosphaerella cucumerina was exclusive to comparisons between vegetative and reproductive growth stages, while Mortierella elongata and a Polymyxa species were exclusive to R6 vs R2 (Fig. 3G). Likewise, ASVs corresponding to Nitrosomonas europaea, Clostridium sporogenes, Bacteroides thetaiotaomicron, and Rhodospirillum rubrum F11 were most influential for all fixed effect comparisons of prokaryotic Bray–Curtis dissimilarity, with Bifidobacterium adolescentis identified in four comparisons and Bradyrhizobium japonicum CCBAU 15618 exclusive to R6 vs R2 (Fig. 3G). Given the vast overlap of these ASVs between pairwise comparisons, and that all demonstrated statistical insignificance (p-value > 0.05 based on 9,999 permutations), ASV identification more likely reflected high abundance/variation across the amplicon datasets than contribution to dissimilarity, which is a common (yet often misinterpreted) element of Similarity Percentage analysis [71].

Microbiota with agriculturally-relevant life strategies exhibited distinct membership trends across fixed effect levels

Core, unique, and differentially abundant taxa were identified at genus and species levels for eukaryotes and prokaryotes, respectively, across all growth stages (baseline and those succeeding biostimulant application). Unsurprisingly, 20/22 core taxa (defined as those with a ≥ 0.5 prevalence across all samples) were eukaryotic genera that mostly exhibit a partly aquatic, saprotrophic lifestyle with filamentous mycelial growth and perithecial fruiting bodies (Fig. 4A). Furthermore, 7/20 (35%) were annotated as plant pathogens (Fig. 4A). The prokaryotic core members included the nitrite-oxidizing bacterium Nitrospira japonica [110] and the scarcely reported bacterium Pseudolabrys taiwanensis [111] (Fig. 4A).

Fig. 4
figure 4

Community membership analyses. Community membership was determined at genus and species levels for eukaryotic and prokaryotic communities, respectively, which were the lowest taxonomic classifications to which 100% ASVs mapped. A Core taxa demonstrating a prevalence ≥ 0.5 across all samples. Left heatmap annotations are taxon metadata, and bottom annotations are sample metadata. B The number of shared and unique taxa by treatment (left), cultivar (middle), and growth stage (right) for eukaryotic (top) and prokaryotic (bottom) communities. The corresponding heatmap displays presence/absence of unique taxa across fixed effect levels (summarized collectively and by domain in left bar plots) in addition to taxon metadata (top annotation). C Differentially abundant taxa between experimental conditions. Left heatmap annotations are taxon metadata, and top annotation is the number of differentially abundant taxa (summarized collectively and by domain) between fixed effect levels (bottom). Eukaryotes are purple and prokaryotes are green for all bar plot annotations

Eukaryotic genera/prokaryotic species unique to a fixed effect level encompassed a collective 217 taxa between treatments (30 eukaryotes, 187 prokaryotes), 296 between cultivars (88 eukaryotes, 208 prokaryotes), and 260 between growth stages (74 prokaryotes, 186 eukaryotes) (Fig. 4B). Those unique to biostimulant-treated samples were all prokaryotes and included four species in the symbiotic genus Bradyrhizobium [112], the additional rhizobia Mesorhizobium ciceri, Mesorhizobium plurifarium, Rhizobium grahamii, and Rhizobium massiliae, Pseudomonas fluorescens [113], three species of Bacillus [114], and 10 Streptomyces species [115] (Fig. 4B). Notable taxa exclusive to the control treatment were the arbuscular mycorrhizal/root-associated genus Paraglomus [116], six plant pathogens (including the soybean disease-causing oomycete genus Phytophthora [117]), and Bradyrhizobium stylosanthis (Fig. 4B).

Perhaps the most distinct trend was the exclusivity of plant pathogens to a particular cultivar. Eight pathogens were unique to CZ4979X (SDS-tolerant cultivar), including Phytophthora and the soybean-parasitizing fungal genus Septoria [118] (Fig. 4B). Other CZ4979X-specific taxa were Bradyrhizobium algeriense, Bradyrhizobium betae, Mesorhizobium plurifarium, Nitrospira multiformis [119], and Pseudomonas fluorescens (Fig. 4B). Samples from CZ4810X, which presumably have heightened susceptibility to the soybean disease SDS in comparison to CZ4979X, had 15 plant pathogens not present in CZ4979X samples, including the soybean disease-causing genera Diaporthe [120], Macrophomina [121], and Rhizoctonia [122] (Fig. 4B). Additional taxa unique to CZ4810X included the genus Paraglomus, Bradyrhizobium lupini, Bradyrhizobium stylosanthis, Mesorhizobium ciceri, Rhizobium cellulosilyticum, and nine species of Streptomyces.

Of the 260 taxa exclusive to a single growth stage, 48 were unique to V1 (18 eukaryotes, 30 prokaryotes), 68 to V6 (17 eukaryotes, 51 prokaryotes), 61 to R2 (22 eukaryotes, 39 prokaryotes), and 83 to R6 (17 eukaryotes, 66 prokaryotes) (Fig. 4B). Those corresponding to V1 included six plant pathogens (including Septoria), Nitrospira multiformis, and Bradyrhizobium algeriense (Fig. 4B). The V6 growth stage was characterized by three distinct plant pathogens, Paraglomus, Bradyrhizobium lupini, two Bacillus species, and nine Streptomyces species (Fig. 4B). Similarly, R2 possessed five unique plant pathogens (including Rhizoctonia), Bradyrhizobium betae, Mesorhizobium plurifarium, and Rhizobium cellulosilyticum (Fig. 4B). The final sampled growth stage contained 5 unique plant pathogens (including Macrophomina, Phytophthora, and Cercospora [123]), two Bacillus species, Bradyrhizobium stylosanthis, and Pseudomonas fluorescens (Fig. 4B). A comprehensive list of condition-specific taxa can be found in Additional file 5.

Differentially abundant taxa between fixed effect levels were determined by fitting CSS-normalized counts with a ZINB regression model. The greatest number of those statistically enriched/depleted (q-value < 0.25) was observed between levels of treatment (22 eukaryotes, 19 prokaryotes, total n = 41), with 18 taxa being enriched and 23 depleted in biostimulant vs control (Fig. 4C). Notably, this included the differential abundance of saprotrophic fungi, the depletion of five fungal pathogens, and the enrichment of Bradyrhizobium elkanii, Bradyrhizobium japonicum, Bradyrhizobium lablabi, and Mesorhizobium amorphae (Fig. 4C). The most depleted taxa in biostimulant-treated samples were the potential human/foodborne pathogenic bacteria Clostridium sporogenes and Escherichia coli [124, 125] (Fig. 4C). Twenty-seven taxa (13 eukaryotes, 14 prokaryotes) were differentially abundant between cultivars, 14 of which were enriched and 13 depleted in CZ4979X vs CZ4810X (Fig. 4C). Most of the identified eukaryotes displayed marginal depletion in CZ4979X (including 3/4 differentially abundant plant pathogens) (Fig. 4C). In contrast, the majority of prokaryotes were enriched, including the inorganic phosphate-solubilizing bacterium Bacillus acidiceler [126], Bradyrhizobium elkanii, Bradyrhizobium japonicum, and Escherichia coli (Fig. 4C).

As expected, an increase in differentially abundant taxa was associated with the temporal distinctiveness of compared growth stages. The greatest number was observed in the R6 vs V1 comparison (distance = 3 growth stages) (23 eukaryotes, 15 prokaryotes, total n = 38), followed by R6 vs V6 (distance = 2 growth stages) (16 eukaryotes, 13 prokaryotes, total n = 29), R2 vs V1 (distance = 2 growth stages) (19 eukaryotes, 10 prokaryotes, total n = 29), V6 vs V1 (distance = 1 growth stage) (14 eukaryotes, 13 prokaryotes, total n = 27), R2 vs V6 (distance = 1 growth stage) (15 eukaryotes, 10 prokaryotes, total n = 25), and R6 vs R2 (distance = 1 growth stage) (12 eukaryotes, 10 prokaryotes, total n = 22) (Fig. 4C). The eukaryotic dataset was defined by the enrichment of saprotrophic genera Acremonium and Chaetomium with the progression of time, which was particularly distinct between vegetative and reproductive growth, and a depletion in the saprotrophic genus Conocybe (Fig. 4C). The reproductive growth stages also displayed an overall enrichment in Microbacterium rhizosphaerae, Bradyrhizobium elkanii, and Bradyrhizobium japonicum, although the latter was depleted at all stages compared to V1 (Fig. 4C). To this end, the V1 growth stage showed unique enrichment of eukaryotic genera Ciliophora, Cyberlindnera, Podospora, and Setophoma, of prokaryotic species Bradyrhizobium japonicum and Pseudarthrobacter chlorophenolicus, and a depletion in the genus Neocosmospora and species Arthrobacter humicola, Bacillus megaterium, and Microlunatus panaciterrae (Fig. 4C). Differentially abundant taxa and associated metadata are provided in Additional file 6.

A treatment-cultivar interaction defined genus-level microbial co-occurrence network structure

Putative genus-genus associations were inferred with Spearman and Pearson correlation methods. The associations were determined from CSS-normalized absolute abundances to mitigate the concomitant limitations of compositionality bias and biases stemming from differential sampling efficiency of taxa [127, 128]. Moreover, the p-values of pairwise associations were corrected for multiple testing given the prevalence of Type I errors during microbial co-occurrence network construction [91]. Herein, the global Spearman co-occurrence network comprised 826 edges (associations) and 188 nodes (genera), all of which presented a low mean relative abundance (< 0.5%) across the dataset (Fig. 5A, C). The global Pearson network possessed 1007 edges and 294 nodes, with evident variability in mean relative abundance observed (Fig. 5B, C). In addition, 462/1,371 (33.7%) of the edges were shared between the networks (Fig. 5C).

Fig. 5
figure 5

Global and condition-specific co-occurrence network analysis. A Global genus-level co-occurrence network constructed by obtaining significant positive and negative pairwise Spearman associations (Rho >  ± 0.6, q-value ≤ 0.05). B Global genus-level co-occurrence network constructed by obtaining significant positive and negative Pearson associations. C Venn diagram of unique and overlapping co-occurrences between Spearman and Pearson global networks. D Condition-specific networks constructed with significant Spearman associations. E Condition-specific networks constructed with significant Pearson associations. F Spearman associations between network density (edge count and node count) and topological features for each set of condition-specific networks. Both x and y axes represent log10 values. G Venn diagram of unique and overlapping co-occurrences between Spearman and Pearson condition-specific networks. (H) Heatmap of Pearson/Spearman condition-specific network nodes. Node color represents Kleinberg hub centrality, with blue reflecting a network member (hub score < 0.2) and tan/red reflecting a network hub (hub score ≥ 0.2). The top annotation represents the number of networks in which a node is a network member (blue) or hub (red). The right annotation shows the number of genera in each condition-specific network and is partitioned by domain (eukaryotes are purple and prokaryotes are green)

Condition-specific co-occurrence networks, defined by treatment-cultivar-growth stage combinations, were constructed with Spearman and Pearson associations as outlined for global networks. The Spearman networks presented an increase in network density with time (vegetative node n = 11.13 ± 2.54, edge n = 16.38 ± 4.88; reproductive node n = 22.88 ± 1.51, edge n = 37.50 ± 4.27), and a potential treatment effect at the V6 growth stage (control node n = 5.0 ± 1.0, edge n = 6; biostimulant node n = 21.0 ± 2.0, edge n = 35.50 ± 1.50) (Fig. 5D; Table 1). Given the density of the biostimulant-CZ4979X-V1 network (node n = 16, edge n = 27), which is the baseline sampling preceding biostimulant application, the variation observed at V6 may better reflect a treatment-cultivar interaction between biostimulant and CZ4810X (Fig. 5D; Table 1). In support of this notion, the greatest trend in Pearson network density was observed in biostimulant-CZ4810X networks at growth stages succeeding biostimulant application (control node n = 99.0 ± 14.42, edge n = 612.33 ± 86.81; biostimulant node n = 136.0 ± 4.58, edge n = 1,533.67 ± 67.85) (Fig. 5E; Table 2). Pearson networks were overall denser than Spearman networks (Spearman node n = 17.0 ± 2.08, edge n = 26.94 ± 4.15; Pearson node n = 107.63 ± 4.78, edge n = 791.50 ± 99.20), which was consistent with the global co-occurrence networks, and 257/10,665 (2.41%) of edges were shared between the association methods (Fig. 5E, G).

Table 1 Topological properties for condition-specific Spearman networks
Table 2 Topological properties for condition-specific Pearson networks

Condition-specific network topology was further defined by centralization degree (the extent to which a single node “controls” a network), cluster count (the number of separate, interconnected groups), connectance (the proportion of all possible links that are actual connections), giant component size (the size of the largest connected subgraph), hub count (the number of nodes with a Kleinberg hub centrality score > 0.2), KS test statistic (how closely degree distribution adheres to a scale-free topology), mean node degree (the average number of connections per node), and modularity (the strength of division into distinct modules/communities) (Tables 1, 2). Of these, centralization degree, cluster count, giant component size, mean degree, and modularity demonstrated a statistically significant, positive association (Spearman’s ρ statistic > 0, p-value < 0.05) with both edge count and node count across Spearman networks, while connectance presented a negative association (Fig. 5F; Table 1). The KS test statistic was negatively associated with edge count (Fig. 5F; Table 1). Conversely, only hub count and mean degree presented significant associations with Pearson network node count (both of which were positive), and centralization degree, connectance, hub count, mean degree, and modularity were associated with edge count (all positive associations except modularity) (Fig. 5F; Table 2). The discrepancy in network topology was reflected in the ranking of co-occurrence networks, as eight of the 10 metrics showed statistically significant differences in conditional ranking between the association methods (Friedman rank sum test p < 0.05) (Supplementary Fig. 3B, Additional file 2). Despite this, all nodes represented in Spearman networks were present in Pearson networks (Supplementary Fig. 3A, Additional file 2), and the hierarchical clustering of Pearson networks by node centrality score supported the overarching trend of treatment-cultivar interaction defining co-occurrence network structure (Fig. 5H).

Edaphic property dynamism and integration within phenotype-taxon networks

To contextualize microbiome dynamics, 24 edaphic parameters were measured for each soil sample, and differences between fixed effect levels and their interactions were determined using GLMMs. Of these, 18 parameters displayed significant variation (p < 0.05) between levels of one or more explanatory variables, with soil K and SOM affected most (n = 5). These were followed by ARS, B, Ca, and P (n = 3), and ALP, Ca/Mg, CEC, Fe, K/Mg, Na, and NO3- (n = 2). The parameters GBA3, Mg, NAG, S, and soil pH each displayed variation between levels of a single explanatory variable (Fig. 6A). Inversely, cultivar explained observed variation for 14 edaphic parameters, 12 of which were increased in CZ4979X vs CZ4810X (Fig. 6A). Ten parameters were increased in V6 vs R2, and six differed significantly in the R6 vs R2 comparison (two increased and four decreased) (Fig. 6A). The explanatory variables treatment, treatment-cultivar interaction, and cultivar-growth stage interaction each explained observed variation in a lesser number of edaphic parameters (Fig. 6A). Furthermore, rank-based measures of association were inferred between edaphic parameters with Spearman’s ρ statistic, rendering 109 significant associations (positive n = 73, negative n = 36) (Fig. 6B). Of these, the Ca/Mg ratio presented the most significant associations of the parameters (positive n = 6, negative n = 9; total n = 15) (Fig. 6B).

Fig. 6
figure 6

Phenotype-taxon networks for edaphic parameters. A Mean estimates for edaphic measure GLMMs. Point size corresponds to hierarchically partitioned R2 values. B Pairwise Spearman associations for edaphic measures. C Genus-level phenotype-taxon networks constructed by coupling lasso regression, reduced GLMs, and co-occurrences (all significant Spearman and Pearson associations). D Heatmap of node composite score (calculated with normalized modularity measures and Kleinberg's hub centrality) for each phenotype-taxon network. The top annotation represents mean composite score across all networks. The right annotation shows the number of nodes in each phenotype-taxon network and is partitioned by domain (eukaryotes are purple and prokaryotes are green). E Relative abundance of nodes in phenotype-taxon networks. The top annotation represents the mean composite score. F Pairwise Spearman associations for the top 20 nodes with respect to mean composite score. G Pairwise Spearman associations for the top 20 nodes and edaphic measures..p ≤ 0.1, *p ≤ 0.05, **p ≤ 0.01, ***p ≤ 0.001

Significant microbial associations spanning one or both global networks were coupled with edaphic data to construct phenotype-taxon networks. Briefly, lasso regression was used to identify taxa putatively associated with each edaphic parameter given its propensity to assign coefficient penalties in instances when sample size is small relative to feature (node) count [89, 129]. A reduced GLM was deployed thereafter to provide directionality to phenotype-node associations and was overlaid with the global microbial association dataset to derive final networks. The phenotype-taxon networks comprised 285.63 ± 1.98 nodes (eukaryotic n = 165.71 ± 1.70, prokaryotic n = 119.92 ± 0.31) and 1,417.46 ± 24.33 edges, and a collective 306 nodes (183 eukaryotes, 123 prokaryotes) were represented in at least one network (Fig. 6C, D). Given the inherent zero inflation in taxon abundance (Fig. 6E), node prioritization was preceded by filtering to those with a ≥ 0.2 prevalence. A composite score was then computed with modularity measures and Kleinberg's hub centrality (see “Materials and Methods”), and the top 20 nodes by mean composite score were extracted. These nodes included 17 eukaryotic genera that are predominantly aquatic/partly aquatic with mycelial growth and a saprotrophic lifestyle, as well as the prokaryotic genera Bradyrhizobium, Lysobacter, and Mycobacterium (Fig. 6F, G). Six of the 20 (30%) were represented in the core microbiome. In addition, seven (35%) of the selected eukaryotic genera were annotated as plant pathogens.

Rank-based measures of association were determined thereafter between prioritized nodes, as well as between nodes and edaphic phenotypes. Thirty node–node associations were significant (positive n = 22, negative n = 8), with the greatest number of those encompassing the pathogenic/saprotrophic genus Fusarium (positive n = 6, negative n = 1, total n = 7) and the saprotrophic genus Neocosmospora (positive n = 5, negative n = 2, total n = 7) (Fig. 6F). The node-phenotype analysis rendered 90 significant associations (positive n = 34, negative n = 56), with the most represented nodes being the pathogenic Apiospora (positive n = 6, negative n = 5, total n = 11) and the saprotrophic Phallus (positive n = 4, negative n = 7, total n = 11), and the most represented phenotypes being B (positive n = 1, negative n = 6, total n = 7) and Buffer pH (positive n = 3, negative n = 4, total n = 7) (Fig. 6G).

Agronomic property dynamism and integration within phenotype-taxon networks

At the R8 growth stage, the agronomic parameters 100-seed weight, aboveground biomass, belowground biomass, pods/plant, and theoretical yield were determined for each of the 16 plots. Variations between levels of treatment, cultivar, and their interaction were then determined using GLMMs. Biostimulant-treated plots had significantly increased 100-seed weight (p-value = 0.05, ME = 0.83), and a marginal decrease was observed in CZ4979X vs CZ4810X (p-value = 0.08, ME = -0.73) (Fig. 7A, B). Expectedly, consistent directional changes were present for theoretical yield, with biostimulant demonstrating a marginal increase (p-value = 0.09, ME = 464.66) over the control, and CZ4979X being decreased in comparison to CZ4979X (albeit insignificantly) (Fig. 7A, B). Above- and belowground biomass showed opposing trends between cultivars, being statistically increased (p-value = 0.002, ME = 3.66) and insignificantly decreased, respectively, in CZ4979X vs CZ4810X (Fig. 7A, B). Both parameters were increased insignificantly in biostimulant vs control (Fig. 7A, B). Lastly, pods/plant was increased significantly in CZ4979X vs CZ4810X (p-value = 0.02, ME = 14.23) and increased insignificantly in biostimulant vs control (Fig. 7A, B). The cultivar explained the highest proportion of variance for aboveground biomass and pods/plant (marginalized R2 = 0.85 and 0.71, respectively), while variance in 100-seed weight, belowground biomass, and theoretical yield were best explained by treatment (marginalized R2 = 0.56, 1.0, and 0.95, respectively).

Fig. 7
figure 7

Phenotype-taxon networks for agronomic parameters. A Agronomic measures across treatments and cultivars. Note that biomass measurements reflect dry weight, and 100-seed weight and theoretical yield were determined at 13% moisture. B Mean estimates for agronomic measure GLMMs. (C) Genus-level phenotype-taxon networks. D Heatmap of node composite score for each phenotype-taxon network. The right annotation shows the mean composite score across all networks for each node. The top annotation shows the number of nodes in each phenotype-taxon network and is partitioned by domain (eukaryotes are purple and prokaryotes are green). E Relative abundance of nodes in phenotype-taxon networks. The top annotation shows the mean composite score. F Pairwise Spearman associations for the top 20 nodes with respect to composite score. G Pairwise Spearman associations for the top 20 nodes and agronomic measures..p ≤ 0.1, *p ≤ 0.05, **p ≤ 0.01, ***p ≤ 0.001

Phenotype-taxon networks were constructed and visualized as described for edaphic parameters, encompassing 128 ± 6.66 nodes (eukaryotic n = 75.20 ± 5.64, prokaryotic n = 52.80 ± 1.56) and 448.20 ± 35.11 edges (Fig. 7C–E). Furthermore, 148 nodes (89 eukaryotes, 59 prokaryotes) were present in one or more of the networks (Fig. 7C, D). Consistent with the previous analysis, node prioritization identified 17 eukaryotic genera that are predominantly aquatic/partly aquatic with mycelial growth and a saprotrophic lifestyle, seven of which are also annotated as plant pathogens (Fig. 7F, G). The remaining genera included the prokaryotes Bradyrhizobium, Burkholderia, and Lysobacter (Fig. 7F, G). Six of the 20 were represented in the core microbiome. In addition, 10 (50%) nodes were prioritized for both agronomic and edaphic networks. Rank-based measures of association were next inferred between the top 20 nodes and between nodes and agronomic parameters. There were 35 significant associations between nodes (positive n = 31, negative n = 4), and Neocosmospora was most represented (positive n = 8, negative n = 1, total n = 9) (Fig. 7F). Additionally, six significant associations were discerned between nodes and parameters, with four nodes positively associated with aboveground biomass (Naegleria, Setophoma, Neonectria, and Fusariella), Leptosphaeria positively associated with pods/plant, and Phaeosphaeriopsis associated negatively with theoretical yield (Fig. 7G).

Discussion

Soil is the most biodiverse habitat on Earth, harboring an estimated 59% of all living organisms [130]. Yet, relatively little is known about the inhabitants of this dynamic ecosystem, their interaction, their collective influence on environmental (and thereby human) health, and the interplay of stochastic and deterministic processes shaping such communities [131,132,133]. Single-molecule-based sequencing stands at the forefront of technologies predicted to clarify these ambiguities inherent in complex microbial systems [134]. To this end, the current study paired the commercial LoopSeq SLR platform with avidity sequencing to profile both eukaryotic and prokaryotic fractions of the soybean rhizosphere microbiome. An in situ experimental design reflected potential environmental dependencies in microbiome structure, which are likely missed in greenhouse/growth chamber experiments [135, 136], yet remain indispensable for practical application of derived inferences [137]. Multiple growth stages, commercial cultivars (genotypes), and biostimulants were also incorporated given their reported effect on rhizosphere microbiome assembly in soybean [13, 17, 18] and other plant systems [138,139,140]. The aim of this approach was to generate a well-resolved depiction of soybean rhizosphere microbiome structure and composition, laying groundwork for future applications in microbiome-based agriculture.

Perhaps the most significant outcome of this study was the taxonomic resolution achieved with both 16S and 18S-ITS SLRs. Traditional short-read amplicon sequencing rarely classifies ASVs beyond genus level, constraining biological inference [29]. For instance, Sugiyama et al. [141] suggested that soybean demonstrates species- and even strain-level selection of Bradyrhizobia based upon stark abundance patterns of ASVs/OTUs with Bradyrhizobium annotation; yet, this notion could not be verified given the limited resolution discerned with pyrosequencing. In the present work, assembling all 9 hypervariable regions of the 16S rRNA gene assigned prokaryotes to at least species level, with nearly one-fourth of mapped, full-length ASVs obtaining strain-level classification. This included the identification of 13 Bradyrhizobium species, some of which demonstrated exclusivity and/or differential abundance (i.e., putative selection) across experimental conditions. Furthermore, a subset of ASVs corresponding to Bradyrhizobium elkanii and Bradyrhizobium japonicum (genus members with the greatest absolute abundance) were resolved at strain level (one and four strains, respectively). This result further coincides with prior studies wherein Bradyrhizobium elkanii and Bradyrhizobium japonicum were the predominant species to nodulate soybean [142].

In eukaryotic microbial community analysis, the de novo-assembled 18S-ITS1-ITS2 molecules facilitated genus-level taxonomic assignment for all mapped ASVs, and species level assignment for approximately 45%. This strategy effectively captured diverse fungal taxa with agricultural importance, such as soybean-parasitizing genera. Beyond fungi, the analysis identified five kingdoms encompassing 19 genera of protists, including Phytophthora and Pythium [143]. Assessing soil-dwelling prokaryotic, fungal, and protist communities in tandem bears significance given that general primers do not exist for short-read amplicon profiling of protists [19, 144] and the understated yet significant role of protists in the soil microbiome [145]. Furthermore, the resolution achieved here permitted the automated retrieval of taxonomy-based functional annotations, allowing for highly reproducible biological inference without the need for sequence-based functional prediction.

Measures of α and β diversity suggested an overarching temporal effect on microbiome structure and composition, with more subtle trends attributed to treatment, cultivar, and fixed effect interaction. These findings were consistent with Moroenyane et al. [146], wherein spatial and temporal dynamics were key modulators of α and β diversity in the soybean rhizosphere microbiome. Further, α diversity aligned explicitly with the work of Longley et al. [13]. In both studies, eukaryotic and prokaryotic richness were decreased at the R2 growth stage and increased by R6 [13]. Shannon diversity trends matched the no-till soil findings of Longley et al. [13], showing reduced eukaryotic diversity at R6 compared to R2, with prokaryotes exhibiting the inverse trend. The authors of the compared study noted that their results deviated from prior research, postulating that management could account for the discrepancy [13]. In this regard, the accordance between the current and prior work may be attributed to the absence of tillage in both experimental designs. Agreeance may also reflect growth stage selection, as bacterial diversity in the soybean rhizosphere has been evidenced to increase between R1 and R5 and then decrease from R5 to R8 [147]. Moreover, the sole use of full-length contigs for analysis may have impacted diversity estimates, potentially excluding shorter sequences that contribute to overall α and β diversity.

Consistent with previous studies, the eukaryotic microbiome composition was predominantly Ascomycota [9, 13], while the prokaryotic fraction was largely represented by Proteobacteria [9, 18], as evidenced by taxonomic classification of mapped ASVs. The β diversity patterns echoed findings from Moroenyane et al. [146] in which growth stage (and interactions comprised thereof) best explained compositional dissimilarity, yet also displayed significant heterogeneous dispersion, across both eukaryotic and prokaryotic communities. In the current study, a spatial effect was evident in the eukaryotic microbiome composition, with samples clustering by field location. Given the experimental setup replicated a conventional row crop system, this may reflect an "edge-of-field" effect, with plots near turnrows receiving varied moisture or amendment applications. Comparable findings were reported by Longley et al. [13] wherein management strategy (i.e., conventional, no-till, organic) rendered distinct clustering of eukaryotic rhizosphere communities, with such trends absent for prokaryotic communities [13]. Notably, heterogeneity arising from field location was controlled statistically in all analyses.

Community membership revealed core, unique, and differentially abundant taxa across fixed effect levels. The core microbiome is a crucial element for rhizosphere microbiome assembly and consequent plant growth promotion [148, 149]. Thus, it is unsurprising that the core microbiome in this study was enriched with saprotrophic fungi, which decompose organic matter, contribute to nutrient cycling, and support soil structure [150]. Unique taxon identification reinforced the supposition of Sugiyama et al. [141] that Bradyrhizobia are subject to species-level selection, and implicated strong host selectivity of parasites/pathogens and mutualists. With regard to the latter, numerous plant pathogens were exclusive to the rhizosphere of the SDS-susceptible soybean cultivar, particularly at later growth stages. This could be attributed to compromised defense mechanisms of the susceptible cultivar, allowing opportunistic pathogens to colonize and proliferate, or possibly due to specific root exudates from this cultivar that inadvertently promote the growth of these pathogens. The exclusivity of Streptomyces panaciradicis, Pseudomonas fluorescens, and Streptomyces griseoplanus in the SDS-tolerant rhizosphere may also reflect host selection, as the two former have been leveraged as biocontrol agents against Fusarium pathogens [151, 152] and the latter as a biocontrol agent against Macrophomina [153]. The Pseudomonas genus has also been associated with SDS-suppressive soils spanning 45 soybean fields [10]. Lastly, the enrichment of Bradyrhizobia in CZ4979X vs CZ4810X and in Biostimulant vs Control further supports the exclusivity of Pseudomonas fluorescens, which is well-evidenced to interact synergistically with Bradyrhizobium japonicum [154, 155]. These microbial dynamics in the soybean rhizosphere highlight potential avenues for targeted crop protection, improved soil health, and optimized disease-resistant breeding.

Putative co-occurrences between prokaryotic, fungal, and protist genera were determined using Spearman and Pearson association methods. The Pearson networks exhibited greater complexity and more pronounced variability in node relative abundance compared to the Spearman networks. This disparity could be influenced by Spearman's method of assigning similar rank values to taxa with minimal or zero abundances, leading to simpler network structure and a reduced representation of high-abundance taxa [156]. Conversely, Pearson's sensitivity to actual data magnitudes may amplify the presence of notably abundant taxa, resulting in networks with a broader range of densities and node abundances [156]. Nonetheless, the dominant effect of treatment-cultivar interaction on condition-specific network structure was persistent across the association methods. In like manner, Liu et al. [18] noted a subtle genotype effect on soybean rhizosphere microbial co-occurrence network structure. Due to the reported effects of biostimulant application on soybean agronomic performance [157] and microbial network structure in other environments [158], it is also logical to presume its influence on network structure in the present work. Still, one must consider such findings as preliminary, given the shortcomings in inferring ecological interaction from co-occurrence [159] and that mapped ASVs were used exclusively for co-occurrence network construction, the latter of which could influence network structure and node prioritization. It is therefore recommended to complement network analysis with additional measurements for more robust hypothesis-driven research [160].

In this manner, 24 edaphic measurements were collected for each soil sample, evaluated with GLMMs and rank-based associations, and incorporated into phenotype-taxon networks. Soil organic matter (SOM) was among the most dynamic parameters assessed, displaying significant variation between five fixed effect level comparisons. This may reflect robust organic macromolecule depolymerization given the observed enrichment of saprotrophs in the eukaryotic rhizosphere microbiome [161] and Proteobacteria in the prokaryotic fraction [162], and perhaps coincides with the establishment of nodulation [163]. The edaphic data were used independently to construct phenotype-taxon networks based on the framework of Poudel et al. [89]. A more exhaustive approach was used to prioritize nodes by modularity and centrality, accentuating microbial taxa with both module-specific and network-wide influence. As expected, the prioritized nodes for edaphic networks were mostly saprotrophic eukaryotes. Moreover, the core rhizosphere microbiome has been shown to interact with more transient taxa via competition and cooperation, being central for microbial network structure and functional stability [149, 164]. In line with this, nearly one-third of the prioritized nodes for edaphic networks were members of the core microbiome. Other identified nodes reinforced trends in edaphic measures (e.g., the β-Proteobacteria genus Burkholderia and the α-Proteobacteria genus Bradyrhizobium are prominent lignin decomposers that can nodulate soybean [162, 165]).

The microbiome dataset was further contextualized by taking agronomic measurements at the end of the growing season. The most apparent trend was that biostimulant application increased every measured trait, with variation in 100-seed weight and theoretical yield being statistically significant. Additionally, the SDS-susceptible variety had heightened 100-seed weight and theoretical yield in comparison to the tolerant cultivar despite having reduced pods/plant, aboveground biomass, and increased pathogens in the rhizosphere, implicating a putative fitness cost associated with genetic resistance/tolerance in the absence of disease [166]. Notably, each replicate for 100-seed weight and theoretical yield encompassed approximately 40 plants in a manner aligned with yield plot trials. Network analysis and node prioritization were consistent with that for edaphic properties, highlighting saprotrophic eukaryotes, SOM-decomposing/nitrogen-fixing bacteria, and members of the core microbiome. As evidenced, complementing co-occurrence networks with phenotypic data provides improved ecological context that can guide the practical application of derived inferences (e.g., through the design and implementation of synthetic microbial communities) [89].

Conclusions

The defined study provides a taxonomically resolved view of the soybean rhizosphere microbiome. Unique in its design, this research was carried out in situ, circumventing the often-observed discrepancy where taxa linked to host fitness in controlled settings fail to replicate symbiont status under field conditions [136]. This study revealed that both eukaryotic and prokaryotic rhizosphere microbiomes display structural and compositional variation in response to treatment, cultivar, and growth stage, consistent with earlier studies primarily leveraging short-read sequencing. Furthermore, the novelty of the present work was well-accentuated through community membership analysis, where taxonomic resolution permitted taxonomy-based functional annotation, identifying an ecologically relevant, saprotroph-rich core microbiome and demonstrating empirical evidence for host selection of mutualistic taxa and concomitant pathogen restriction. The use of multiple association methods for microbial co-occurrence network construction and the comprehensive assessment of network topology underscored the influence of experimental conditions (biostimulant application and cultivar) on co-occurrence network structure. Moreover, the augmentation of such networks with edaphic and agronomic data, complemented by regularized linear regression and a novel node prioritization criterion, identified microbial genera which may be leveraged for sustainable agriculture, many of which are known for their ecological significance. In conclusion, the application of synthetic long-read technology and an in situ experimental design yielded an unparalleled understanding of the soybean rhizosphere microbiome, signifying a considerable advancement in crop microbiome research with practical implications for microbiome-based agriculture.

Availability of data and materials

The 16S and 18S-ITS FASTQ files are available in the NCBI Sequence Read Archive database (16S BioProject ID: PRJNA1062839; 18S-ITS BioProject ID: PRJNA1063224). R Markdown and RData files required for the reproduction of this work are available at https://github.com/brett-hale/Hale_2024_SLR.git.

References

  1. Turner TR, James EK, Poole PS. The plant microbiome. Genome Biol. 2013;14(6):1. https://doi.org/10.1186/gb-2013-14-6-209.

    Article  CAS  Google Scholar 

  2. Hirsch AM. Plant-microbe symbioses: a continuum from commensalism to parasitism. Symbiosis. 2004;37(1–3):345–63.

    CAS  Google Scholar 

  3. Kiers ET, Heijden MG. Mutualistic stability in the arbuscular mycorrhizal symbiosis: exploring hypotheses of evolutionary cooperation. Ecology. 2006;87(7):1627–36. https://doi.org/10.1890/0012-9658(2006)87[1627:MSITAM]2.0.CO;2.

    Article  PubMed  Google Scholar 

  4. Trivedi P, Leach JE, Tringe SG, Sa T, Singh BK. Plant–microbiome interactions: from community assembly to plant health. Nat Rev Microbiol. 2020;18(11):607–21. https://doi.org/10.1038/s41579-020-0412-1.

    Article  CAS  PubMed  Google Scholar 

  5. Bai B, Liu W, Qiu X, Zhang J, Zhang J, Bai Y. The root microbiome: Community assembly and its contributions to plant fitness. J Integr Plant Biol. 2022;64(2):230–43. https://doi.org/10.1111/jipb.13226.

    Article  PubMed  Google Scholar 

  6. Zhang J, Cook J, Nearing JT, Zhang J, Raudonis R, Glick BR, Langille MG, Cheng Z. Harnessing the plant microbiome to promote the growth of agricultural crops. Microbiol Res. 2021;245: 126690. https://doi.org/10.1016/j.micres.2020.126690.

    Article  CAS  PubMed  Google Scholar 

  7. Tsiknia M, Tsikou D, Papadopoulou KK, Ehaliotis C. Multi-species relationships in legume roots: from pairwise legume-symbiont interactions to the plant–microbiome–soil continuum. FEMS Microbiol Ecol. 2021;97(2):fiaa222. https://doi.org/10.1093/femsec/fiaa222.

    Article  CAS  PubMed  Google Scholar 

  8. Mendes LW, Kuramae EE, Navarrete AA, Van Veen JA, Tsai SM. Taxonomical and functional microbial community selection in soybean rhizosphere. ISME J. 2014;8(8):1577–87. https://doi.org/10.1038/ismej.2014.17.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Zhang B, Zhang J, Liu Y, Shi P, Wei G. Co-occurrence patterns of soybean rhizosphere microbiome at a continental scale. Soil Biol Biochem. 2018;118:178–86. https://doi.org/10.1016/j.soilbio.2017.12.011.

    Article  CAS  Google Scholar 

  10. Srour AY, Gibson DJ, Leandro LF, Malvick DK, Bond JP, Fakhoury AM. Unraveling microbial and edaphic factors affecting the development of sudden death syndrome in soybean. Phytobiomes. 2017;1(2):91–101. https://doi.org/10.1094/PBIOMES-02-17-0009-R.

    Article  Google Scholar 

  11. Díaz-Cruz GA, Cassone BJ. Changes in the phyllosphere and rhizosphere microbial communities of soybean in the presence of pathogens. FEMS Microbiol Ecol. 2022;98(3):fiac022. https://doi.org/10.1093/femsec/fiac022.

    Article  CAS  PubMed  Google Scholar 

  12. Hamid MI, Hussain M, Wu Y, Zhang X, Xiang M, Liu X. Successive soybean-monoculture cropping assembles rhizosphere microbial communities for the soil suppression of soybean cyst nematode. FEMS Microbiol Ecol. 2017;93(1):fiw222. https://doi.org/10.1093/femsec/fiw222.

    Article  CAS  Google Scholar 

  13. Longley R, Noel ZA, Benucci GM, Chilvers MI, Trail F, Bonito G. Crop management impacts the soybean (Glycine max) microbiome. Front Microbiol. 2020;3(11):1116. https://doi.org/10.3389/fmicb.2020.01116.

    Article  Google Scholar 

  14. Goss-Souza D, Mendes LW, Borges CD, Rodrigues JL, Tsai SM. Amazon forest-to-agriculture conversion alters rhizosphere microbiome composition while functions are kept. FEMS Microbiol Ecol. 2019;95(3):fiz009. https://doi.org/10.1093/femsec/fiz009.

    Article  CAS  PubMed  Google Scholar 

  15. Goss-Souza D, Mendes LW, Rodrigues JL, Tsai SM. Ecological processes shaping bulk soil and rhizosphere microbiome assembly in a long-term Amazon forest-to-agriculture conversion. Microb Ecol. 2020;79:110–22. https://doi.org/10.1007/s00248-019-01401-y.

    Article  CAS  PubMed  Google Scholar 

  16. Han LL, Wang JT, Yang SH, Chen WF, Zhang LM, He JZ. Temporal dynamics of fungal communities in soybean rhizosphere. J Soils Sedim. 2017;17:491–8. https://doi.org/10.1007/s11368-016-1534-y.

    Article  CAS  Google Scholar 

  17. Sugiyama A, Ueda Y, Zushi T, Takase H, Yazaki K. Changes in the bacterial community of soybean rhizospheres during growth in the field. PLoS ONE. 2014;9(6): e100709. https://doi.org/10.1371/journal.pone.0100709.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Liu F, Hewezi T, Lebeis SL, Pantalone V, Grewal PS, Staton ME. Soil indigenous microbiome and plant genotypes cooperatively modify soybean rhizosphere microbiome assembly. BMC Microbiol. 2019;19(1):1–19. https://doi.org/10.1186/s12866-019-1572-x.

    Article  CAS  Google Scholar 

  19. Baldrian P. The known and the unknown in soil microbial ecology. FEMS Microbiol Ecol. 2019;95(2):fiz005. https://doi.org/10.1093/femsec/fiz005.

    Article  CAS  Google Scholar 

  20. Gupta S, Mortensen MS, Schjørring S, Trivedi U, Vestergaard G, Stokholm J, Bisgaard H, Krogfelt KA, Sørensen SJ. Amplicon sequencing provides more accurate microbiome information in healthy children compared to culturing. Commun Biol. 2019;2(1):291. https://doi.org/10.1038/s42003-019-0540-1.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Usyk M, Peters BA, Karthikeyan S, McDonald D, Sollecito CC, Vazquez-Baeza Y, Shaffer JP, Gellman MD, Talavera GA, Daviglus ML, Thyagarajan B. Comprehensive evaluation of shotgun metagenomics, amplicon sequencing, and harmonization of these platforms for epidemiological studies. Cell Rep Methods. 2023;3(1):100391. https://doi.org/10.1016/j.crmeth.2022.100391.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Matsuo Y, Komiya S, Yasumizu Y, Yasuoka Y, Mizushima K, Takagi T, Kryukov K, Fukuda A, Morimoto Y, Naito Y, Okada H. Full-length 16S rRNA gene amplicon analysis of human gut microbiota using MinION™ nanopore sequencing confers species-level resolution. BMC Microbiol. 2021;21:1–3. https://doi.org/10.1186/s12866-021-02094-5.

    Article  CAS  Google Scholar 

  23. Clarridge JE III. Impact of 16S rRNA gene sequence analysis for identification of bacteria on clinical microbiology and infectious diseases. Clin Microbiol Rev. 2004;17(4):840–62. https://doi.org/10.1128/cmr.17.4.840-862.2004.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Hugerth LW, Muller EE, Hu YO, Lebrun LA, Roume H, Lundin D, Wilmes P, Andersson AF. Systematic design of 18S rRNA gene primers for determining eukaryotic diversity in microbial consortia. PLoS ONE. 2014;9(4): e95567. https://doi.org/10.1371/journal.pone.0095567.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Schoch CL, Seifert KA, Huhndorf S, Robert V, Spouge JL, Levesque CA, Chen W, Fungal Barcoding Consortium, Fungal Barcoding Consortium Author List, Bolchacova E, Voigt K. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proc Natl Acad Sci. 20127;109(16):6241–6. https://doi.org/10.1073/pnas.1117018109

  26. Porras-Alfaro A, Liu KL, Kuske CR, Xie G. From genus to phylum: large-subunit and internal transcribed spacer rRNA operon regions show similar classification accuracies influenced by database composition. Appl Environ Microbiol. 2014;80(3):829–40. https://doi.org/10.1128/AEM.02894-13.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Huntley J, Fierer N, Owens SM, Betley J, Fraser L, Bauer M, Gormley N. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J. 2012;6(8):1621–4. https://doi.org/10.1038/ismej.2012.8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Guo F, Ju F, Cai L, Zhang T. Taxonomic precision of different hypervariable regions of 16S rRNA gene and annotation methods for functional bacterial groups in biological wastewater treatment. PLoS ONE. 2013;8(10): e76185. https://doi.org/10.1371/journal.pone.0076185.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Yu T, Cheng L, Liu Q, Wang S, Zhou Y, Zhong H, Tang M, Nian H, Lian T. Effects of waterlogging on soybean rhizosphere bacterial community using V4, LoopSeq, and PacBio 16S rRNA Sequence. Microbiol Spectrum. 2022;10(1):e02011-e2021. https://doi.org/10.1128/spectrum.02011-21.

    Article  CAS  Google Scholar 

  30. Tedersoo L, Tooming-Klunderud A, Anslan S. PacBio metabarcoding of Fungi and other eukaryotes: errors, biases and perspectives. New Phytol. 2018;217(3):1370–85. https://doi.org/10.1111/nph.14776.

    Article  CAS  PubMed  Google Scholar 

  31. Nilsson RH, Anslan S, Bahram M, Wurzbacher C, Baldrian P, Tedersoo L. Mycobiome diversity: high-throughput sequencing and identification of fungi. Nat Rev Microbiol. 2019;17(2):95–109. https://doi.org/10.1038/s41579-018-0116-y.

    Article  CAS  PubMed  Google Scholar 

  32. Kono N, Arakawa K. Nanopore sequencing: Review of potential applications in functional genomics. Dev Growth Differ. 2019;61(5):316–26. https://doi.org/10.1111/dgd.12608.

    Article  PubMed  Google Scholar 

  33. Callahan BJ, Grinevich D, Thakur S, Balamotis MA, Yehezkel TB. Ultra-accurate microbial amplicon sequencing with synthetic long reads. Microbiome. 2021;9(1):1–3. https://doi.org/10.1186/s40168-021-01072-3.

    Article  CAS  Google Scholar 

  34. Jeong J, Yun K, Mun S, Chung WH, Choi SY, Nam YD, Lim MY, Hong CP, Park C, Ahn YJ, Han K. The effect of taxonomic classification by full-length 16S rRNA sequencing with a synthetic long-read technology. Sci Rep. 2021;11(1):1727. https://doi.org/10.1038/s41598-020-80826-9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Arslan S, Garcia FJ, Guo M, Kellinger MW, Kruglyak S, LeVieux JA, Mah AH, Wang H, Zhao J, Zhou C, Altomare A. Sequencing by avidity enables high accuracy with low reagent consumption. Nat Biotechnol. 2023;25:1–7. https://doi.org/10.1038/s41587-023-01750-7.

    Article  CAS  Google Scholar 

  36. Fehr WR, Caviness CE. Stages of soybean development. Special Report. 87. Co-operative Extension Service. Iowa State University, Ames, Iowa. 1977.

  37. Durán P, Thiergart T, Garrido-Oter R, Agler M, Kemen E, Schulze-Lefert P, Hacquard S. Microbial interkingdom interactions in roots promote Arabidopsis survival. Cell. 2018;175(4):973–83. https://doi.org/10.1016/j.cell.2018.10.020.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Sugiyama A. The soybean rhizosphere: Metabolites, microbes, and beyond—a review. J Adv Res. 2019;19:67–73. https://doi.org/10.1016/j.jare.2019.03.005.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Salter SJ, Cox MJ, Turek EM, Calus ST, Cookson WO, Moffatt MF, Turner P, Parkhill J, Loman NJ, Walker AW. Reagent and laboratory contamination can critically impact sequence-based microbiome analyses. BMC Biol. 2014;12:1–2. https://doi.org/10.1186/s12915-014-0087-z.

    Article  CAS  Google Scholar 

  40. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. https://doi.org/10.1093/bioinformatics/btu170.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19(5):455–77. https://doi.org/10.1089/cmb.2012.0021.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJ, Holmes SP. DADA2: High-resolution sample inference from Illumina amplicon data. Nat Methods. 2016;13(7):581–3. https://doi.org/10.1038/nmeth.3869.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2012;41(D1):D590–6. https://doi.org/10.1093/nar/gks1219.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Nilsson RH, Larsson KH, Taylor AF, Bengtsson-Palme J, Jeppesen TS, Schigel D, Kennedy P, Picard K, Glöckner FO, Tedersoo L, Saar I. The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications. Nucleic Acids Res. 2019;47(D1):D259–64. https://doi.org/10.1093/nar/gky1022.

    Article  CAS  PubMed  Google Scholar 

  45. Kõljalg U, Nilsson HR, Schigel D, Tedersoo L, Larsson KH, May TW, Taylor AF, Jeppesen TS, Frøslev TG, Lindahl BD, Põldmaa K. The taxon hypothesis paradigm—on the unambiguous detection and communication of taxa. Microorganisms. 2020;8(12):1910. https://doi.org/10.3390/microorganisms8121910.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. RStudio T. RStudio: integrated development for R. Rstudio Team, PBC, Boston, MA. http://www.rstudio.com. 2020.

  47. McMurdie PJ, Holmes S. phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS ONE. 2013;8(4): e61217. https://doi.org/10.1371/journal.pone.0061217.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Davis NM, Proctor DM, Holmes SP, Relman DA, Callahan BJ. Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data. Microbiome. 2018;6:1–4. https://doi.org/10.1186/s40168-018-0605-2.

    Article  Google Scholar 

  49. Chao A. Nonparametric estimation of the number of classes in a population. Scand J Stat. 1984;11:265–70.

    Google Scholar 

  50. Simpson EH. Measurement of diversity. Nature. 1949;163(4148):688. https://doi.org/10.1038/163688a0.

    Article  Google Scholar 

  51. Hill MO. Diversity and evenness: a unifying notation and its consequences. Ecology. 1973;54(2):427–32. https://doi.org/10.2307/1934352.

    Article  Google Scholar 

  52. Pielou EC. The measurement of diversity in different types of biological collections. J Theor Biol. 1966;13:131–44. https://doi.org/10.1016/0022-5193(66)90013-0.

    Article  Google Scholar 

  53. Lahti L, Shetty S. Introduction to the microbiome R package. Preprint at https://microbiome.github.io/tutorials. 2018.

  54. Spearman C. The proof and measurement of association between two things. https://doi.org/10.1037/11491-005

  55. Wickham H. ggplot2. Wiley interdisciplinary reviews: computational statistics. 2011;3(2):180–5. https://doi.org/10.1002/wics.147.

    Article  Google Scholar 

  56. Shapiro SS, Wilk MB. An analysis of variance test for normality (complete samples). Biometrika. 1965;52(3/4):591–611. https://doi.org/10.2307/2333709.

    Article  Google Scholar 

  57. Delignette-Muller ML, Dutang C. fitdistrplus: An R package for fitting distributions. J Stat Softw. 2015;64:1–34. https://doi.org/10.18637/jss.v064.i04.

    Article  Google Scholar 

  58. Fox J, Weisberg S, Adler D, Bates D, Baud-Bovy G, Ellison S, Firth D, Friendly M, Gorjanc G, Graves S, Heiberger R. Package ‘car.’ Vienna: R Foundation for Statistical Computing; 2012. p. 16.

    Google Scholar 

  59. Ripley B, Venables B, Bates DM, Hornik K, Gebhardt A, Firth D, Ripley MB. Package ‘mass.’ Cran r. 2013;8(538):113–20.

    Google Scholar 

  60. Brooks ME, Kristensen K, Van Benthem KJ, Magnusson A, Berg CW, Nielsen A, Skaug HJ, Machler M, Bolker BM. glmmTMB balances speed and flexibility among packages for zero-inflated generalized linear mixed modeling. R J. 2017;9(2):378–400. https://doi.org/10.3929/ethz-b-000240890.

    Article  Google Scholar 

  61. Barton K. MuMIn: multi-model inference. R package version 1. 0. 0. http://r-forge.r-project.org/projects/mumin/. 2009.

  62. Burnham KP, Anderson DR. Model selection and multimodel inference. A practical information-theoretic approach. 2004;2.

  63. Lüdecke D, Ben-Shachar MS, Patil I, Waggoner P, Makowski D (2021) performance: An R package for assessment, comparison and testing of statistical models. J Open Source Softw 6(60): 3139. https://doi.org/10.21105/joss.03139

  64. Lai J, Zou Y, Zhang S, Zhang X, Mao L. glmm. hp: an R package for computing individual effect of predictors in generalized linear mixed models. J Plant Ecol. 2022;15(6):1302–7. https://doi.org/10.1093/jpe/rtac096.

    Article  Google Scholar 

  65. Green P, MacLeod CJ. SIMR: an R package for power analysis of generalized linear mixed models by simulation. Methods Ecol Evol. 2016;7(4):493–8. https://doi.org/10.1111/2041-210X.12504.

    Article  Google Scholar 

  66. Leeper TJ. Interpreting regression results using average marginal effects with R’s margins. Available at the comprehensive R Archive Network (CRAN). 2017 Mar 22:1-32.

  67. Paulson JN, Stine OC, Bravo HC, Pop M. Differential abundance analysis for microbial marker-gene surveys. Nat Methods. 2013;10(12):1200–2. https://doi.org/10.1038/nmeth.2658.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Bray JR, Curtis JT. An ordination of the upland forest communities of southern Wisconsin. Ecol Monogr. 1957;27(4):326–49. https://doi.org/10.2307/1942268.

    Article  Google Scholar 

  69. Jaccard P. Nouvelles recherches sur la distribution florale. Bull Soc Vaud Sci Nat. 1908;44:223–70.

    Google Scholar 

  70. Dixon P. VEGAN, a package of R functions for community ecology. J Veg Sci. 2003;14(6):927–30. https://doi.org/10.1111/j.1654-1103.2003.tb02228.x.

    Article  Google Scholar 

  71. Clarke KR. Non-parametric multivariate analyses of changes in community structure. Aust J Ecol. 1993;18(1):117–43. https://doi.org/10.1111/j.1442-9993.1993.tb00438.x.

    Article  Google Scholar 

  72. Gu Z. Complex heatmap visualization. Imeta. 2022;1(3): e43. https://doi.org/10.1002/imt2.43.

    Article  PubMed  PubMed Central  Google Scholar 

  73. Põlme S, Abarenkov K, Henrik Nilsson R, Lindahl BD, Clemmensen KE, Kauserud H, Nguyen N, Kjøller R, Bates ST, Baldrian P, Frøslev TG. FungalTraits: a user-friendly traits database of fungi and fungus-like stramenopiles. Fungal Divers. 2020;105:1–6. https://doi.org/10.1007/s13225-020-00466-2.

    Article  Google Scholar 

  74. Reimer LC, Sardà Carbasse J, Koblitz J, Ebeling C, Podstawka A, Overmann J. Bac Dive in 2022: the knowledge base for standardized bacterial and archaeal data. Nucleic Acids Res. 2022;50(D1):D741–6. https://doi.org/10.1093/nar/gky879.

    Article  CAS  PubMed  Google Scholar 

  75. M. Goeker, BacDive: BacDive API Client.

  76. Smith S. phylosmith: an R-package for reproducible and efficient microbiome analysis with phyloseq-objects. J Open Source Softw. 2019;4(38): 1442. https://doi.org/10.21105/joss.01442

  77. Yan L, Yan ML. Package ‘ggvenn’.

  78. Mallick H, Rahnavard A, McIver LJ, Ma S, Zhang Y, Nguyen LH, Tickle TL, Weingart G, Ren B, Schwager EH, Chatterjee S. Multivariable association discovery in population-scale meta-omics studies. PLoS Comput Biol. 2021;17(11): e1009442. https://doi.org/10.1371/journal.pcbi.1009442.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  79. Boolchandani M, Blake KS, Tilley DH, Cabada MM, Schwartz DJ, Patel S, Morales ML, Meza R, Soto G, Isidean SD, Porter CK. Impact of international travel and diarrhea on gut microbiome and resistome dynamics. Nat Commun. 2022;13(1):7485. https://doi.org/10.1038/s41467-022-34862-w.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  80. Huang B, Chau SW, Liu Y, Chan JW, Wang J, Ma SL, Zhang J, Chan PK, Yeoh YK, Chen Z, Zhou L. Gut microbiome dysbiosis across early Parkinson’s disease, REM sleep behavior disorder and their first-degree relatives. Nat Commun. 2023;14(1):2501. https://doi.org/10.1038/s41467-023-38248-4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  81. Wen T, Xie P, Yang S, Niu G, Liu X, Ding Z, Xue C, Liu YX, Shen Q, Yuan J. ggClusterNet: an R package for microbiome network analysis and modularity-based multiple network layouts. Meta. 2022;1(3):e32. https://doi.org/10.1002/imt2.32.

    Article  Google Scholar 

  82. Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal Complex Syst. 2006;1695(5):1–9.

    Google Scholar 

  83. Pons P, Latapy M. Computing communities in large networks using random walks. J Graph Algorithms Appl. 2006;10(2):191–218. https://doi.org/10.7155/jgaa.00124.

    Article  Google Scholar 

  84. Ma B, Wang Y, Ye S, Liu S, Stirling E, Gilbert JA, Faust K, Knight R, Jansson JK, Cardona C, Röttjers L. Earth microbial co-occurrence network reveals interconnection pattern across microbiomes. Microbiome. 2020;8:1–2. https://doi.org/10.1186/s40168-020-00857-2.

    Article  CAS  Google Scholar 

  85. Kleinberg JM. Authoritative sources in a hyperlinked environment. J ACM (JACM). 1999;46(5):604–32. https://doi.org/10.1145/324133.324140.

    Article  Google Scholar 

  86. Sharma AK, Davison S, Pafco B, Clayton JB, Rothman JM, McLennan MR, Cibot M, Fuh T, Vodicka R, Robinson CJ, Petrzelkova K. The primate gut mycobiome-bacteriome interface is impacted by environmental and subsistence factors. npj Biofilms Microbiomes. 2022;8(1):12. https://doi.org/10.1038/s41522-022-00274-3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  87. Pearson K. Determination of the coefficient of correlation. Science. 1909;30(757):23–5. https://doi.org/10.1126/science.30.757.23.

    Article  CAS  PubMed  Google Scholar 

  88. Hollander M, Wolfe DA, Chicken E. Nonparametric statistical methods. New York: Wiley; 2013.

    Google Scholar 

  89. Poudel R, Jumpponen A, Kennelly MM, Rivard C, Gomez-Montano L, Garrett KA. Integration of phenotypes in microbiome networks for designing synthetic communities: a study of mycobiomes in the grafted tomato system. Appl Environ Microbiol. 2023;24:e01843-e1922. https://doi.org/10.1128/aem.01843-22.

    Article  CAS  Google Scholar 

  90. Doulcier G, Stouffer D. Rnetcarto: Fast network modularity and roles computation by simulated annealing. R package version 0.2. 2015;4.

  91. Röttjers L, Faust K. From hairballs to hypotheses–biological insights from microbial networks. FEMS Microbiol Rev. 2018;42(6):761–80. https://doi.org/10.1093/femsre/fuy030.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  92. Moscatelli MC, Lagomarsino A, Garzillo AM, Pignataro A, Grego S. β-Glucosidase kinetic parameters as indicators of soil quality under conventional and organic cropping systems applying two analytical approaches. Ecol Ind. 2012;13(1):322–7. https://doi.org/10.1016/j.ecolind.2011.06.031.

    Article  CAS  Google Scholar 

  93. Deng S, Popova I. Carbohydrate hydrolases. Methods Soil Enzymol. 2011;9:185–209. https://doi.org/10.2136/sssabookser9.c9.

    Article  CAS  Google Scholar 

  94. Parham JA, Deng SP. Detection, quantification and characterization of β-glucosaminidase activity in soil. Soil Biol Biochem. 2000;32(8–9):1183–90. https://doi.org/10.1016/S0038-0717(00)00034-1.

    Article  CAS  Google Scholar 

  95. Nannipieri P, Giagnoni L, Landi L, Renella G. Role of phosphatase enzymes in soil. Phosphorus in action: biological processes in soil phosphorus cycling. Plants (Basel) 2011:215–43. https://doi.org/10.1007/978-3-642-15271-9_9

  96. Tabatabai MA, Bremner JM. Arylsulfatase activity of soils. Soil Sci Soc Am J. 1970;34(2):225–9. https://doi.org/10.2136/sssaj1970.03615995003400020016x.

    Article  CAS  Google Scholar 

  97. Klose S, Bilen S, Ali Tabatabai M, Dick WA. Sulfur cycle enzymes. Methods Soil Enzymol. 2011;5(9):125–59. https://doi.org/10.2136/sssabookser9.c7.

    Article  CAS  Google Scholar 

  98. Mehlich A. Mehlich 3 soil test extractant: a modification of Mehlich 2 extractant. Commun Soil Sci Plant Anal. 1984;15(12):1409–16. https://doi.org/10.1080/00103628409367568.

    Article  CAS  Google Scholar 

  99. Shoemaker HE, McLean EO, Pratt PF. Buffer methods for determining lime requirement of soils with appreciable amounts of extractable aluminum. Soil Sci Soc Am J. 1961;25(4):274–7. https://doi.org/10.2136/sssaj1961.03615995002500040014x.

    Article  CAS  Google Scholar 

  100. Ball DF. Loss-on-ignition as an estimate of organic matter and organic carbon in non-calcareous soils. J Soil Sci. 1964;15(1):84–92. https://doi.org/10.1111/j.1365-2389.1964.tb00247.x.

    Article  CAS  Google Scholar 

  101. Swift RS, Sparks DL. Methods of soil analysis: Part 3. Chemical methods. Soil Science Society of America Book Series. 1996;5:1018–20.

  102. Gavlak R, Horneck D, Miller RO, Kotuby-Amacher J. Soil, plant and water reference methods for the western region. WCC-103 Publication, Fort Collins, CO. 2003.

  103. Gould SB, Tham WH, Cowman AF, McFadden GI, Waller RF. Alveolins, a new family of cortical proteins that define the protist infrakingdom Alveolata. Mol Biol Evol. 2008;25(6):1219–30. https://doi.org/10.1093/molbev/msn070.

    Article  CAS  PubMed  Google Scholar 

  104. Cavalier-Smith T, Chao EE. Phylogeny of choanozoa, apusozoa, and other protozoa and early eukaryote megaevolution. J Mol Evol. 2003;56:540–63. https://doi.org/10.1007/s00239-002-2424-z.

    Article  CAS  PubMed  Google Scholar 

  105. Schlegel M, Hülsmann N. Protists–A textbook example for a paraphyletic taxon. Org Divers Evol. 2007;7(2):166–72. https://doi.org/10.1016/j.ode.2006.11.001.

    Article  Google Scholar 

  106. Cavalier-Smith T, Chao EE, Lewis R. Multigene phylogeny and cell evolution of chromist infrakingdom Rhizaria: contrasting cell organisation of sister phyla Cercozoa and Retaria. Protoplasma. 2018;255:1517–74. https://doi.org/10.1007/s00709-018-1241-1.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  107. Derelle R, López-García P, Timpano H, Moreira D. A phylogenomic framework to study the diversity and evolution of stramenopiles (= heterokonts). Mol Biol Evol. 2016;33(11):2890–8. https://doi.org/10.1093/molbev/msw168.

    Article  CAS  PubMed  Google Scholar 

  108. Chiu CH, Wang YT, Walther BA, Chao A. An improved nonparametric lower bound of species richness via a modified good–turing frequency formula. Biometrics. 2014;70(3):671–82. https://doi.org/10.1111/biom.12200.

    Article  PubMed  Google Scholar 

  109. Jost L. The relation between evenness and diversity. Diversity. 2010;2(2):207–32. https://doi.org/10.3390/d2020207.

    Article  Google Scholar 

  110. Ushiki N, Fujitani H, Aoi Y, Tsuneda S. Isolation of Nitrospira belonging to sublineage II from a wastewater treatment plant. Microbes Environ. 2013;28(3):346–53. https://doi.org/10.1264/jsme2.ME13042.

    Article  PubMed  PubMed Central  Google Scholar 

  111. Kämpfer P, Young CC, Arun AB, Shen FT, Jäckel U, Rosselló-Mora R, Lai WA, Rekha PD. Pseudolabrys taiwanensis gen. nov., sp. nov., an alphaproteobacterium isolated from soil. Int J Syst Evolut Microbiol. 2006;56(10):2469–72. https://doi.org/10.1099/ijs.0.64124-0.

    Article  CAS  Google Scholar 

  112. Saranraj P, Sivasakthivelan P, Al-Tawaha AR, Sudha A, Al-Tawaha AR, Sirajuddin SN. Diversity and evolution of Bradyrhizobium communities relating to Soybean cultivation: a review. In: IOP Conference Series: Earth and Environmental Science 2021 Jun 1 (Vol. 788, No. 1, p. 012208). IOP Publishing. https://doi.org/10.1088/1755-1315/788/1/012208

  113. Castaldi S, Masi M, Sautua F, Cimmino A, Isticato R, Carmona M, Tuzi A, Evidente A. Pseudomonas fluorescens showing antifungal activity against Macrophomina phaseolina, a severe pathogenic fungus of soybean, produces phenazine as the main active metabolite. Biomolecules. 2021;11(11):1728. https://doi.org/10.3390/biom11111728.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  114. Tiwari S, Prasad V, Lata C. Bacillus: Plant growth promoting bacteria for sustainable agriculture and environment. In: New and future developments in microbial biotechnology and bioengineering 2019 (pp. 43–55). Elsevier. https://doi.org/10.1016/B978-0-444-64191-5.00003-1

  115. Vurukonda SS, Giovanardi D, Stefani E. Plant growth promoting and biocontrol activity of Streptomyces spp. as endophytes. Int J Mol Sci. 2018;19(4):952. https://doi.org/10.3390/ijms19040952.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  116. Malar CM, Wang Y, Stajich JE, Kokkoris V, Villeneuve-Laroche M, Yildirir G, Corradi N. Early branching arbuscular mycorrhizal fungus Paraglomus occultum carries a small and repeat-poor genome compared to relatives in the Glomeromycotina. Microbial genomics. 2022;8(4): 000810. https://doi.org/10.1099/mgen.0.000810.

    Article  CAS  Google Scholar 

  117. Hale B, Brown E, Wijeratne A. An updated assessment of the soybean–Phytophthora sojae pathosystem. Plant Pathol. 2023. https://doi.org/10.1111/ppa.13713.

    Article  Google Scholar 

  118. Lin HA, Villamil MB, Mideros SX. Characterization of Septoria brown spot disease development and yield effects on soybean in Illinois. Can J Plant Path. 2021;43(1):62–72. https://doi.org/10.1080/07060661.2020.1755366.

    Article  CAS  Google Scholar 

  119. Liu S, Han P, Hink L, Prosser JI, Wagner M, Bruggemann N. Abiotic conversion of extracellular NH2OH contributes to N2O emission during ammonia oxidation. Environ Sci Technol. 2017;51(22):13122–32. https://doi.org/10.1021/acs.est.7b02360.

    Article  CAS  PubMed  Google Scholar 

  120. Pioli RN, Morandi EN, Martínez MC, Lucca F, Tozzini A, Bisaro V, Hopp HE. Morphologic, molecular, and pathogenic characterization of Diaporthe phaseolorum variability in the core soybean-producing area of Argentina. Phytopathology. 2003;93(2):136–46. https://doi.org/10.1094/PHYTO.2003.93.2.136.

    Article  PubMed  Google Scholar 

  121. Reznikov S, Chiesa MA, Pardo EM, De Lisi V, Bogado N, González V, Ledesma F, Morandi EN, Ploper LD, Castagnaro AP. Soybean-Macrophomina phaseolina-specific interactions and identification of a novel source of resistance. Phytopathology. 2019;109(1):63–73. https://doi.org/10.1094/PHYTO-08-17-0287-R.

    Article  CAS  PubMed  Google Scholar 

  122. Ajayi-Oyetunde OO, Bradley CA. Rhizoctonia solani: taxonomy, population biology and management of rhizoctonia seedling disease of soybean. Plant Pathol. 2018;67(1):3–17. https://doi.org/10.1111/ppa.12733.

    Article  CAS  Google Scholar 

  123. Soares AP, Guillin EA, Borges LL, Silva AC, Almeida ÁM, Grijalba PE, Gottlieb AM, Bluhm BH, Oliveira LO. More Cercospora species infect soybeans across the Americas than meets the eye. PLoS ONE. 2015;10(8): e0133495. https://doi.org/10.1371/journal.pone.0133495.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  124. Cui X, Joannou CL, Hughes MN, Cammack R. The bacteriocidal effects of transition metal complexes containing the NO+ group on the food-spoilage bacterium Clostridium sporogenes. FEMS Microbiol Lett. 1992;98(1–3):67–70. https://doi.org/10.1111/j.1574-6968.1992.tb05491.x.

    Article  CAS  Google Scholar 

  125. Nataro JP, Kaper JB. Diarrheagenic escherichia coli. Clin Microbiol Rev. 1998;11(1):142–201. https://doi.org/10.1128/cmr.11.1.142.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  126. Zheng BX, Ibrahim M, Zhang DP, Bi QF, Li HZ, Zhou GW, Ding K, Peñuelas J, Zhu YG, Yang XR. Identification and characterization of inorganic-phosphate-solubilizing bacteria from agricultural fields with a rapid isolation method. AMB Express. 2018;8:1–2. https://doi.org/10.1186/s13568-018-0575-6.

    Article  Google Scholar 

  127. Lin H, Peddada SD. Analysis of compositions of microbiomes with bias correction. Nat Commun. 2020;11(1):3514. https://doi.org/10.1038/s41467-020-17041-7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  128. Matchado MS, Lauber M, Reitmeier S, Kacprowski T, Baumbach J, Haller D, List M. Network analysis methods for studying microbial communities: a mini review. Comput Struct Biotechnol J. 2021;19:2687–98. https://doi.org/10.1016/j.csbj.2021.05.001.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  129. Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Stat Methodol. 1996;58(1):267–88. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x.

    Article  Google Scholar 

  130. Anthony MA, Bender SF, van der Heijden MG. Enumerating soil biodiversity. Proc Natl Acad Sci. 2023;120(33): e2304663120. https://doi.org/10.1073/pnas.2304663120.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  131. Maherali H, Klironomos JN. Influence of phylogeny on fungal community assembly and ecosystem functioning. Science. 2007;316(5832):1746–8. https://doi.org/10.1126/science.1143082.

    Article  CAS  PubMed  Google Scholar 

  132. Wall DH, Nielsen UN, Six J. Soil biodiversity and human health. Nature. 2015;528(7580):69–76. https://doi.org/10.1038/nature15744.

    Article  CAS  PubMed  Google Scholar 

  133. Zhou J, Ning D. Stochastic community assembly: does it matter in microbial ecology? Microbiol Mol Biol Rev. 2017;81(4):10–128. https://doi.org/10.1128/mmbr.00002-17.

    Article  Google Scholar 

  134. White RA III, Callister SJ, Moore RJ, Baker ES, Jansson JK. The past, present and future of microbiome analyses. Nat Protoc. 2016;11(11):2049–53. https://doi.org/10.1038/nprot.2016.148.

    Article  CAS  Google Scholar 

  135. Peiffer JA, Spor A, Koren O, Jin Z, Tringe SG, Dangl JL, Buckler ES, Ley RE. Diversity and heritability of the maize rhizosphere microbiome under field conditions. Proc Natl Acad Sci. 2013;110(16):6548–53. https://doi.org/10.1073/pnas.1302837110.

    Article  PubMed  PubMed Central  Google Scholar 

  136. Russ D, Fitzpatrick CR, Teixeira PJ, Dangl JL. Deep discovery informs difficult deployment in plant microbiome science. Cell. 2023;186(21):4496–513. https://doi.org/10.1016/j.cell.2023.08.035.

    Article  CAS  PubMed  Google Scholar 

  137. Finkel OM, Castrillo G, Paredes SH, González IS, Dangl JL. Understanding and exploiting plant beneficial microbes. Curr Opin Plant Biol. 2017;38:155–63. https://doi.org/10.1016/j.pbi.2017.04.018.

    Article  PubMed  PubMed Central  Google Scholar 

  138. Bouffaud ML, Poirier MA, Muller D, Moënne-Loccoz Y. Root microbiome relates to plant host evolution in maize and other Poaceae. Environ Microbiol. 2014;16(9):2804–14. https://doi.org/10.1111/1462-2920.12442.

    Article  PubMed  Google Scholar 

  139. Chaparro JM, Badri DV, Vivanco JM. Rhizosphere microbiome assemblage is affected by plant development. ISME J. 2014;8(4):790–803. https://doi.org/10.1038/ismej.2013.196.

    Article  CAS  PubMed  Google Scholar 

  140. Mendes R, Garbeva P, Raaijmakers JM. The rhizosphere microbiome: significance of plant beneficial, plant pathogenic, and human pathogenic microorganisms. FEMS Microbiol Rev. 2013;37(5):634–63. https://doi.org/10.1111/1574-6976.12028.

    Article  CAS  PubMed  Google Scholar 

  141. Sugiyama A, Ueda Y, Takase H, Yazaki K. Do soybeans select specific species of Bradyrhizobium during growth? Commun Integr Biol. 2015;8(1): e992734. https://doi.org/10.4161/19420889.2014.992734.

    Article  PubMed  PubMed Central  Google Scholar 

  142. Minamisawa K, Onodera S, Tanimura Y, Kobayashi N, Yuhashi KI, Kubota M. Preferential nodulation of Glycine max, Glycine soja and Macroptilium atropurpureum by two Bradyrhizobium species japonicum and elkanii. FEMS Microbiol Ecol. 1997;24(1):49–56. https://doi.org/10.1111/j.1574-6941.1997.tb00422.x.

    Article  CAS  Google Scholar 

  143. Zitnick-Anderson KK, Nelson BD Jr. Identification and pathogenicity of Pythium on soybean in North Dakota. Plant Dis. 2015;99(1):31–8. https://doi.org/10.1094/PDIS-02-14-0161-RE.

    Article  PubMed  Google Scholar 

  144. Geisen S, Laros I, Vizcaíno A, Bonkowski M, De Groot GA. Not all are free-living: High-throughput DNA metabarcoding reveals a diverse community of protists parasitizing soil metazoa. Mol Ecol. 2015;24(17):4556–69. https://doi.org/10.1111/mec.13238.

    Article  CAS  PubMed  Google Scholar 

  145. Geisen S, Mitchell EA, Adl S, Bonkowski M, Dunthorn M, Ekelund F, Fernández LD, Jousset A, Krashevska V, Singer D, Spiegel FW. Soil protists: a fertile frontier in soil biology research. FEMS Microbiol Rev. 2018;42(3):293–323. https://doi.org/10.1093/femsre/fuy006.

    Article  CAS  PubMed  Google Scholar 

  146. Moroenyane I, Tremblay J, Yergeau É. Temporal and spatial interactions modulate the soybean microbiome. FEMS Microbiol Ecol. 2021;97(1):fiaa206. https://doi.org/10.1093/femsec/fiaa206.

    Article  CAS  Google Scholar 

  147. Xu Y, Wang G, Jin J, Liu J, Zhang Q, Liu X. Bacterial communities in soybean rhizosphere in response to soil type, soybean genotype, and their growth stage. Soil Biol Biochem. 2009;41(5):919–25. https://doi.org/10.1016/j.soilbio.2008.10.027.

    Article  CAS  Google Scholar 

  148. Toju H, Peay KG, Yamamichi M, Narisawa K, Hiruma K, Naito K, Fukuda S, Ushio M, Nakaoka S, Onoda Y, Yoshida K. Core microbiomes for sustainable agroecosystems. Nat Plants. 2018;4(5):247–57. https://doi.org/10.1038/s41477-018-0139-4.

    Article  PubMed  Google Scholar 

  149. Ji N, Liang D, Clark LV, Sacks EJ, Kent AD. Host genetic variation drives the differentiation in the ecological role of the native Miscanthus root-associated microbiome. Microbiome. 2023;11(1):1–3. https://doi.org/10.1186/s40168-023-01646-3.

    Article  Google Scholar 

  150. Clocchiatti A, Hannula SE, van den Berg M, Korthals G, De Boer W. The hidden potential of saprotrophic fungi in arable soil: Patterns of short-term stimulation by organic amendments. Appl Soil Ecol. 2020;147: 103434. https://doi.org/10.1016/j.apsoil.2019.103434.

    Article  Google Scholar 

  151. Sari M, Nawangsih AA, Wahyudi AT. Rhizosphere Streptomyces formulas as the biological control agent of phytopathogenic fungi Fusarium oxysporum and plant growth promoter of soybean. Biodivers J Biol Divers. 2021;22(6):3015–23. https://doi.org/10.13057/biodiv/d220602.

    Article  Google Scholar 

  152. Al-Fadhal FA, AL-Abedy AN, Alkhafije DA. Isolation and molecular identification of Rhizoctonia solani and Fusarium solani isolated from cucumber (Cucumis sativus L.) and their control feasibility by Pseudomonas fluorescens and Bacillus subtilis. Egypt J Biol Pest Control. 2019;29:1–1. https://doi.org/10.1186/s41938-019-0145-5.

    Article  Google Scholar 

  153. Vijayabharathi R, Sathya A, Gopalakrishnan S. Extracellular biosynthesis of silver nanoparticles using Streptomyces griseoplanus SAI-25 and its antifungal activity against Macrophomina phaseolina, the charcoal rot pathogen of sorghum. Biocatal Agric Biotechnol. 2018;14:166–71. https://doi.org/10.1016/j.bcab.2018.03.006.

    Article  Google Scholar 

  154. Nishijima F, Evans WR, Vesper SJ. Enhanced nodulation of soybean by Bradyrhizobium in the presence of Pseudomonas fluorescens. Plant Soil. 1988;111:149–50. https://doi.org/10.1007/BF02182049.

    Article  Google Scholar 

  155. Pawar PU, Kumbhar CT, Patil VS, Khot GG. Effect of co-inoculation of Bradyrhizobium japonicum and Pseudomonas fluorescens on growth, yield and nutrient uptake in soybean [Glycine max (L.) Merrill]. Crop Res. 2018;53(12):57–62. https://doi.org/10.5958/2454-1761.2018.00009.8.

    Article  Google Scholar 

  156. Xia Y. Correlation and association analyses in microbiome study integrating multiomics in health and disease. Prog Mol Biol Transl Sci. 2020;171:309–491. https://doi.org/10.1016/bs.pmbts.2020.04.003.

    Article  CAS  PubMed  Google Scholar 

  157. Szparaga A, Kocira S, Findura P, Kapusta I, Zaguła G, Świeca M. Uncovering the multi-level response of Glycine max L. to the application of allelopathic biostimulant from Levisticum officinale Koch. Sci Rep. 2021;11(1):15360. https://doi.org/10.1038/s41598-021-94774-5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  158. Li F, Yu H, Li Y, Wang Y, Hu D, Feng B, Han Y. The quality of compost was improved by low concentrations of fulvic acid owing to its optimization of the exceptional microbial structure. Biores Technol. 2021;342: 125843. https://doi.org/10.1016/j.biortech.2021.125843.

    Article  CAS  Google Scholar 

  159. Blanchet FG, Cazelles K, Gravel D. Co-occurrence is not evidence of ecological interactions. Ecol Lett. 2020;23(7):1050–63. https://doi.org/10.1111/ele.13525.

    Article  PubMed  Google Scholar 

  160. Alteio LV, Séneca J, Canarini A, Angel R, Jansa J, Guseva K, Kaiser C, Richter A, Schmidt H. A critical perspective on interpreting amplicon sequencing data in soil ecological research. Soil Biol Biochem. 2021;160: 108357. https://doi.org/10.1016/j.soilbio.2021.108357.

    Article  CAS  Google Scholar 

  161. Lindahl BD, Tunlid A. Ectomycorrhizal fungi–potential organic matter decomposers, yet not saprotrophs. New Phytol. 2015;205(4):1443–7. https://doi.org/10.1111/nph.13201.

    Article  CAS  PubMed  Google Scholar 

  162. Tao X, Feng J, Yang Y, Wang G, Tian R, Fan F, Ning D, Bates CT, Hale L, Yuan MM, Wu L. Winter warming in Alaska accelerates lignin decomposition contributed by Proteobacteria. Microbiome. 2020;8(1):1–2. https://doi.org/10.1186/s40168-020-00838-5.

    Article  CAS  Google Scholar 

  163. Zhu B, Cheng W. Nodulated soybean enhances rhizosphere priming effects on soil organic matter decomposition more than non-nodulated soybean. Soil Biol Biochem. 2012;51:56–65. https://doi.org/10.1016/j.soilbio.2012.04.016.

    Article  CAS  Google Scholar 

  164. Jiao S, Chen W, Wei G. Core microbiota drive functional stability of soil microbiome in reforestation ecosystems. Glob Change Biol. 2022;28(3):1038–47. https://doi.org/10.1111/gcb.16024.

    Article  CAS  Google Scholar 

  165. Ramírez MD, España M, Aguirre C, Kojima K, Ohkama-Ohtsu N, Sekimoto H, Yokoyama T. Burkholderia and Paraburkholderia are predominant soybean rhizobial genera in Venezuelan soils in different climatic and topographical regions. Microbes Environ. 2019;34(1):43–58. https://doi.org/10.1264/jsme2.ME18076.

    Article  PubMed  PubMed Central  Google Scholar 

  166. Brown JK, Rant JC. Fitness costs and trade-offs of disease resistance and their consequences for breeding arable crops. Plant Pathol. 2013;62:83–95. https://doi.org/10.1111/ppa.12163.

    Article  Google Scholar 

Download references

Acknowledgements

The authors thank Monty Malone, Soybean Variety Development Lead at BASF, for providing the soybean seed used in this study. Additionally, they acknowledge and express appreciation to Nikhita Shankar and Caroline Obert, Study Managers at Element Biosciences, for their technical support regarding the LoopSeq platform.

Funding

The sequencing and enzyme analyses in this study were financially supported by AgriGro, Inc. This research is also partially funded by university startup funds to AJW.

Author information

Authors and Affiliations

Authors

Contributions

BH, EB, and AJW designed the study. CW, MC, and EB managed the test site throughout the duration of the experiments. BH, CW, and MC collected and processed samples. BH performed all statistical analyses. BH and AJW wrote the manuscript. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Asela J. Wijeratne.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

BH is employed by AgriGro, Inc.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hale, B., Watts, C., Conatser, M. et al. Fine-scale characterization of the soybean rhizosphere microbiome via synthetic long reads and avidity sequencing. Environmental Microbiome 19, 46 (2024). https://doi.org/10.1186/s40793-024-00590-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40793-024-00590-5

Keywords