Simone Giachello

and 8 more

Protists are major components of soil communities and play key roles in shaping food webs, community assembly, and ecosystem processes, yet their functional diversity is understudied. High-throughput sequencing data have revealed their ubiquity and diversity, but the lack of standardized traits has hampered the integration of functional information, limiting our understanding of soil ecosystems. Here we propose a framework for soil protists, identify a set of common traits to characterize their functional diversity, and apply the framework to a broad-scale, real-world dataset. We reviewed studies on soil protists to identify the traits used in the literature, and defined a framework based on 10 key traits that satisfy two criteria: availability of information, and applicability to most taxa. The framework was tested on a dataset of environmental DNA metabarcoding data from 1123 soil samples collected in 48 glacier forelands worldwide. Traits were assigned to all 570 Molecular Operational Taxonomic Units (MOTUs) detected in our dataset, leading to the production of a global trait-based dataset from glacier forelands. We estimated the functional space of protist communities and evaluated whether the selected traits were effective in describing protist diversity. The functional space of protist communities showed that the MOTUs are clustered in three regions, mainly reflecting different nutritional and habitat preferences. The proposed framework is appropriate for multiple applications, including estimation of functional diversity and food web analyses, and provides a basis for ecological studies on soil protists, enabling the functional characterization of this essential but often neglected component of soil biodiversity.

Aurélie Bonin

and 2 more

Clustering approaches are pivotal for handling the many sequence variants obtained in DNA metabarcoding datasets, and have therefore become a key step of metabarcoding analysis pipelines. Clustering often relies on a sequence similarity threshold to gather sequences into Molecular Operational Taxonomic Units (MOTUs), each ideally representing a homogeneous taxonomic entity, e.g. a species or a genus. However, the choice of the clustering threshold is rarely justified, and its impact on MOTU over-splitting or over-merging is tested even more rarely. Here, we evaluated clustering threshold values for several metabarcoding markers under different criteria: limitation of MOTU over-merging, limitation of MOTU over-splitting, and trade-off between over-merging and over-splitting. We extracted sequences from a public database for eight markers, ranging from generalist markers targeting Bacteria or Eukaryota, to more specific markers targeting a class or a subclass (e.g. Insecta, Oligochaeta). Based on the distributions of pairwise sequence similarities within species and within genera, and on the rates of over-splitting and over-merging across different clustering thresholds, we were able to propose threshold values minimizing the risk of over-splitting, that of over-merging, or offering a trade-off between the two risks. For generalist markers, high similarity thresholds (0.96-0.99) are generally appropriate, while more specific markers require lower values (0.85-0.96). These results do not support the use of a fixed clustering threshold. Instead, we advocate a careful examination of the most appropriate threshold based on the research objectives, the potential costs of over-splitting and over-merging, and the features of the studied markers.
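
The trade-off between over-splitting and over-merging can be illustrated with a minimal sketch. It is not the authors' pipeline: the similarity measure (share of identical positions between equal-length sequences), the single-linkage clustering, and the two rate definitions (share of species spread over several MOTUs, share of MOTUs containing several species) are all assumptions chosen for illustration, and the sequences are invented toy data.

```python
from itertools import combinations


def similarity(a: str, b: str) -> float:
    """Toy similarity (assumption): share of identical positions, equal lengths only."""
    if len(a) != len(b):
        raise ValueError("toy similarity expects equal-length sequences")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cluster(seqs, threshold):
    """Single-linkage clustering: link any pair with similarity >= threshold."""
    parent = list(range(len(seqs)))

    def find(i):
        # Follow parent pointers to the cluster root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(seqs)), 2):
        if similarity(seqs[i], seqs[j]) >= threshold:
            parent[find(i)] = find(j)
    return [find(i) for i in range(len(seqs))]


def split_merge_rates(seqs, species, threshold):
    """Return (over-splitting rate, over-merging rate) under the assumed definitions."""
    motus_per_species, species_per_motu = {}, {}
    for sp, motu in zip(species, cluster(seqs, threshold)):
        motus_per_species.setdefault(sp, set()).add(motu)
        species_per_motu.setdefault(motu, set()).add(sp)
    over_split = sum(len(m) > 1 for m in motus_per_species.values()) / len(motus_per_species)
    over_merged = sum(len(s) > 1 for s in species_per_motu.values()) / len(species_per_motu)
    return over_split, over_merged


# Invented toy data: two variants of species A, one of B, one of C.
seqs = ["AAAAAAAAAA", "AAAAAAAAAC", "AAAAAAAACC", "GGGGGGGGGG"]
species = ["A", "A", "B", "C"]

for threshold in (0.95, 0.85):
    print(threshold, split_merge_rates(seqs, species, threshold))
```

In this toy case the stricter threshold splits species A into two MOTUs, while the looser one merges A and B into a single MOTU, which is the tension the abstract describes.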

Alessia Guerrieri

and 17 more

Ice-free areas are increasing worldwide due to dramatic glacier shrinkage and are undergoing rapid colonization by multiple lifeforms, thus representing key environments to study ecosystem development. Soils have a complex vertical structure. However, we know little about how microbial and animal communities differ across soil depths and development stages during the colonization of deglaciated terrains, how these differences evolve through time, and whether patterns are consistent among different taxonomic groups. Here, we used environmental DNA metabarcoding to describe how community diversity and composition of six groups (Eukaryota, Bacteria, Mycota, Collembola, Insecta, Oligochaeta) differ between surface (0-5 cm) and relatively deep (7.5-20 cm) soils at different stages of development across five Alpine glaciers. Taxonomic diversity increased with time since glacier retreat and with soil evolution; the pattern was consistent across different groups and soil depths. For Eukaryota, and particularly Mycota, alpha-diversity was generally highest in soils close to the surface. Time since glacier retreat was a more important driver of community composition than soil depth; for nearly all the taxa, differences in community composition between surface and deep soils decreased with time since glacier retreat, suggesting that the development of soil and/or of vegetation tends to homogenize the first 20 cm of soil through time. Within both Bacteria and Mycota, several molecular operational taxonomic units were significant indicators of specific depths and/or soil development stages, confirming the strong functional variation of microbial communities through time and depth. The complexity of community patterns highlights the importance of integrating information from multiple taxonomic groups to unravel community variation in response to ongoing global changes.

Alessia Guerrieri

and 5 more

Environmental DNA metabarcoding is becoming a key tool for biodiversity monitoring over large geographical or taxonomic scales and for elusive taxa like soil organisms. Increasing sample sizes and interest in remote or extreme areas often require the preservation of soil samples and thus deviations from optimal standardized protocols. However, we still do not know the impact of different methods of soil sample preservation on the results of metabarcoding studies, and there are no guidelines for best practices so far. Here, we assessed the impact of four methods of soil sample preservation commonly used in metabarcoding studies (preservation at room temperature for 6h, preservation at 4°C for three days, desiccation immediately after sampling and preservation for 21 days, and desiccation after 6h at room temperature and preservation for 21 days). For each preservation method, we benchmarked resulting estimates of taxon diversity and community composition of three different taxonomic groups (bacteria, fungi and eukaryotes) in three different habitats (forest, river bank and grassland) against results obtained under optimal conditions (i.e. extraction of eDNA right after sampling). Overall, the different preservation methods only marginally impaired results and only under certain conditions. When rare taxa were considered, we detected small but significant changes in MOTU richness of bacteria, fungi and eukaryotes across treatments, while the exclusion of rare taxa led to robust results across preservation methods. The differences in community structure among habitats were evident for all treatments, and the communities retrieved using the different preservation conditions were extremely similar. We propose guidelines on the selection of the optimal soil sample preservation conditions for metabarcoding studies, depending on the practical constraints, costs and ultimate research goals.
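
The contrast between richness estimates with and without rare taxa can be sketched as follows. The abstract does not say how rare taxa were defined or excluded, so the relative-abundance cutoff used here is an assumption, and the function name, the cutoff value and the read counts are all hypothetical.

```python
def richness(counts, min_rel_abundance=0.0):
    """Count MOTUs present in a sample, optionally dropping rare ones.

    counts: mapping of MOTU identifier -> read count for one sample.
    min_rel_abundance: assumed rarity cutoff, as a share of the sample's reads.
    """
    total = sum(counts.values())
    if total == 0:
        return 0
    return sum(
        1 for n in counts.values()
        if n > 0 and n / total >= min_rel_abundance
    )


# Invented read counts for one sample (1000 reads in total).
sample = {"motu1": 500, "motu2": 480, "motu3": 15, "motu4": 5, "motu5": 0}

print(richness(sample))        # all detected MOTUs
print(richness(sample, 0.01))  # MOTUs with at least 1% of the reads
```

Under this assumed cutoff, low-abundance MOTUs no longer contribute to richness, so small differences in their detection across preservation treatments would not affect the estimate.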