GithubHelp home page GithubHelp logo

bglarkin / tallgrass_restoration_legacies Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 105.7 MB

Agricultural practices in cropland, like soil tillage and fertilizer applications, profoundly alter soil physical and chemical properties over time.

R 100.00%
climate-data etl arbuscular-mycorrhizal-fungi microbial-ecology plant-pathogens

tallgrass_restoration_legacies's Introduction

Introduction

Agricultural practices in cropland, like soil tillage and fertilizer applications, profoundly alter the soil's physical and chemical properties over time. Soil carbon is greatly reduced and belowground metabolic pathways are greatly simplified, leading to attrition of microbial diversity. Restoration of abandoned cropland is increasingly popular, and usually involves seeding a great density of native plants into post-agricultural soil. Although these plants can establish, bacterial communities often fail to recover to a pre-agricultural state, instead forming novel communities (Badger and Docherty 2022). We sampled soil from fields in current row-crop agriculture, active restoration of varying ages, and prairie remnants to investigate differences in fungal communities. Our goal was to learn whether fungi, like bacteria, transition to novel communities during prairie restoration, or whether post-restoration communities begin to resemble those in remnants.

Source Data

Raw source data were output from QIIME2. ETL of these files is accomplished in process_data.md, which produces clean .csv files which are included in this here and available for use.

Site Data

The file site_locations.md shows locations of regions and sites and displays associated metadata. Climate data (precipitation) are also downloaded, processed, and displayed by this script.

Sites are tested for autocorrelation between geographic distance and fungal species or soil chemical properties in spatial_correlation.md. Mild or insignificant autocorrelation was found, particularly in the Blue Mounds restored fields, suggesting that we can at least present these fields as a pseudochronosequence.

Additional Site Data

Site data include experimental metadata as described above. It also includes some measured data that is aggregated and the field level rather than containing subsamples in fields:

  • Soil abiotic properties: This script provides a quick overview of the soil abiotic property data and produces products (e.g., ordination axis values) for use in downstream analysis. A basic correlation with microbial biomass is also displayed.
  • Percent water stable aggregates: This script provides a quick overview of WSA in fields and regions. WSA is higher in restored fields based on a mixed linear model, but isn't correlated with years since restoration.
  • Microbial biomass data include site-species tables derived from high-throughput sequencing and PLFA/NLFA extractions and analysis data which Ylva did. This report presents basic visualizations of microbial biomass inferred with PLFA/NLFA quantification.

Diagnostics

In iterative workflow is used to discover the optimal number of samples to keep from each field to ensure equal sampling effort and an adequate representation of diversity. For details, see the associated scripts listed here.

Workflow

  1. The script process_data.R is run first.
  2. Next, microbial_diagnostics_pre.R is run to investigate sequencing depth in samples and species accumulation in fields.
  3. Then, process_data.R is run again, this time with the number of samples retained per field set to the levels recommended in microbial_diagnostics_pre.R.
  4. Finally, microbial_diagnostics_post.R is run. It is very similar to the "_pre" script, but a different file is used so that the two may be compared.

Microbial Species Diversity

Microbial data analyzed here include site-species tables derived from high-throughput sequencing of ITS and 18S genes and clustering into 97% similar OTUs. The report microbial_diversity.md presents basic statistics and visualizations of species richness, Shannon's diversity/evenness, and Simpson's diversity/evenness in the microbial species data across field types.

Microbial Taxonomy and Guilds

OTUs are annotated with taxonomic information, and ITS OTUs additionally are annotated with fungal traits "primary lifestyles" (aka guilds). In this report, sequence abundances in taxonomic groups or fungal guilds are compared across field types and with time since restoration. Indicator species analysis is also performed to identify particular species matches with interesting stories.

Microbial Communities

In this report, multivariate analysis is performed on three datasets: ITS (rarefied, Bray-Curtis distance), 18S (rarefied, Bray-Curtis distance), and 18S (rarefied, UNIFRAC distance). Unconstrained ordinations are produced. Cornfields clustered away from all other field types, but separation of remnants and restored fields differed among datasets. With ITS, years since restoration looked like a strong signal among restored fields, but remnants appeared intermediate among restored fields where age is concerned. This could be evidence of a "novel microbial assemblage" as a successional endpoint in restored fields (although other scenarios are equally plausible). With both 18S datasets, remnant and restored fields clustered closer together, and away from corn. Years since restoration was still a significant signal, but less so than with ITS.

Years since will also be tested post-hoc using envfit().

Plant Composition

Plant data comprises two data sets. Quadrat surveys from 10 plots in each field were done in Wisconsin sites in 2016. At Fermi, we only have relevé data with presence/absence from summer 2017. Plant metadata includes taxonomy and life history traits, and should coverboth plant data sets. With the abundance-data sites, trait data are reported in percent cover. In the presence-data sites, traits are in counts of species with that trait per field.

This report produces basic visualization and diagnostic views of the plant data. Two traits matrices are output, one for sites with abundance data (16 sites), and one for sites with presence data only (20 sites).

Constrained Analyses

This report attempts to find strong explanations for microbial community differences. Because the site differences due to region might confound such an analysis, site variables like soil abiotic properties and precipitation were first condensed into two ordination axes using PCA and used as covariables in the constrained analysis. The constrained analysis was done with dbRDA and tested the explanatory power of years since restoration, plant community axes or plant traits, and soil properties that we don't believe are due to regional differences and rather come from an agricultural legacy: SOM, NO3, P, and K.

Summary

The goal of this final report is to present results and discuss whether a path forward with these data exists. If so, we will determine the strategy that presents the best story, organization, and interpretation of these results.

tallgrass_restoration_legacies's People

Contributors

bglarkin avatar

Stargazers

 avatar

Watchers

 avatar

tallgrass_restoration_legacies's Issues

Final diagnostics on OTU accumulation

Remember to do the final diagnostics (which you should have done to start with!)

  • Plot sequence abundance vs richness. Whatever the outcome, you may have to explain it if there is a relationship. The lines may look different among field types
  • Produce rarefaction curves for each site. Look for outliers or obvious evidence that we have significantly undersampled. Do this with the set of 6 samples per field, and/or with all the samples we have.

# s axis sequence abundance, y axis richness

Fit years since with `envfit()`

Remember to produce this plot and test in the microbial communities script. It isn't a constrained anaylsis; those will be done later with environmental variables. But this is a quick way to get a naive test on years since restoration, and it's likely to be significant with all microbial datasets.

#' Try fitting years since restoration with envfit...

NA point in ITS PCoA

Now for some reason one of the remnants is NA. I need to re-run all the code and troubleshoot. This is endless.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.