Authorea

Jenna M. Lang edited Methods.md over 9 years ago

Commit id: d82622ddff6c2a10fc9ce9310b7b2c1a20fc52ec

deletions | additions

##Microbial community analysis Microbial plate counts were performed by Covance Laboratories (Covance Inc., Madison, WI). Aerobic plate counts were performed according to [SPCM:7](http://www.fda.gov/Food/FoodScienceResearch/LaboratoryMethods/ucm063346.htm), anaerobic plate counts were performed according to [APCM:5](http://www.fda.gov/food/foodscienceresearch/laboratorymethods/ucm073598.htm) and the yeast and mold counts were performed according to Chapter 23 of the FDA's [Bacteriological Analytical Manual](http://www.fda.gov/food/foodscienceresearch/laboratorymethods/ucm073598.htm). Plate counts were reported as colony forming units (CFU) per gram for each meal composite. The CFU/g values were multiplied by the total number of grams in each meal to obtain the CFU per meal, and the values for meals for each day were added to obtain the CFU per day for each dietary pattern (Table 1). The taxonomic composition of each meal microbiome was assessed via amplification and sequencing of 16S rDNA from the homogenized meals. DNA was extracted from homogenized food samples with the Power Food Microbial DNA Isolation Kit (MoBio Laboratories, Inc.) according to the manufacturer’s protocol. Microbial DNA was amplified by a two-step PCR enrichment of the 16S rRNA gene (V4 region) using primers 515F and 806R, modified by addition of Illumina adaptor and barcodes sequences. All primer sequences and a detailed PCR protocol are provided Supplementary Datafile 1. in Tables 7 and 8, respectively. Libraries were sequenced using an Illumina MiSeq system, generating 150bp paired-end amplicon reads. The amplicon data was multiplexed using dual barcode combinations for each sample. We used a custom script (available on [github](https://github.com/gjospin/scripts/blob/master/Demul_trim_prep.pl), with code provided in Supplementary Datafile 2) to assign each pair of reads to their respective samples when parsing the raw data. This script allows for 1 base pair difference per barcode used (2 per sample) to accommodate for read errors from the machine. The paired reads were then aligned and a consensus was computed using FLASH \cite{21903629} with maximum overlap of 120 and a minimum overlap of 70 (other parameters were left as default). The custom script automatically demultiplexes the data into fastq files, executes FLASH, and parses its results to reformat the sequences with appropriate naming conventions for QIIME v. 1.8.0 \cite{20383131} in fasta format. The resulting consensus sequences were analyzed using the QIIME pipeline. ##Statistical Analyses Unless otherwise noted, all statistical analyses were performed using python scripts implemented in QIIME v. 1.8.0, and all python scripts referenced here are QIIME scripts. The IPython notebook file used for all QIIME analyses is available HERE. To explore the differences in overall microbial community composition across the 15 meals, both the phylogenetic weighted UniFrac distances \cite{20827291} and the taxonomic Bray-Curtis dissimilarities \cite{Bray_1957} were calculated using the beta\_diversity\_through\_plots.py script. This script also produced a principal coordinates analysis (PCoA) plot in which the Bray-Curtis dissimilarities between samples were used to visualize differences among groups of samples (see Figure 1 for this type of visualization for the three Diet Types.) To test for the significance of dietary pattern on the overall microbial community composition, we used a permutational multivariate ANOVA as implemented in the compare\_categories.py script. To test for significant differences in taxonomic richness across dietary patterns, we used the non-parametric Kruskal-Wallis test \cite{Kruskal_1952} with the FDR (false discovery rate) correction as implemented in compare\_alpha\_diversity.py. To test for the significant variation in frequency of individual OTUs across dietary patterns, we used the Kruskal-Wallis test with the FDR correction as implemented in the group\_significance.py script. To test for significant correlation between the relative abundance of a single taxonomic group and meal metadata categories (i.e. nutrient composition, whether a meal contains fermented foods, etc) at 5 taxonomic levels (phylum-genus) Pearson correlation coefficients \cite{Pearson_1895} were calculated and tested for statistical significance using Stata (StataCorp. 2013. Stata Statistical Software: Release 13. College Station, TX: StataCorp LP.)