this is for holding javascript data
Jenna M. Lang edited Building a 16S rDNA Phylogenetic Tree.md
almost 10 years ago
Commit id: cd9422fb1af0cc02ad936185df2b84b925889100
deletions | additions
diff --git a/Building a 16S rDNA Phylogenetic Tree.md b/Building a 16S rDNA Phylogenetic Tree.md
index ed677c3..21b1c40 100644
--- a/Building a 16S rDNA Phylogenetic Tree.md
+++ b/Building a 16S rDNA Phylogenetic Tree.md
...
#Constructing #Building a
Phylogenetic Tree using 16S rDNA
sequence
There are two points during Phylogenetic Tree
What _is_ this thing? At this point, you have an organism in pure culture, but you don't know what it is. If you found an creature crawling on the
workflow where making a 16S phylogenetic tree may be useful. The first is after identification of candidate organisms by Sanger sequencing ground and
wanted to identify (or classify) it, you might look at it's morphology and
ask what it most _looks_ like. If it has six legs, you might guess it's some kind of
insect. If it has hard outer wings folded over its back, you might guess that it's some kind of beetle. If it also had antler-style horns on its head, you might
guess that it's some kind of stag beetle. If you don't have enough information available to guess what kind of stag beetle you have, then you have reached the
second is after assembly limit of
*taxonomic resolution* for your creature. With an unknown microbial species, the
genome. The process best way to identify it is
identical in both cases, but the additional length and improved quality to sequence one of
its genes (most people
use the
post-assembly full-length 16S rRNA
gene sequence may generate a better tree. The tree can be used for identification of gene) and ask what _it_ most looks like. With animal classification, the key features to examine are things like legs and wings and horns; with microbial
classification, the
candidate (e.g. is key features to examine are the
candidate found nucleotides in different positions in a
single species clade?), for naming DNA sequence. Fortunately, we have computer programs to help us make
sense of the
candidate (does it fall in DNA sequence information. Our preferred approach to identifying a
clade containing only members microbial species is to place the unknown sequence in the context of
that species, and other members a
phylogenetic tree of
known sequences. Building a phylogenetic tree from a 16S rRNA sequence is fairly straightforward, but the
species are not found outside that clade?), and for placement interpretation of the
organism into tree can be
a
phylogenetic context. bit of an art. Here, we attempt to guide you through both. However, some complicated cases will require consultation with an expert in the field of
phylogenetics or systematics.
The outline of the workflow is to use the Ribosomal Database Project (RDP) to generate an alignment of the sequence with close relatives and an outgroup,
following by cleanup of the RDP headers, tree-building with FastTree and
viewing/analysis viewing/interpretation of the tree
in Dendroscope.
Obtain the Full-Length 16S Sequence from the Assembly
(Skip this step if you are building the tree using
the 16S sequence from Sanger sequencing)
1. Go to RAST and sign in
On the “Jobs Overview" page, click on view details (under annotation progress) for the microbe you are working with.
2. Click on Browse annotated genome in SEED viewer (At the top of the page)
3. Click on Browse through the features of [organism name]
4. Under Function search for “ssurna” or “SSU rRNA”
(if it doesn't work at first then refresh the page)
5. Find the ssuRNA that is 1400-1800 bp in length (often Illumina assemblies also have fragments of 16S sequence that are only a few hundred bp long)
6. Click on the Feature ID for that sequence
7. Click on the Sequences tab (around the middle of the page )
8. Click on Show Fasta
9. Click on Download Sequences and save as a fasta file. Rename the file to something useful.
11. Double check the identity of the sequence at BLAST:
http://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastn&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome
Obtaining Dendroscope.
##Obtain an RDP alignment
The goal of this section is to obtain a 16S alignment from RDP that can be used to build a tree.
The "art" is determining which sequences to put in the alignment. 1. Go to rdp.cme.msu.edu
2. Create an account
3. Click on my RDP/login
4. Upload the fasta file containing your 16s sequence
5. Assign it a group name (this is what the program will label your sequence/organism). Choose this carefully since that will be the name on the final tree.
Upload the fasta file (may take some time to process, status=P (pending) wait until A (aligned)
If it is taking awhile refresh the screen 6. Click the "+" next to the sequence, to add it to your cart
7. Click on CLASSIFIER at the top of the page
8. Click on "Do Classification With Selected Sequences" button. This will show you a hierarchical view of the classification of your sequence (from Phylum to Genus.) You will use this information to find other sequences that you want to include in your alignment that you will use to build your phylogenetic tree.
8. Under "Data Set Options," Specify isolates as the Source, leave the rest as defaults
click on BROWSERS at the top of the page
Specify isolates as the Source, leave the rest as defaults
Click Browse
Browse Sequences