Madison edited 16S rDNA Sequencing and Analysis (Organism Identification).md  almost 10 years ago

Commit id: 6d7e1f68fe35e5ea42041267b3617a036a7727bb

deletions | additions      

       

4. Pull up the Project Settings by clicking on the picture of tools at the top of the page. Click on the Sequence Processing tab, under Sequence trimming unclick the Automatically trim sequence ends button. You should also decrease the Min. confidence score under Consensus settings. The default option is 30 which represents a 99.9% quality score, for many reads this will be too stringent and will not allow you to get enough overlap to create a consensus sequence. A minimum confidence score between 15 and 25 is normally okay but tuning may be required depending on your read quality. For an example see Figure 4.  5. Group your forward and reverse reads by highlighting both of them and clicking Group selected forward/reverse files (under Traces)  6. Under Sequences go to Generate Finished Sequences and click on for all trace files. (you will need to redo this every time you change the project settings).  7. To view your consensus sequence, click on the read pair group and then click on the magnifying glass at the top of the page. You should see something like Figure 12. 5.  8. The Trace View shows the quality scores, the chromatogram display, and the raw base calls from both the forward and reverse reads, as well as the consensus sequence. The consensus sequence is the middle list of nucleotides. If the program is giving you a string of Ns where your forward and reverse reads do not overlap, you need to decrease the Min confidence score.  9. To export the consensus from the trace view, go to File and click on Export working sequence. This will create a file containing the consensus sequence, which you can upload to BLAST and use to identify the organism.