Authorea

Jenna M. Lang edited Introduction.md almost 10 years ago

Commit id: 625ea738dc1674931ffcc7a636083c9a840d7487

deletions | additions

#Introduction The objective of the present study was to design, test, troubleshoot, and publish a comprehensive workflow for microbial genome sequencing; enabling even a lab with limited resources and bioinformatics experience to perform it. In the fall of 2011, with support from the Alfred P. Sloan foundation, our lab began an undergraduate research project attempting to sequence microbial reference genomes from the built environment. Several students went through the process of isolating, identifying, sequencing and assembling microbial genomes, followed by submission to NCBI and publication of each genome \cite{Lo_2013}\cite{Bendiks_2013}\cite{Flanagan_2013}\cite{Diep_2013}\cite{Coil_2013}\cite{Holland_Moritz_2013}. Through the course of this project we found that while many of the steps have become much easier in recent years (e.g. kits for library prep, relatively cheap sequencing, bioinformatics pipelines), there were still a significant number of stumbling blocks. In addition to these hurdles, some portions of the project involve choosing between a wide variety of options (e.g. choice of assembly program) which can create a large activation energy for a lab without a bioinformatician. Each option comes with its own advantages and disadvantages in terms of complexity, expense, computing power, time, and experience required. In this workflow we have chosen one path through these choices, allowing a researcher to go from a swab to a published paper. We used this workflow to process a novel Tatumella sp. isolate and publish the genome \cite{Dunitz_2014}. The data from every step of the workflow, using this Tatumella isolate, is available on Figshare (REF DOI)