Richard Smith-Unna edited abstract.mediawiki  about 10 years ago

Commit id: 1d8d3ccc4a63b3a868032c6035fa297dea9bb1fd

deletions | additions      

       

## Abstract  Improvements in short-read sequencing technology combined with rapidly decreasing prices have enabled the use of RNA-seq to assay the transcriptome of species whose genome has not been sequenced. *De-novo* transcriptome assembly attempts to reconstruct the original transcript sequences from short reads. Transcriptome assemblies are relied upon for gene expression studies, phylogenetic analyses, and molecular tooling. It is therefore important to ensure that assemblies are as accurate as possible, but to date there are no tools for deep quality assessment of assemblies. We present **Transrate**, an open source command-line program and library implemented in the Ruby and C languages that automates deep analysis of assembly quality. Transrate evaluates assemblies based on contig metrics, read mapping, comparison to reference species, and network analysis. We demonstrate using both published and simulated data that using Transrate identifies the strengths and weaknesses of different assembly strategies and enables informed optimisation of assembly pipelines.