this is for holding javascript data
Jennifer Shelton edited introduction.tex
over 8 years ago
Commit id: 4090d7b1936688f8b52050dc88b9cd2c37fe2726
deletions | additions
diff --git a/introduction.tex b/introduction.tex
index 4f61154..c8a4ac6 100644
--- a/introduction.tex
+++ b/introduction.tex
...
MSG: Each line of the fasta entry must be the same length except the last. Line above #5 'CTAGAGCGCAGCTCTGGGGG..' is 61 != 86 chars...
\end{verbatim}
EMBOSS
Seqret seqret was designed as a very flexible tool to convert from one properly formatted file to another properly but distinctly formatted file. It also was designed to accept poorly formatted data (e.g. a FASTA missing the final new line that is improperly wrapped) and export a reformatted file (e.g. wrapped after 60 bases with a final newline).
After submitting an inconsistently wrapped FASTA record that is missing a final new line character, seqret produced a properly formatted FASTA record.
Code:
...
TATATATATATTGCGCTCTCGTCTCCT
\end{verbatim}
However,
Seqret did seqret does not log the detected errors in the format. Another feature of Seqret is that an output file is created even if the output is identical to the input. Storing two identical files is an inefficient use of disk space. Seqtk \cite{Li2013} is another example of a tool that can automate FASTA reformatting but does not first check original format or report format issues.
Restarting The process of restarting analysis manually after wrapping a FASTA file may only take
minutes but the issue minutes. The time consuming aspect of this interruption is
how long the time it takes the analyst to become
available. available and the number of jobs this step must be repeated for. Likewise, storage
of one extra FASTA file is trivial unless the FASTA files in question store whole genomes in which case the burden can add up for a bioinformatics core. Efficiency and automation
a are crucial as
bioinformatics bioinformatic analysis projects become more numerous and time consuming. Many tools can either detect a format issue or repair a format issue. No existing tool was found that both validates FASTA format and reformats automatically only where required for a user defined list of non-fatal FASTA format issues.