Data Files
A genome Assembly release will comprise the following data file formats:
- FASTA contigs, scaffolds, superscaffolds/chromosome (compressed)
- FASTQ Read files RAW data (compressed)
- BAM alignment
- AGP Coordinate system translation
- GFF3 Feature Format File features, liftover
- YAML Data registry metadata file
- TAB patches
- Common indices (bwa, bowtie2, blast, ...)
- Naming conventions file names
- Naming convention for entities
- Patch files, patch scripts
- BSGenome packages