3.4 Evaluation and repeat annotation
The final data set contained 889 complete BUSCO groups (90.90%) and 423
CEGMA groups (92.36%). The mapping ratio for Illumina data reached
97.45%. Together, the BUSCO and CEGMA results and the Illumina mapping
ratio support the high quality of the Asian clam genome that we
assembled. More detailed information is provided in Supporting
Information Table S3. More than 1.06 Gb of genomic sequence was
identified and marked as repeats, representing 69.66% of the total
genomic sequence (Table 2). LARDs were the predominant repeat type,
accounting for approximately 608.85 Mb (57.54%). Other abundant repeat
types were PLEs (12.38%), TIRs (10.46%), and LINEs (7.07%)
(Supporting Information Table S4).
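
As a consistency check on the figures above, the following minimal sketch back-calculates the total implied by each reported count and percentage. It uses only the numbers given in this section. The helper name implied_total and the variable names are hypothetical, introduced here for illustration, and are not part of the authors' pipeline.

```python
def implied_total(part: float, percent: float) -> float:
    """Return the total implied by a part and its percentage of that total."""
    if percent <= 0:
        raise ValueError("percent must be positive")
    return part / (percent / 100.0)


# Values reported in this section (counts, Gb, or Mb as noted).
busco_total = implied_total(889, 90.90)      # BUSCO groups searched
cegma_total = implied_total(423, 92.36)      # CEGMA groups searched
genome_gb = implied_total(1.06, 69.66)       # total genomic sequence, in Gb
lard_base_mb = implied_total(608.85, 57.54)  # base the LARD percentage refers to, in Mb

print(round(busco_total), round(cegma_total))
print(round(genome_gb, 2), round(lard_base_mb, 1))
```

Under these figures, the implied totals are about 978 BUSCO groups, 458 CEGMA groups, and roughly 1.52 Gb of total genomic sequence. The base implied for the LARD percentage is about 1,058 Mb, which suggests that 57.54% may be expressed relative to the repeat content rather than the whole genome, although the text does not state this.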