3.4 Evaluation and repeat annotation
The final assembly contained 889 complete BUSCO groups (90.90%) and 423 CEGMA groups (92.36%), and the mapping ratio for the Illumina data reached 97.45%. Together, the BUSCO, CEGMA, and Illumina mapping results support the high quality of the assembled Asian clam genome. Detailed information is provided in Supporting Information Table S3. More than 1.06 Gb of genomic sequence was identified and annotated as repeats, representing 69.66% of the total genomic sequence (Table 2). LARDs were the predominant repeat type, accounting for approximately 608.85 Mb (57.54%). Other abundant repeat types were PLEs (12.38%), TIRs (10.46%), and LINEs (7.07%) (Supporting Information Table S4).
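The reported percentages can be cross-checked with simple arithmetic. The following Python sketch is illustrative only and is not part of the analysis pipeline: the helper names `percent` and `implied_total` are hypothetical, and the reference set sizes (978 BUSCO groups, 458 CEGMA groups) are not stated in the text but are inferred here from the reported counts and percentages. The denominator of the LARD percentage is likewise not stated; the sketch assumes it is the total repeat content.

```python
def percent(part: float, whole: float) -> float:
    """Return `part` as a percentage of `whole`, rounded to two decimals."""
    return round(100.0 * part / whole, 2)


def implied_total(part: float, pct: float) -> float:
    """Return the total implied by a count and the percentage it represents."""
    return part / (pct / 100.0)


# Values reported in the text.
busco_complete, busco_pct = 889, 90.90
cegma_found, cegma_pct = 423, 92.36
repeat_gb, repeat_pct = 1.06, 69.66
lard_mb = 608.85

# Reference set sizes inferred from the reported figures (an assumption,
# not stated in the text).
busco_total = round(implied_total(busco_complete, busco_pct))
cegma_total = round(implied_total(cegma_found, cegma_pct))

# Recompute the completeness percentages from the inferred set sizes.
busco_check = percent(busco_complete, busco_total)
cegma_check = percent(cegma_found, cegma_total)

# Genome size implied by the repeat content. Approximate, because the
# text gives the repeat content only as "more than 1.06 Gb".
implied_genome_gb = implied_total(repeat_gb, repeat_pct)

# LARD share, assuming the denominator is the total repeat content rather
# than the whole genome (the text does not state which).
lard_share_of_repeats = percent(lard_mb, repeat_gb * 1000.0)

print(busco_total, busco_check)
print(cegma_total, cegma_check)
print(round(implied_genome_gb, 2), lard_share_of_repeats)
```

Because the repeat total is reported only as a rounded lower bound, the recomputed LARD share should be expected to agree with the reported 57.54% only approximately.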