Figure legends
Figure 1 t-SNE plot of genome embeddings. Genomes of different species were shown with different colors and the submitting date of different SARS-CoV-2 isolates were indicated with different gray intensity. SARS2: SARS-CoV-2; SARS: SARS-CoV; MERS: MERS-CoV; BCov: bat coronavirus; PCov: pangolin coronavirus.
Figure 2 Mutations in the 5’UTR of SARS-CoV-2 genomes. A. Sequence logo of part of 5’UTR. B&C. Predicted secondary structure of two alleles of 5’UTR. The negative number showed the distance to ORF1ab start codon and the mutated site was showed as indicated.
Figure 3 Mutations of SARS-CoV-2 proteins. For all the analyzed genomes, the encoded proteins were deduced and most frequently mutated sites were shown as indicated in each protein.
Figure 4 Mutation mapping of the spike protein. The mutation sites on the ribbon representation of SARS-CoV-2 S proteins (PDB ID: 6vsb) were shown as indicated.
All these mutations of the S protein were frequently happeded in early submitted SARS-CoV-2 genomes.