Authorea

Guido Uguzzoni edited untitled.tex almost 9 years ago

Commit id: 6051cd17102be780419c6c3ee8f695742dd61fed


We are left with TOT Pfam ... We compute the DCA predictions, using the pseudo-likelihood approach (see details in []), over the Pfam multiple sequence alignments (MSAs) of all the selected domains.

\section{Preliminary Results}

We select all pairs of residues that DCA predicts to have a strong co-evolutionary signal but that are not in contact according to the solved intra-domain structures (intra-chain false positives). We then test whether these pairs can be explained by inter-chain contacts. Figure () shows the inter-chain true positive rate of the residue pairs that are not in contact in the domain structure (intra-chain false positives). (Comment)

The distribution of the absolute rank of the first intra-domain false positive pair across the different families (shown in figure ()) is highly heterogeneous, for several reasons. First, the rate of correct predictions depends on the quality of the MSA and scales with the length of the protein. Moreover, the diversity of architectures within the domain families and the variety of possible inter-chain binding modes affect the relative magnitude of the intra- and inter-domain co-evolutionary signals.

We can select the intra-chain false positives more reliably by looking at the value of the DCA score, which is more meaningful than the absolute rank of the pairs when comparing different Pfam families. Figures (), () and () summarize the results of this analysis. (comment)
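As an illustration of the selection step described in this section, the following is a minimal sketch and not the pipeline actually used for the results. It assumes that, for one family, three $L \times L$ arrays are available: the DCA scores, the minimal intra-chain distances and the minimal inter-chain distances between residue pairs. The function name, the 8 \AA{} contact cutoff, the score threshold and the minimal sequence separation are all hypothetical choices made for the example.

```python
import numpy as np


def inter_chain_tp_rate(scores, intra_dist, inter_dist,
                        score_threshold, cutoff=8.0, min_sep=5):
    """Fraction of intra-chain false positives that are inter-chain contacts.

    All arguments and default values are illustrative assumptions:
    scores      -- (L, L) DCA scores, upper triangle is read
    intra_dist  -- (L, L) minimal intra-chain distances, upper triangle is read
    inter_dist  -- (L, L) minimal inter-chain distances (any orientation)
    """
    L = scores.shape[0]
    # Residue pairs (i, j) with j - i >= min_sep.
    i, j = np.triu_indices(L, k=min_sep)

    # Pairs with a strong co-evolutionary signal, selected on the score value.
    strong = scores[i, j] >= score_threshold

    # Intra-chain false positives: strong pairs not in contact in the domain.
    false_pos = strong & (intra_dist[i, j] > cutoff)
    if not false_pos.any():
        return float("nan")

    # A pair counts as an inter-chain contact if either orientation
    # (i on one chain and j on the other, or the reverse) is within the cutoff.
    sym_inter = np.minimum(inter_dist, inter_dist.T)
    inter_contact = sym_inter[i, j] <= cutoff

    return float(inter_contact[false_pos].mean())
```

Selecting on the score value rather than on the rank follows the remark above that the DCA score is more comparable across families; a rank-based variant would only change the line that defines \texttt{strong}.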