Residue-wise comparison in thermodynamic reversibility accuracy for different structural-based stability predictors. Comparison for the average \(\Delta\Delta G_R\) for each residue present in the dataset for (A) FEPc versus FEPa, (B) FEPc versus FoldX and (C) Rosetta versus FoldX.