Compare the three proteins and show how much the span of the input data matters: without a wide enough range of inputs, performance is poor (e.g. for TEM-1, the fit was bad until we sampled randomly from the entire dataset instead of from just one part of it).
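
A minimal sketch of the range effect, assuming synthetic data rather than any of the three protein datasets. The curve (`tanh` plus noise), the cubic polynomial model, and the names `fit_and_score`, `narrow_idx`, and `wide_idx` are all hypothetical stand-ins chosen for illustration; this is not the fitting procedure used for TEM-1. It trains the same model twice with the same number of points, once on a contiguous slice of the input range and once on a random draw from the whole range, and scores both on the full dataset.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for an input -> measurement relationship (NOT real protein data).
x = np.linspace(-3.0, 3.0, 600)
y = np.tanh(x) + rng.normal(scale=0.05, size=x.size)


def fit_and_score(train_idx, degree=3):
    """Fit a polynomial on the training subset; return RMSE over the whole dataset."""
    coeffs = np.polyfit(x[train_idx], y[train_idx], degree)
    pred = np.polyval(coeffs, x)
    return float(np.sqrt(np.mean((pred - y) ** 2)))


n_train = 100

# Case 1: training points come from only the lowest part of the input range.
narrow_idx = np.arange(n_train)

# Case 2: the same number of points, drawn at random from the entire range.
wide_idx = rng.choice(x.size, size=n_train, replace=False)

rmse_narrow = fit_and_score(narrow_idx)
rmse_wide = fit_and_score(wide_idx)

print(f"trained on a narrow slice:   RMSE = {rmse_narrow:.3f}")
print(f"trained on the full range:   RMSE = {rmse_wide:.3f}")
```

The training set size is held fixed so that any difference between the two scores comes from the range covered by the inputs, not from the amount of data. The narrow-slice model has to extrapolate far outside what it saw, which is where the error comes from.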