2.4 Data clustering and multiple linear regression analysis
Data were clustered with the k-means algorithm on the Weka platform, with the number of clusters chosen by the elbow method. Multiple linear regression (MLR) analysis was carried out using the lm function in R to assess the relationship between the outcome variable, ΔΔG, and the predictor variables, namely the individual components that contribute to the total free energy calculation, under the assumption that this relationship is approximately linear.
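For readers who wish to reproduce the logic of these two steps outside Weka and R, the following is a minimal Python sketch and not the workflow used in this study. It assumes plain Lloyd's k-means and ordinary least squares via the normal equations. The function names (sq_dist, kmeans_wcss, fit_mlr) are hypothetical, and the toy data are synthetic and do not correspond to any ΔΔG values reported here.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans_wcss(points, k, iters=100, seed=0):
    """Run Lloyd's k-means and return the within-cluster sum of squares."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # k distinct data points as initial centers
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: sq_dist(p, centers[j]))
            clusters[nearest].append(p)
        # Recompute each center as the mean of its cluster (keep it if empty).
        new_centers = [
            tuple(sum(c) / len(cl) for c in zip(*cl)) if cl else centers[j]
            for j, cl in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return sum(min(sq_dist(p, c) for c in centers) for p in points)


def fit_mlr(X, y):
    """Ordinary least squares; returns [intercept, b1, b2, ...]."""
    rows = [[1.0] + list(r) for r in X]  # prepend an intercept column
    n = len(rows[0])
    # Augmented normal-equation matrix [X'X | X'y].
    A = [
        [sum(r[i] * r[j] for r in rows) for j in range(n)]
        + [sum(r[i] * yi for r, yi in zip(rows, y))]
        for i in range(n)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(n):
            if r != col:
                f = A[r][col] / A[col][col]
                A[r] = [a - f * b for a, b in zip(A[r], A[col])]
    return [A[i][n] / A[i][i] for i in range(n)]


if __name__ == "__main__":
    # Synthetic toy data: two well-separated groups of points.
    pts = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    # Elbow method: inspect how WCSS falls as k grows and pick the bend.
    for k in range(1, 5):
        print(k, round(kmeans_wcss(pts, k), 3))

    # Synthetic predictors and an outcome that is exactly linear in them.
    X = [(0, 0), (1, 0), (0, 1), (1, 1), (2, 1)]
    y = [1 + 2 * x1 - 3 * x2 for x1, x2 in X]
    print(fit_mlr(X, y))
```

In this sketch the elbow is read off by eye from the printed WCSS values. The regression step fits the same kind of model as a call of the form lm(y ~ x1 + x2) in R, but it omits the standard errors and significance tests that lm reports.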