Methodology 
First, we only keep the desired data columns and remove all the unwanted one. The data frame after preprocessed contains tripduration, start time and usertype, as shown in Figure 3 below.