I acquired the Citibike data which is available here for the specific months of January 2016 and April 2015 in order to account for some of the seasonal as well as temporal effects. There are many variables available but I kept just the user-type and the trip-duration columns as those were the only ones relevant to my analysis. Also, I group the data based on user type in order to separate the 2 groups and do further analysis.